Archival Resource Key (ARK) identifiers are persistent URLs designed to support long-term access to information objects. Introduced in 2001, ARK identifiers were designed to identify objects of any type:
ARKs are assigned for a variety of reasons:
Some advantages of ARKs:
The type of a URL-based identifier can often be spotted by how the URL starts, but that’s not true for ARKs, which are spotted by an internal “ark:” label that comes after the URL hostname. For example, here is an ARK,
that gets you to a dissertation. ARKs are high-functioning identifiers that lead you to things and to descriptions of those things. For example, adding ‘?’ on the end of the above ARK should get you to its description:
A common internet identifier is a URL, or part of a URL. For example, this core ARK identifier,
appears inside two different URLs (Uniform Resource Locators, also known as web links or web addresses):
ARKs are especially good at being persistent identifiers (PIDs).
Why do we need persistent identifiers? Websites and databases change. As we redesign them or migrate to new systems, the links to our resources break, and the citations and links that others have recorded produce the dreaded “404 Not Found” error. Irritating as that may be for any reader, it is politically awkward when the missing resource is publicly funded research, and it is a cultural disaster for libraries, archives, museums, and other memory organizations.
Among the many links that can or once could lead you to things, a persistent identifier is a link that in principle keeps working far into the future. Services that provide discovery and interlinking (such as between research articles, authors, supporting data, and related research) prefer persistent identifiers because of that stability.
Persistent identifiers should keep working even as websites and databases change. Normally when resources move, everyone who ever recorded the old links would need to be told what the new links are, which is next to impossible. With ARKs and the systems and tools that support them, the advertised link stays the same and only the destination it forwards to is updated.
A resolver is a system that specializes in forwarding incoming identifiers (the ones originally advertised to users) to whichever websites are currently best able to deal with them. The overall forwarding process is called resolution; a single forwarding step within that process is called a redirection.
For a resolver to work, its hostname (the n2t.net or ark.bnf.fr in the identifiers above) must be carefully chosen so it won’t ever need to be changed. Memory organizations, some of them centuries old, tend to have hostnames well-suited to be resolvers. Some well-known, younger resolvers are n2t.net (the ARK resolver), identifiers.org, doi.org, handle.net, and purl.org.
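The relationship between resolution and redirection can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the forwarding table, the `resolve` function, and every hostname ending in `.example` are hypothetical, and a real resolver would answer HTTP requests rather than look things up in a dictionary.

```python
from typing import Dict, List

# Hypothetical forwarding table. Each key is a URL that some service has
# advertised or handed out; each value is where that URL currently forwards.
FORWARDING: Dict[str, str] = {
    "https://resolver.example/ark:/12345/x54xz321":
        "https://library.example/ark:/12345/x54xz321",
    "https://library.example/ark:/12345/x54xz321":
        "https://repo.library.example/items/8842",
}


def resolve(identifier: str, table: Dict[str, str], max_steps: int = 10) -> List[str]:
    """Follow redirections until no forwarding rule applies.

    Returns every URL visited, starting with the advertised identifier.
    The whole walk is one resolution; each hop is one redirection.
    """
    path = [identifier]
    while path[-1] in table:
        if len(path) > max_steps:
            raise RuntimeError("too many redirections (possible loop)")
        path.append(table[path[-1]])
    return path


if __name__ == "__main__":
    for url in resolve("https://resolver.example/ark:/12345/x54xz321", FORWARDING):
        print(url)
```

In this sketch the advertised identifier never changes. When the object moves, only an entry in the table is edited, which is why the resolver's own hostname is the one thing that has to stay stable.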
An ARK is represented by a sequence of characters that contains the label, “ark:”. When embedded in a URL, the ARK is preceded by the protocol (https://) and the name of a service that provides support for that ARK. That service name, or the “Name Mapping Authority” (NMA), is mutable and replaceable, as neither the web server itself nor the current web protocols are expected to last longer than the identified objects. The immutable, globally unique identifier follows the “ark:” label. It includes a “Name Assigning Authority Number” (NAAN) identifying the naming organization, followed by the name that the organization assigns to the object.
Here is a diagrammed example:
  https://example.org/ark:/12345/x54xz321/s3/f8.05v.tiff
  \_________________/ \__/ \___/ \______/\____/\_______/
           |           |     |      |       |      |
           |       ARK Label |      |  Sub-parts Variants
           |                 |      |
Name Mapping Authority (NMA) | Assigned Name
                             |
          Name Assigning Authority Number (NAAN)
A peek at ARK anatomy. You can spot an ARK by its internal label.
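The anatomy above can also be taken apart programmatically. The following Python sketch is not an official parser: the names `ArkParts`, `parse_ark`, and `core_ark` are hypothetical, and the splitting rules are simplifying assumptions read off the diagram (the assigned name ends at the first "/", variants begin at the first "."). The ARK specification covers the full syntax.

```python
import re
from typing import NamedTuple

# Assumed shape: an optional "scheme://host/" prefix (the NMA), the "ark:"
# label with or without a trailing "/", then the NAAN, "/", and the rest.
_ARK_RE = re.compile(
    r"^(?:(?P<nma>[a-z][a-z0-9+.-]*://[^/]+)/)?ark:/?(?P<naan>[^/]+)/(?P<rest>.+)$",
    re.IGNORECASE,
)


class ArkParts(NamedTuple):
    nma: str       # Name Mapping Authority; "" when the ARK is not in a URL
    naan: str      # Name Assigning Authority Number
    name: str      # name assigned by the naming organization
    subparts: str  # hierarchical sub-parts, including the leading "/"
    variants: str  # variants, including the leading "."


def parse_ark(text: str) -> ArkParts:
    """Split an ARK, or a URL containing one, into its labelled parts."""
    match = _ARK_RE.match(text.strip())
    if match is None:
        raise ValueError(f"no 'ark:' label found in {text!r}")
    # Simplification: everything from the first "." onward is a variant.
    base, dot, variant_tail = match["rest"].partition(".")
    # Simplification: the assigned name ends at the first "/".
    name, slash, sub_tail = base.partition("/")
    return ArkParts(match["nma"] or "", match["naan"], name,
                    slash + sub_tail, dot + variant_tail)


def core_ark(text: str) -> str:
    """Return the ARK without its replaceable NMA prefix."""
    p = parse_ark(text)
    return f"ark:/{p.naan}/{p.name}{p.subparts}{p.variants}"


if __name__ == "__main__":
    print(parse_ark("https://example.org/ark:/12345/x54xz321/s3/f8.05v.tiff"))
```

Because `core_ark` discards the hostname, two URLs that carry the same ARK through different services compare equal once reduced to their core form.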
More details about ARK structure and syntax are available later in this guide.
While many examples in this guide refer to resources in digital repositories, an ARK can be assigned to anything digital, physical, or abstract. That can include things that don’t yet exist but to which you need to refer from objects that you’re in the process of creating or planning, such as a link from a draft article to a dataset under preparation, or a link from an archived digital letter to a planned finding aid. One caution is that you should generally assign ARKs to things that you own, control, or manage. Assigning ARKs to things you don’t control is discouraged because such identifiers tend to be fragile.
Examples of things that have ARKs are listed below. Numbers are approximate, current as of September 2020, and self-reported by the identified ARK organizations.
By itself, assigning an ARK to a thing will not do anything to guarantee persistence. You’ll need to use the tools and services associated with ARKs to maintain a record of the ARKs you’ve assigned, the things they represent, and the current live URLs for those things. If the things or URLs change, you’ll need to update that information so that persistent URLs continue to work.
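The record keeping described above might look like the following Python sketch. It is a toy, in-memory illustration only: `ArkRecord`, `ArkRegistry`, and the `.example` URLs are hypothetical names, and real ARK tools and services store this information durably and expose it through a resolver.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class ArkRecord:
    ark: str         # the assigned ARK; never changes once assigned
    what: str        # short description of the thing identified
    target_url: str  # current live URL; expected to change over time


class ArkRegistry:
    """Toy record of assigned ARKs and where each one currently lives."""

    def __init__(self) -> None:
        self._records: Dict[str, ArkRecord] = {}

    def assign(self, ark: str, what: str, target_url: str) -> ArkRecord:
        # Refuse to reuse an ARK: reassignment would break persistence.
        if ark in self._records:
            raise ValueError(f"{ark} is already assigned")
        record = ArkRecord(ark, what, target_url)
        self._records[ark] = record
        return record

    def move(self, ark: str, new_url: str) -> None:
        # The thing moved: update the target, leave the ARK untouched.
        self._records[ark].target_url = new_url

    def lookup(self, ark: str) -> str:
        return self._records[ark].target_url


if __name__ == "__main__":
    registry = ArkRegistry()
    registry.assign("ark:/12345/x54xz321", "a dissertation",
                    "https://old-site.example/theses/17")
    registry.move("ark:/12345/x54xz321", "https://new-site.example/etd/17")
    print(registry.lookup("ark:/12345/x54xz321"))
```

The design point is that the maintenance burden falls on one record per ARK rather than on everyone who ever cited the object.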
All identifier systems are subject to the same weaknesses:
Given how little the schemes do for you, when choosing one you’ll likely want to consider factors such as cost, risk, and openness.