Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. Managementsamenvatting
  2. Probleemstelling
  3. Eisen en randvoorwaarden
  4. Mogelijke oplossingen..... 7Conclusies .... 12
  5. Conclusie
  6. Wiki Markup
    Literatuur ...... 13
    [Literatuur|#Literatuur}
    [Bijlage 1 Begrippen...... 14
    |#Bijlage1]
    [Bijlage 2. Voorstel voor gefaseerde implementatie URN:NBN in DARE-verband... . 22|#Bijlage2]
    \\
    \\
    \\
    \| Status versie 0.6: besproken en goedgekeurd in de vergadering van de Repository Managers
    \\
    d.d. 5 april 2007, DARE, SURF, Utrecht.
    \\
    Wijzigingen:
    \\
    1. In het schema: "Huidige Identifiers voor "information assets"" (pg 5) zijn wijzigingen
    \\
    opgenomen voor de Technische Universiteit Eindhoven en de Rijks Universiteit Groningen.
    \\
    2. Op pg 7 is een betere verwijzing opgenomen naar: Hans-Werner Hilse and Jochen Kothe, Implementing Persistent Identifiers \[Hilse/Kothe, 2006\].
    \\
    3. Op de kaft is het DARE-logo opgenomen \|
    \\
    \\
    *Werkgroep*
    De werkgroep _persistente identifi{_}ers was als volgt samengesteld:
    Niels Rozenboom, Universiteit Leiden
    Maarten Hoogerwerf, DANS
    Peter Ruijgrok, Universiteit Utrecht Laurens Sesink, DANS
    Hubert Krekels, Wageningen Universiteit Maurice Vanderfeesten, DARE
    Marlon Domingus, Universiteit Leiden (vz)
    \\
    \\
    \\

...

1. Er dient een unieke identifier te zijn voor elke metadata-record dan wel een jump off page. Deze identifier dient zowel te identificeren als toegang te verschaffen tot de resource (in dit geval zijnde de metadata)
2. Er dient per bitstream een unieke identifier te zijn. Deze identifier dient zowel te identificeren als toegang te verschaffen tot de resource (in dit geval zijnde de bitstream)
3. De unieke identifier heeft een syntax die verhuizingen van de resource (zowel metadata als bitstream(s)) toestaat zonder dat een verhuizing om redenen van identificatie of resolving een wijziging van de syntax van de identifier vereist.
4. de resolving van de identifier, dus het toegang verlenen tot de betreffende resource (zowel metadata als bitstream(s)) dient op generieke wijze te geschieden.
5. Om na verhuizing van een combinatie van metadata-record en bijbehorende
bitstream(s) de resolving goed te laten plaatsvinden is er een resolver buiten de
specifieke in DARE participerende partijen vereist, die de rol van nationale resolver vervult voor DARE-doeleinden. Zie ook een schematische voorstelling van deze situatie hieronder.

Anchor
Oplossingen
Oplossingen

4. Mogelijke oplossingen

Wiki Markup
\\
Technische gezien komen eigenlijk alleen twee oplossingen in aanmerking voor het
beschreven probleem en de bijbehorende randvoorwaarden: URN en DOI/Handle.
Hieronder wordt geschetst hoe deze oplossingen zouden kunnen werken en wat de sterke / zwakke punten zijn van de oplossing.(Zie voor een zeer uitgebreid en volledig overzicht van verschillende benaderingen van implementatie van persistente identifiers ook: Hans-Werner Hilse and Jochen Kothe, Implementing Persistent Identifiers \[Hilse/Kothe, 2006\].)

...

Deze optie gaat uit van een Third party Registration Agency, zoals CrossRef. Het voordeel is dat met een dergelijke RA al het werk uit handen wordt genomen. CrossRef zorgt namelijk dat je op eenvoudige wijze een persistent-identifier aan je wetenschappelijke content kunt koppelen. Crossref kan worden gezien als een betrouwbare en ervaren organisatie die DOI's registreren. Verder biedt deze organisatie services, zoals het gebruik van OpenURL, reference linking en biedt het een doorzoekbare metadata database. Het nadeel is dat we er ook naar betalen. RA's zijn vrij om hun eigen prijs te bepalen. RA's betalen op hun beurt een paar cent per DOI naam aan de IDF. CrossRef rekent voor local hosting $20.000, plus jaarlijks $0.06 - $1.00 per DOI.
Wil een repository zich aansluiten bij CrossRef dan zijn een van de eisen van CrossReff dat de repository de eigenaar is van het artikel. Dit is vaak niet het geval met Open Access, bij uitgevers vaak juist wel. Verder gaat CrossReff uit van uit van full-text werken en niet van datasets, video's, audio bestanden etc.. Het is overigens de vraag of CrossRef zal toestaan dat Voor DSpace gebruikers is het overigens relatief eenvoudig handles om te zetten naar DOI door een DOI-prefix / DSpace-prefix / DSpace-suffix. http://hdl.handle.net/1887/4550 wordt bijvoorbeeld http://dx.doi.org/10.1575/1887/4550.

Anchor
Conclusie
Conclusie

Conclusie

Deze optie heeft voordelen: er is een bestaande technische infrastructuur en er is een
organisatie die de relevante diensten kan beheren en als toekennende instantie kan
functioneren. Vraag is echter of alle gewenste resources (zowel metadata als digitale objecten)
een doi toegekend zullen kunnen krijgen en wat de mate van verwarring bij de eindgebruiker is als aan hetzelfde digitale object meerdere doi's worden toegekend (bijvoorbeeld een vanwege een uitgever en een andere vanwege een repositorium).

...

Tenslotte beveelt de werkgroep aan om na een nationale oplossing samen met Duistland en eventuele andere landen te kijken naar de mogelijkheid van een global URN:NBN-resolver.

Anchor
Literatuur
Literatuur

6. Literatuur

Wiki Markup
\[Hakala, 2003\] Hakala, Juha, (NBN), _Identification of Electronic Resources_, 2003, [http://metadata-wg.mannlib.cornell.edu/forum/2003-01-29/juhahakala.ppt/]<span style="color: #0000ff"><a href="http://metadata-wg.mannlib.cornell.edu/forum/2003-01-29/juhahakala.ppt">http://metadata-wg.mannlib.cornell.edu/forum/2003-01-29/juhahakala.ppt</a></span>\\
\\
\[Hoogerwerf, 2006\] Hoogerwerf, Maarten, _Persistent Identification and linking of Datasets_. Student Thesis, 2006

...

Tim Berners-Lee, Roy T. Fielding, Larry Masinter. (January 2005). "Uniform Resource Identifier (URI): Generic Syntax". Internet Society. RFC 3986; STD 66. http://gbiv.com/protocols/uri/rfc/rfc3986.htmlhttp://gbiv.com/protocols/uri/rfc/rfc3986.html


























Anchor
Bijlage1
Bijlage1

Bijlage 1 Begrippen

Wiki Markup
\\
\\
*Harvester* \[OAIPMH-definitie\]
\\
A harvester is a client application that issues OAI-PMH requests. A harvester is operated by a service provider as a means of collecting metadata from <span style="color: #0000ff">repositories</span>.
\\
Zie: OAI-PMH 2.0: [http://www.openarchives.org/oai/openarchivesprotocol.html#definitionsconcepts/]<span style="color: #0000ff"><a href="http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts">http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts</a></span>\\
\\
*Repository* \[OAIPMH-definitie\]
\\
A repository is a network accessible server that can process the 6 OAI-PMH requests in the manner described in this document. A repository is managed by a data provider to expose metadata to <span style="color: #0000ff">harvesters</span>. To allow various repository configurations, the OAI-PMH distinguishes between three distinct entities related to the metadata made accessible by the  OAI-PMH.
\\
\*resource - A resource is the object or "stuff" that metadata is "about". The nature of a   resource, whether it is physical or digital, or whether it is stored in the repository or is a constituent of another database, is outside the scope of the OAI-PMH.
\*item - An <span style="color: #0000ff">item</span> is a constituent of a repository from which metadata about a resource   can be disseminated. That metadata may be disseminated on-the-fly from the associated resource, cross-walked from some canonical form, actually stored in the   repository, etc.
\*record - A <span style="color: #0000ff">record</span> is metadata in a specific <span style="color: #0000ff">metadata format</span>. A record is returned as an   XML-encoded byte stream in response to a protocol request to disseminate a specific   metadata format from a constituent item.
\\
Zie: OAI-PMH 2.0: [http://www.openarchives.org/oai/openarchivesprotocol.html#definitionsconcepts/]<span style="color: #0000ff"><a href="http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts">http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts</a></span>\\
*Item* \[OAIPMH-definitie\]
\\
An item is a constituent of a repository from which metadata about a resource can be disseminated. An item is conceptually a container that stores or dynamically generates metadata about a single resource in multiple formats, each of which can be harvested as <span style="color: #0000ff">records</span> via the OAI-PMH. Each item has an <span style="color: #0000ff">identifier</span> that is unique within the scope of the repository of which it is a constituent.
\\
Zie: OAI-PMH 2.0: [http://www.openarchives.org/oai/openarchivesprotocol.html#definitionsconcepts/]<span style="color: #0000ff"><a href="http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts">http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts</a></span>\\
\\
\\
*Unique Identifier* \[OAIPMH-definitie\]
\\
A unique identifier unambigiously identifies an item within a repository; the unique identifier is used in OAI-PMH requests for extracting metadata from the item. Items
\\
\\
\\
\\
\\
\\
may contain metadata in <span style="color: #0000ff">multiple formats</span>. The unique identifier maps to the item, and all possible <span style="color: #0000ff">records</span> available from a single item share the same unique identifier.
The format of the unique identifier must correspond to that of the <span style="color: #0000ff">URI (Uniform Resource</span>
<span style="color: #0000ff">Identifier)</span> syntax. Individual communities may develop community-specific <span style="color: #0000ff">URI</span> schemes for coordinated use across repositories. The scheme component of the unique identifiers must not correspond to that of a recognized URI scheme unless the identifiers conform to that  scheme. Repositories may implement the <span style="color: #0000ff">oai-identifier</span> syntax described in the accompanying <span style="color: #0000ff">Implementation Guidelines</span> document.
\\
Unique identifiers play two roles in the protocol:
\\
1.  Response: Identifiers are returned by both the <span style="color: #0000ff">ListIdentifiers</span> and <span style="color: #0000ff">ListRecords</span> requests.
2.  Request: An identifier, in combination with a <span style="color: #0000ff">metadataPrefix</span>, is used in the <span style="color: #0000ff">GetRecord</span>
request as a means of requesting a <span style="color: #0000ff">record</span> in a specific metadata format from an item.
Note that the identifier described here is not that of a resource. The nature of a resource
identifier is outside the scope of the OAI-PMH. To facilitate access to the resource associated with harvested metadata, repositories should use an element in metadata records to establish a linkage between the record (and the identifier of its item) and the identifier (URL, URN, DOI, etc.) of the associated resource. The mandatory Dublin Core format provides the identifier  element that should be used for this purpose.
\\
Zie: OAI-PMH 2.0: [http://www.openarchives.org/oai/openarchivesprotocol.html#definitionsconcepts/]<span style="color: #0000ff"><a href="http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts">http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts</a></span>\\
\\
\\
*Record* \[OAIPMH-definitie\]
\\
A record is metadata expressed in a single format. A record is returned in an XML\-
encoded byte stream in response to an OAI-PMH request for metadata from an item. A record is identified unambigiously by the combination of the <span style="color: #0000ff">unique identifier</span> of the item from which the record is available, the metadataPrefix identifying the metadata format of the record, and the <span style="color: #0000ff">datestamp</span> of the record. The XML-encoding of records is organized into the  following parts:
\*_header_ \- contains the unique identifier of the item and properties necessary for
selective harvesting. The header consists of the following parts:
o   the <span style="color: #0000ff">unique identifier</span> \- the unique identifier of an item in a repository;
o   the <span style="color: #0000ff">datestamp</span> \- the date of creation, modification or deletion of the record for   the purpose of <span style="color: #0000ff">selective harvesting</span>.
o   zero or more <span style="color: #0000ff">setSpec</span> elements - the <span style="color: #0000ff">set</span> membership of the item for the   purpose of <span style="color: #0000ff">selective harvesting</span>.
o   an optional status attribute with a value of deleted indicates the withdrawal of
availability of the specified metadata format for the item, dependent on the repository support for <span style="color: #0000ff">deletions</span>.
\*_metadata_ \- a single manifestation of the metadata from an item. The OAI-PMH   supports items with multiple manifestations (formats) of metadata. At a minimum, repositories must be able to return records with metadata expressed in the <span style="color: #0000ff">Dublin Core</span> format, without any <span style="color: #0000ff">qualification</span>. Optionally, a repository may also disseminate other formats of metadata. The specific metadata format of the record to be
disseminated is specified by means of an argument - the <span style="color: #0000ff">metadataPrefix</span> \- in the
<span style="color: #0000ff">GetRecord</span> or <span style="color: #0000ff">ListRecords</span> request that produces the record. The <span style="color: #0000ff">ListMetadataFormats</span> request  returns the list of all metadata formats available from a repository, or for a specific item (which can be specified as an argument to the <span style="color: #0000ff">ListMetadataFormats</span> request).
\*_about_ \- an optional and repeatable container to hold data about the metadata part of   the record. The contents of an about container must conform to an XML Schema.   Individual implementation communities may create XML Schema that define specific   uses for the contents of about containers. Two common uses of about containers are:
o   \_rights statements+: some repositories may find it desirable to attach terms of use   to the metadata they make available through the OAI-PMH. No specific set of   XML tags for rights expression is defined by OAI-PMH, but the about   container is provided to allow for encapsulating community-defined rights   tags.
o _provenance statements_: One suggested use of the about container is to indicate   the provenance of a metadata record, e.g. whether it has been harvested itself   and if so from which repository, and when. An XML Schema for such a   provenance container, as well as some supporting information is available   from the accompanying <span style="color: #0000ff">Implementation Guidelines</span> document.
\\
\\
\\
Zie: OAI-PMH 2.0: [http://www.openarchives.org/oai/openarchivesprotocol.html#definitionsconcepts/]<span style="color: #0000ff"><a href="http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts">http://www.openarchives.org/OAI/openarchivesprotocol.html#DefinitionsConcepts</a></span>\\
\\
\\
*Persistent* \[OAIPMH-definitie\]
The repository maintains information about deletions with no time limit. A repository that
indicates this level of support must persistently keep track of the full history of deletions and consistently reveal the status of a deleted record over time.
\\
*Identifier* \[W3C-definitie\]
An identifier embodies the information required to distinguish what is being identified from all other things within its scope of identification.  Our use of the terms "identify" and "identifying" refer to this purpose of distinguishing one resource from all other resources, regardless of how that purpose is accomplished (e.g., by name, address, or context).  These terms should not be mistaken as an assumption that an identifier defines or embodies the identity of what is referenced, though that may be the case for some identifiers.  Nor should it be assumed that a system using URIs will access the resource identified: in many cases, URIs  are used to denote resources without any intention that they be accessed.  Likewise, the "one"  resource identified might not be singular in nature (e.g., a resource might be a named set or a  mapping that varies over time).
\\
Zie: Tim Berners-Lee, Roy T. Fielding, Larry Masinter. (January 2005). "Uniform Resource Identifier (URI): Generic Syntax". Internet Society. RFC 3986; STD 66. [http://gbiv.com/protocols/uri/rfc/rfc3986.html]<span style="color: #0000ff"><a href="http://gbiv.com/protocols/uri/rfc/rfc3986.html">http://gbiv.com/protocols/uri/rfc/rfc3986.html</a></span>

Wiki Markup
*Resource* \[W3C-definitie\]
\\
This specification does not limit the scope of what might be a resource; rather, the term
"resource" is used in a general sense for whatever might be identified by a URI.  Familiar
examples include an electronic document, an image, a source of information with a consistent purpose (e.g., "today's weather report for Los Angeles"), a service (e.g., an HTTP-to-SMS gateway), and a collection of other resources.  A resource is not necessarily accessible via the Internet; e.g., human beings, corporations, and bound books in a library can also be resources. Likewise, abstract concepts can be resources, such as the operators and operands of a mathematical equation, the types of a relationship (e.g., "parent" or "employee"), or numeric values (e.g., zero,one, and infinity).
\\
Zie: Tim Berners-Lee, Roy T. Fielding, Larry Masinter. (January 2005). "Uniform Resource Identifier (URI): Generic Syntax". Internet Society. RFC 3986; STD 66. [http://gbiv.com/protocols/uri/rfc/rfc3986.html]<span style="color: #0000ff"><a href="http://gbiv.com/protocols/uri/rfc/rfc3986.html">http://gbiv.com/protocols/uri/rfc/rfc3986.html</a></span>\\
\\
*URL* \[W3C-definitie\]
The term "Uniform Resource Locator" (URL) refers to the subset of URIs that, in addition to identifying a resource, provide a means of locating the resource by describing its primary access mechanism (e.g., its network "location").
Zie: Tim Berners-Lee, Roy T. Fielding, Larry Masinter. (January 2005). "Uniform Resource Identifier (URI): Generic Syntax". Internet Society. RFC 3986; STD 66. [http://gbiv.com/protocols/uri/rfc/rfc3986.html]<span style="color: #0000ff"><a href="http://gbiv.com/protocols/uri/rfc/rfc3986.html">http://gbiv.com/protocols/uri/rfc/rfc3986.html</a></span>\\
*URI* \[W3C-definitie\]
A Uniform Resource Identifier (URI) is a compact sequence of characters that identifies an abstract or physical resource. This specification defines the generic URI syntax and a process for resolving URI references that might be in relative form, along with guidelines and security considerations for the use of URIs on the Internet. The URI syntax defines a grammar that is a superset of all valid URIs, allowing an implementation to parse the common components of a URI reference without knowing the scheme-specific requirements of every possible identifier. This specification does not define a generative grammar for URIs; that task is performed by the individual specifications of each URI scheme.
The following are two example URIs and their component parts:
\\
foo://example.com:8042/over/there?name=ferret#nose
\_/__\__\__\______\__/_\__\__\__\__/ \_\__\__\____/ \__/

scheme authoritypathquery fragment

_____________________

__
/ \ /\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
urn:example:animal:ferret:nose
Zie: Tim Berners-Lee, Roy T. Fielding, Larry Masinter. (January 2005). "Uniform Resource Identifier (URI): Generic Syntax". Internet Society. RFC 3986; STD 66. http://gbiv.com/protocols/uri/rfc/rfc3986.htmlhttp://gbiv.com/protocols/uri/rfc/rfc3986.html

<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="838b83c81bdea777-c2402f70-4a524869-ab638149-b1b85fc4f27f326289297d07"><ac:plain-text-body><![CDATA[URI, URL, and URN [W3C-definitie]
]]></ac:plain-text-body></ac:structured-macro>
A URI can be further classified as a locator, a name, or both. The term "Uniform Resource Locator" (URL) refers to the subset of URIs that, in addition to identifying a resource, provide a means of locating the resource by describing its primary access mechanism (e.g., its network "location"). The term "Uniform Resource Name" (URN) has been used historically to refer to both URIs under the "urn" scheme

Wiki Markup
\[RFC2141\]
, which are required to remain globally unique and persistent even when the resource ceases to exist or becomes unavailable, and to any other URI with the properties of a name.
An individual scheme does not have to be classified as being just one of "name" or "locator". Instances of URIs from any given scheme may have the characteristics of names or locators or both, often depending on the persistence and care in the assignment of identifiers by the naming authority, rather than on any quality of the scheme. Future specifications and related documentation should use the general term "URI" rather than the more restrictive terms "URL" and "URN"
Wiki Markup
\[RFC3305\]
.

...

  • Namespace referencing. One may specify any existing identifier as a URN: e.g
    urn:isbn:123456789, but this has no advantage over the simpler isbn:12345678. Such identifiers may be implemented using a specially written URN plug-in and resolved to URLs: functionally this gives nothing beyond the functionality achieved by coherent management of the corresponding URLs.
  • URN implementation. In order to implement the functional requirements, the URN
    architecture assumes an additional network service: a DNS-based Resolution
    Discovery Service (RDS) to allow a client to deal with a previously unknown URN type by finding the specific service appropriate to the given URN scheme. URN resolutions are then delegated to that scheme-specific resolution service. However no such deployed RDS schemes currently exist: browsers cannot action URN strings without some additional programming in the form of a "plug-in". The lack of any wide-spread infrastructural support will require any URN implementation to develop their own resolution mechanisms, such as plug-ins or proxy servers. Resolution mechanisms which require functionality beyond 1 URN to 1 URL also require the creation of data models. Several such implementations have been developed for specific uses where deployment to a closed group of users may be achieved; these carry no guarantee of ready interoperability with other deployments, which may require a different plug in for each implementation and may use conflicting data approaches.
  • DOI names do not require a plug-in but offer this as an option. The CNRI Handle
    System web browser plug-in (native resolver, available in binary) delivers URN
    functional requirements in Windows-based browsers. URN plug-ins and Handle
    System plug-ins share the problem of any new functionality of deploying the software to users. The Handle System has significant advantages: (1) it is a global supported resolution service; (2) the plug-in is freely available, widely tested and proven across multiple applications; (3) unlike URN plug-ins, it is part of a suite of freely available and managed supporting software configured to provide coherent server-side support, including the Local Handle Service and Handle System Client Libraries. These are available across platforms and with several added security features such as trusted
    resolution and distributed administration. DOI System functionality can therefore be
    delivered through http, browser plug in, or incorporation of HANDLE.NET ® software in a dedicated application.
  • The DOI System is not registered as a formal URN, despite fulfilling all the functional requirements, since URN registration appears to offer no advantage to the DOI System. It requires an additional layer of administration for defining a DOI name as a URN namespace (the string urn:doi:10.1000/1 rather than the simpler doi:10.1000/1) and an additional step of unnecessary redirection to access the resolution service, already achieved through either http proxy or native resolution. If RDS mechanisms supporting URN specifications become widely available, the DOI System will be registered as a URN.

    Zie: http://www.doi.org/factsheets/doiidentifierspecs.htmlhttp://www.doi.org/factsheets/DOIIdentifierSpecs.html

Anchor
Bijlage2
Bijlage2

Bijlage 2. Voorstel voor gefaseerde implementatie URN:NBN in DAREverband.

...