Metadata for this document
Title |
DARE use of Dublin Core |
Creator |
Domingus, Marlon; Feijen, Martin |
Subject |
DARE repositories; metadata; Dublin Core |
Description |
Guidelines for the use of qualified Dublin Core within the Dare Programme |
Publisher |
Stichting SURF |
Date |
2004-12-06; Date_valid: 2004-12-01 / 2005-12-01 |
Type |
Internal report |
Format |
Text/richtext |
Identifier |
SURF OZ.04.5234 |
Language |
Eng |
Rights |
Copyright Stichting SURF. The text of this document may be used freely, without permission of Stichting SURF. |
Version |
Remarks |
|
|
|
|
August 2003 |
First internal version presented to project manangers |
|
September 2003 |
Second internal version presented to project managers |
|
1.0 (October 2003) |
First edition to be used starting from November 1 2003 |
Download PDF |
2.0 (December 2004) |
Second edition to be used starting from December 1 2004 |
|
Addendum (July 2006) |
Addendum on v2.0 |
\\ \\ \\ *Acknowledgements* This document is largely based on the recommendations for the use of simple Dublin Core metadata as described in: USING SIMPLE DUBLIN CORE TO DESCRIBE EPRINTS, by Andy Powell, Michael Day and Peter Cliff, UKOLN, University of Bath, Version 1.2 \[see also: [http:*/* www.rdn.ac.uk/projects/epri<ac:structured-macro ac:name="anchor" ac:schema-version="1" ac:macro-id="51b16d9a-8b1c-45c4-aa42-23e99713418c"><ac:parameter ac:name="">_Hlt53475187</ac:parameter></ac:structured-macro><ac:structured-macro ac:name="anchor" ac:schema-version="1" ac:macro-id="58194d18-9d03-4f20-8e93-cf0984a2e517"><ac:parameter ac:name="">_Hlt53475188</ac:parameter></ac:structured-macro>nts\-<ac:structured-macro ac:name="anchor" ac:schema-version="1" ac:macro-id="eb038d3e-6099-4362-bfda-e6fcd7195394"><ac:parameter ac:name="">_Hlt52270635</ac:parameter></ac:structured-macro><ac:structured-macro ac:name="anchor" ac:schema-version="1" ac:macro-id="c5c09334-89de-4ca4-8f3a-6ae14bced7b9"><ac:parameter ac:name="">_Hlt52270636</ac:parameter></ac:structured-macro>uk/docs/simpledc-guidelines/|http://www.rdn.ac.uk/projects/eprints-uk/docs/simpledc-guidelines/] \] \\ *Definitions*: "A DARE institutional repository is a facility, consisting of hardware, software, data and procedures, that contains digital resources representing any type of scientific output..." Specifications for a Networked Repository for Dutch Universities, version 3.0, p 6 "digital resources = any bit stream, independent of content or format, which has been marked as scientific output by an approved person..." Within this document we use the word "resource" to describe the instance of scientific output, and the word "object" to refer to the digital bit stream. \\ \\ |
Scope These guidelines are written primarily to facilitate the exchange of metadata between Dare partners and exchange with non-Dare partners, in compliance with the OAI-PMH definitions as distributed by DCMI. Basically these guidelines describe the mapping from an internal E.g. a Dare partner might use Marc 21 as internal format format to unqualified DC to support harvesting. The guidelines are not to be used as cataloguing instructions.
Within Dare we use unqualified DC (oai_dc).
Use of qualified DC (dare_qdc) is encouraged. Only those refinements that have been added by DCMI are to be used as refinements within Dare. These refinements have also been added in the text of the guidelines below. If a Dare partner has implemented any other (not DCMI endorsed) elements or refinements, he is obliged to eliminate these elements from the metadata during the harvesting process.
Dare partners will implement two XML schemas: one for unqualified DC for OAI compatible harvesting within the Dare community as well as outside the DARE community. Also a XML scheme will be presented for qualified DC for use within Dare.
Language of the metadata is at the discretion of the local Dare partner.
The use of Unicode is mandatory.
Only one metadata record should be used for different versions of a digital object (e.g. a postscript and a pdf version), unless the intellectual content of the versions is different. The rule of thumb is to create a new metadata record when the metadata of a version is different. This happens for instance when a new version of the resource with modifications is created and in that case recommended best practice is to use the relation element to link the newer version to the older.
In some cases (DC element 'subject' and 'type') additional information may be useful for the harvesting party and service provider. A DARE compliant data provider releases this type of information via the 'Identify request' - on IR level; not on the metadata level.
See for instance: 3. Guidelines for Optional Containers at:
http://www.openarchives.org/OAI/2.0/guidelines.htm and: http://arXiv.org/oai2?verb=Identify as well as: http://doc.utwente.nl/oai/ir?verb=Identify for best practices. Additional information can also be given in the form of textual documentation about the use metadata elements subject and type, e.g. to give information on the local classification or keywords, or information on local review policies.
The values (i.e. actual content) of the elements given below must not contain any HTML (or XML) markup. They may contain LaTeX commands, but there is no mechanism for explicitly indicating that LaTeX is being used.
Within DARE the use of elements is either:
The "mandatory when applicable" status is stronger then the recommended one and this distinction is made primarily to encourage users to input certain elements when creating a metadata record to enhance services.
Some words on the use of refinements (qualifiers). When mapping to unqualified DC the IR manager has to make choices when the internal format is "richer" than unqualified DC. This means that during the mapping process all refinements are simply dropped (the DCMI dumb down principle). The effect of the dumb down principle is that the simple form of the element, i.e. without the refinement, is the default one. E.g. when the internal format distinguishes between main title and parallel title this would show as follows in DC:
Internal format
245 $aMain title$pParallel title
Qualified DC
<dc:title>Main title</dc:title>
<dcterms:alternative>Parallel title</dcterms:alternative>
Simple DC
<dc:title>Main title, Parallel title </dc:title>
However, within DARE the following values are selected as the default values for simple oai_dc
dc:descriptiondefault "abstract"
dc:date ->default "created"
dc:relation->no default
dc:coverage-> no default
dc:rights-> no default
dc:audience->default: "education level"
Within DARE this means that the date element always pertains to the date created etc. It is advised that all DARE repositories supply this information to external harvesters as information about their repository.
As per 1/1/05 all DARE repositories are required to support oai_dc and are free to use dare_qdc. Harvesting within DARE will be based upon oai_dc.
Most important new or changed guidelines in oai_dc
Most important new or changed guidelines in dare_qdc
Simple DC:oai_dc
Basic element |
Status |
Encoding schemes |
|
|
|
Title |
M |
None |
Creator |
M |
None |
Subject |
MA |
Choice of keywords and classifications is free |
Description |
MA |
None |
Publisher |
MA |
None |
Contributor |
O |
None |
Date |
M |
Date | ISO 8601 W3C-DTF |
Type |
M |
METIS-list with additional DCMI types. |
Format |
R |
IANA list of MIME types |
Identifier |
M |
URI |
Source |
O |
None |
Language |
R |
ISO 639-1 |
Relation |
R |
none |
Coverage |
O |
Period |
Rights |
M |
None |
Audience |
O |
None |
If no defaults are mentioned in the oai_dc elements, please describe the specific use of the oai_dc elements in the Identify section of your IR. See for instance: 3. Guidelines for Optional Containers at:
http://www.openarchives.org/OAI/2.0/guidelines.htm and: http://arXiv.org/oai2?verb=Identify as well as: http://doc.utwente.nl/oai/ir?verb=Identify for best practices.
Qualified DC:oai_dc
Basic element |
Refinement |
Status |
Encoding schemes |
|
|
|
|
Title |
- |
M |
None |
|
Alternative |
MA |
|
Creator |
- |
M |
None |
Subject |
GOO, NBC, LCSH, MESH, DDC, LCC, UDC, LOCAL |
MA |
Choice of keywords and classifications is free. Use refinements when appropriate. |
Description |
- |
MA |
None |
|
TableOfContents |
R |
|
|
Abstract |
R |
|
Publisher |
- |
MA |
None |
Contributor |
- |
O |
None |
Date |
- |
M |
Date | ISO 8601 W3C-DTF |
|
dateAccepted |
R |
|
|
dateCopyrighted |
R |
|
|
Created |
R |
Created is default in mapping |
|
Valid |
R |
|
|
Available |
R |
|
|
Issued |
R |
|
|
Modified |
R |
|
|
dateSubmitted |
R |
|
Type |
- |
M |
METIS-list with additional DCMI types. |
Format |
R |
IANA list of MIME types |
|
|
Extent |
R |
|
|
Medium |
R |
|
Identifier |
- |
M |
URI |
|
Bibl. citation |
R |
|
Source |
- |
O |
None |
Language |
- |
R |
ISO 639-1 |
Relation |
- |
R |
none |
|
Isversionof |
R |
|
|
Hasversion |
R |
|
|
Replacedby |
R |
|
|
Replaces |
R |
|
|
Requiredby |
R |
|
|
Requires |
R |
|
|
Ispartof |
R |
|
|
Haspart |
R |
|
|
Isreferredby |
R |
|
|
References |
R |
|
|
Isformatof |
R |
|
|
hasFormat |
R |
|
|
Conformsto |
R |
|
Coverage |
- |
O |
|
|
Spatial |
R |
Point |
|
Temporal |
R |
Period |
Rights |
- |
M |
None |
|
Access rights |
MA |
|
Audience |
|
O |
None |
This section lists each of the Dublin Core elements. For each element the authoritative definitions and comments (except usage mandatory/optional etc, which is DARE specific) from the Dublin Core Metadata Initiative are given, followed by a DARE-specific user instruction derived form the UKOLN usage guidelines.
Element name |
Title |
||
DCMI definition |
A name given to the resource. Typically, a Title will be a name by which the resource is formally known. |
||
Usage |
Mandatory |
||
User instruction |
Preserve the original wording, order and spelling of the resource title. Only capitalize proper nouns. Punctuation need not reflect the usage of the original. Subtitles should be separated from the title by a colon. |
||
Do not confuse with |
- |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="a3245a6b-e207-4064-a627-fc641edd89fd"><ac:plain-text-body><![CDATA[ |
Refinements |
Alternative (Mandatory if present). [DCMI:]Any form of the title used as a substitute or alternative to the formal title of the resource. This qualifier can include Title abbreviations as well as translations. |
]]></ac:plain-text-body></ac:structured-macro> |
Examples |
Qualified DC |
||
Scheme |
Not applicable |
Element name |
Creator |
DCMI definition |
An entity primarily responsible for making the content of the resource. Typically, the name of a Creator should be used to indicate the entity. |
Usage |
Mandatory |
User instruction |
Examples of a Creator include a person, an organization, or a service. |
Do not confuse with |
Contributor (see also User instruction above). |
Refinements |
- |
Examples |
<dc:creator>Sulston, John E.</dc:creator> |
Scheme |
Not applicable |
Element name |
Subject |
DCMI definition |
The topic of the resource. Typically, a Subject will be expressed as keyword, key phrases or classification codes that describe the intellectual content of the resource. |
Usage |
Mandatory when applicable |
User instruction |
In the DC subject element two kinds of values are possible. The first - the use of keywords - is mandatory. The second - the use of a classification - is optional. |
Do not confuse with |
Type. |
Refinements |
LCSH, MESH, DDC, LCC, UDC, GOO, NBC and LOCAL |
Examples |
<dc:subject>polar oceanography; boundary current; mass transport; |
Scheme |
LCSH, MESH, DDC, LCC, UDC, NBC and GOO |
Element name |
Description |
||
DCMI definition |
An account of the content of the resource. Description may include but is not limited to: an abstract, table of contents, reference to a graphical representation of content or a free-text account of the content. |
||
Usage |
Mandatory if applicable |
||
User instruction |
This element is used for a textual description of the content. When a resource consists of several separate physical object files, do not use dc:description to list the URL's of these files. |
||
Do not confuse with |
- |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="9ab35afa-2253-496f-909d-60a2f4814ba0"><ac:plain-text-body><![CDATA[ |
Refinements |
Tableofcontent (recommended) [DCMI:] A list of subunits of the content of the resource. |
]]></ac:plain-text-body></ac:structured-macro> |
Examples |
<dc:description>Inleiding; 5 hoofdstukken over geschiedenis; 2 hoofdstukken met praktische tips; index</dc:description> |
||
Scheme |
Not applicable |
Element name |
Publisher |
|
DCMI definition |
An entity responsible for making the resource available. Examples of a Publisher include a person, an organization, or a service. Typically, the name of a Publisher should be used to indicate the entity. |
|
Usage |
Mandatory if applicable |
|
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="6f7e73c5-47dd-4242-890d-2d8e1575cca6"><ac:plain-text-body><![CDATA[ |
User instruction |
The (commercial or non-commercial) publisher of the resource; not the (sub)institution the author is affiliated with. Publisher is used only in the bibliographic / functional sense, not an organisational one. Use only the full name of the given (commercial) publisher, not the name of an organization or institute that is otherwise [in a broader sense] associated with the creator. |
Do not confuse with |
|
|
Refinements |
- |
|
Examples |
<dc:publisher>Loughborough University. Department of Computer Science</dc:publisher> |
|
Scheme |
Not applicable |
Element name |
Contributor |
DCMI definition |
An entity responsible for making contributions to the content of the resource. Examples of a Contributor include a person, an organization, or a service. Typically, the name of a Contributor should be used to indicate the entity. |
Usage |
Optional |
User instruction |
Examples of contributors are: a supervisor, editor, technician or data collector. |
Do not confuse with |
|
Refinements |
- |
Examples |
<dc:contributor>Sulston, John E.</dc:contributor> |
Scheme |
Not applicable |
Element name |
Date |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="f59dfcae-1de7-4f75-88bd-c7e9dc74544b"><ac:plain-text-body><![CDATA[ |
DCMI definition |
A date associated with an event in the life cycle of the resource. Typically, Date will be associated with the creation or availability of the resource. Recommended best practice for encoding the date value is defined in a profile of ISO 8601 [W3CDTF] and follows the YYYY-MM-DD format. |
]]></ac:plain-text-body></ac:structured-macro> |
Usage |
Mandatory |
||
User instruction |
The date should be formatted according to the W3C encoding rules for dates and times : |
||
Do not confuse with |
- |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="896a20f4-f771-4650-83b0-60856b10b1f1"><ac:plain-text-body><![CDATA[ |
Refinements |
DateAccepted (Optional) [DCMI:] Date of acceptance of the resource (e.g. of thesis by university department, of article by journal, etc.). |
]]></ac:plain-text-body></ac:structured-macro> |
Examples |
<dc:date>2000-12-25</dc:date> |
||
Schema |
Date |
ISO 8601 W3C-DTF see: http://www.w3.org/TR/NOTE-datetime |
Element name |
Type |
DCMI definition |
The type of scientific output the resource is a manifestation of. In the DC element type the kind of dissemination, or the intellectual and/or content type of the resource is described. It is used to explain to the user what kind of resource he is looking at. Is it a book or an article. Was it written for internal or external use. Etc. |
Usage |
Mandatory. In every metadata record one DC element 'type' should be used. |
User instruction |
Use the first occurrence of the DC element 'type' for the type indication of the scientific output. The list shown below is identical with the list used within the Metis application. Repeat if applicable. Use the text, not the numbers.
|
Do not confuse with |
Format |
Refinements |
- |
Examples |
<dc:type>preprint</dc:type> |
Scheme |
|
Element name |
Format |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="0b650363-73bc-43dd-b6fb-9cbefce3b66c"><ac:plain-text-body><![CDATA[ |
DCMI definition |
The physical or digital manifestation of the resource. Typically, Format may include the media-type or dimensions of the resource. Format may be used to determine the software, hardware or other equipment needed to display or operate the resource. Examples of dimensions include size and duration. Recommended best practice is to select a value from a controlled vocabulary (for example, the list of Internet Media Types [MIME] defining computer media formats). |
]]></ac:plain-text-body></ac:structured-macro> |
Usage |
Recommended |
||
User instruction |
The DC element 'format' is used in order to give DARE partners the necessary context to base services on. A DARE partner can selectively harvest those records that link to resources that use or operate on software, hardware or other equipment that is supported by the DARE partner's institute. |
||
Do not confuse with |
Type |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="6785d9ec-4a96-4b6f-8a0c-608bc5a71f64"><ac:plain-text-body><![CDATA[ |
Refinements |
Extent (Optional) [DCMI:] The size or duration of the resource. E.g. number of pages of an article. |
]]></ac:plain-text-body></ac:structured-macro> |
Examples |
<dc:format>application/pdf</dc:format> |
||
Scheme |
the IANA registered list of Internet Media Types (MIME types) |
Element name |
Identifier |
DCMI definition |
An unambiguous reference to the resource within a given context. |
Usage |
Mandatory |
User instruction |
Use an URI to point to the resource (metadata). |
Do not confuse with |
dc:source and dc:relation |
Refinements |
|
Example |
Open URL syntax example: |
Scheme |
Dcterms |
Further information |
Open URL: See also: http://library.caltech.edu/openurl/ |
Property |
Encoding Scheme |
Value |
dc:title |
|
Studying E-Journal User Behavior Using Log Files |
dc:creator |
|
Yu, L. |
dc:creator |
|
Apps, A. |
dc:subject |
dcterms:DDC |
020 |
dc:subject |
dcterms:LCC |
Z671 |
dc:publisher |
|
Elsevier |
dc:type |
dcterms:DCMIType |
Text |
dcterms:issued |
dcterms:W3CDTF |
2000 |
dcterms:isPartOf |
dcterms:URI |
urn: ISSN:0740-8188 |
dcterms:bibliographicCitation |
|
Library and Information Science Research 22(3), 311-338. (2000) |
For oai_dc repeat dc:subject and dc:type and describe in the order in which oai_dc elements are used in the Identify section of your IR. See for instance: http://arXiv.org/oai2?verb=Identify for best practice.
Element name |
Source |
||
DCMI definition |
A reference to a resource from which the present resource is derived. |
||
Usage |
Optional |
||
User instruction |
The present resource may be derived from the Source resource in whole or in part. Recommended best practice is to reference the resource by means of a string or number conforming to a formal identification system. |
||
Do not confuse with |
dc:relation and dc:identifier |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="9a45e315-dca8-45fc-ab99-57fc190e1f49"><ac:plain-text-body><![CDATA[ |
Refinements |
Bibl. Citation (Optional) [DCMI:] A bibliographic reference for the resource. |
]]></ac:plain-text-body></ac:structured-macro> |
Example |
<dc:source>Ecology Letters (1461023X) vol.4 (2001)</dc:source> |
||
Scheme |
ISSN, ISBN |
Element name |
Language |
|
DCMI definition |
A language of the intellectual content of the resource. |
|
Usage |
Recommended |
|
User instruction |
A specific resource (an instance of scientific output) is either written in one human readable language or more. In these cases all used languages are used in the DC element 'language'. If a specific resource (an instance of scientific output) is written in one human readable language and is translated into other human readable languages, these translations are distinguished from the original version and therefore described separately. |
]]></ac:plain-text-body></ac:structured-macro> |
Do not confuse with |
- |
|
Refinements |
- |
|
Examples |
<dc:language>en</dc:language> |
|
Scheme |
ISO 639-1 and ISO 639-2, see: http://www.loc.gov/standards/iso639-2/englangn.html |
Element name |
Relation |
||
DCMI definition |
The reference to a related resource. |
||
Usage |
Recommended |
||
User instruction |
Recommended best practice is to reference the resource by means of a string or number conforming to a formal identification system.
|
||
Do not confuse with |
dc:identifier and dc:source. |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="d78790b5-a719-4c71-a6cb-ccb94c444f7b"><ac:plain-text-body><![CDATA[ |
Refinements |
Isversionof (recommended) [DCMI:] The described resource is a version, edition, or adaptation of the referenced resource. Changes in version imply substantive changes in content rather than differences in format. |
]]></ac:plain-text-body></ac:structured-macro> |
Example |
<dc:relation:haspreviousversion>uri</dc:relation:haspreviousversion> |
||
Scheme |
- |
Element name |
Coverage |
||
DCMI definition |
The extent or scope of the content of the resource. Coverage will typically include spatial location (a place name or geographic coordinates), temporal period (a period label, date, or date range) or jurisdiction (such as a named administrative entity). |
||
Usage |
Optional |
||
User instruction |
Recommended best practice is to select the value from a controlled vocabulary (for example, the Getty Thesaurus of Geographic Names or TGN) and that, where appropriate, named places or time periods be used in preference to numeric identifiers as, for example, sets of co-ordinates or date ranges. |
||
Do not confuse with |
- |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="cd2f0707-e410-4721-87fc-4b1157400aeb"><ac:plain-text-body><![CDATA[ |
Refinements |
Spatial (Optional) [DCMI:] Spatial characteristics of the intellectual content of the resource. |
]]></ac:plain-text-body></ac:structured-macro> |
Examples |
Example Spatial - ISO 3166 |
||
Scheme |
Point http:/ dublincore.org/documents/dcmi-point/ |
Element name |
Rights |
|||
DCMI definition |
Information about rights held in and over the resource. |
|||
Usage |
Mandatory |
|||
User instruction |
Typically, a Rights element will contain a rights management statement for the access or use of the object, or reference a service providing such information. Rights information often encompasses Intellectual Property Rights (IPR), Copyright, and various Property Rights. |
|||
Do not confuse with |
- |
|||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="f899795d-7084-4c6c-a3fc-7475fe7054d7"><ac:plain-text-body><![CDATA[ |
Refinements |
Access rights (Mandatory if formulated) [DCMI:] Information about who can access the resource or an indication of its security status. |
http://creativecommons.org/licenses/]. |
]]></ac:plain-text-body></ac:structured-macro> |
Examples |
<dc:rights>(c) University of Bath, 2003</dc:rights> |
|||
Scheme |
- |
Element name |
Audience |
||
DCMI definition |
A class of entity for whom the resource is intended or useful. |
||
Usage |
Optional |
||
User instruction |
A class of entity may be determined by the creator or the publisher or by a third party. |
||
Do not confuse with |
- |
||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="d015aafe-9397-441d-8e2c-49ebf14b5aa8"><ac:plain-text-body><![CDATA[ |
Refinements |
Mediator (Optional) [DCMI:] A class of entity that mediates access to the resource and for whom the resource is intended or useful. |
]]></ac:plain-text-body></ac:structured-macro> |
Examples |
<dc: audience>Researchers</dc: audience> |
||
Scheme |
- |