Harvesting DSpace metadata - matt-bernhardt/datalore GitHub Wiki

For any given item within a DSpace repository, there are several different sets of metadata available. Unfortunately, there are multiple ways to get at this metadata:

Sample item page

http://dspace.mit.edu/handle/1721.1/50236

The default item page within the DSpace UI displays basic metadata such as the paper title, authors, abstract, etc. This is rendered in HTML format, and certain fields may be truncated for length.

Full item page

http://dspace.mit.edu/handle/1721.1/50236?show=full

The full item page shows more complete metadata. For DSpace@MIT, this has been cataloged using Dublin Core. Like the simple item page, this is rendered in HTML format.

OAI-PMH display

http://dspace.mit.edu/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:dspace.mit.edu:1721.1/50236

Basic metadata can be access in XML format using an OAI-PMH endpoint.

DRI display

http://dspace.mit.edu/DRI/handle/1721.1/50236

Some information, including information used to build the DSpace UI, can be found via DRI. Much of the most interesting metadata is only available within the metadata@element="xhtml_head_item" field as a separate blob of XML.

XML display

http://dspace.mit.edu/handle/1721.1/50236?XML

This display appears to be somewhat similar to the DRI display.

METS display

http://dspace.mit.edu/metadata/handle/1721.1/50236/mets.xml

METS-format XML information, including a structured presentation of the xhtml_head_item element from above, can be found at these locations.

Open Search

http://dspace.mit.edu/open-search/?query=1721.1/50236

Open Search is a platform for searching DSpace, rather than precisely asking for a single record. However, some interesting results may be found by asking for the handle of a known item - see the above example.


There are probably other ways to get at even more metadata from a DSpace repository? Perhaps?