Harvesting DSpace metadata - matt-bernhardt/datalore GitHub Wiki
For any given item within a DSpace repository, there are several different sets of metadata available. Unfortunately, there are multiple ways to get at this metadata:
Sample item page
http://dspace.mit.edu/handle/1721.1/50236
The default item page within the DSpace UI displays basic metadata such as the paper title, authors, abstract, etc. This is rendered in HTML format, and certain fields may be truncated for length.
Full item page
http://dspace.mit.edu/handle/1721.1/50236?show=full
The full item page shows more complete metadata. For DSpace@MIT, this has been cataloged using Dublin Core. Like the simple item page, this is rendered in HTML format.
OAI-PMH display
Basic metadata can be access in XML format using an OAI-PMH endpoint.
DRI display
http://dspace.mit.edu/DRI/handle/1721.1/50236
Some information, including information used to build the DSpace UI, can be found via DRI. Much of the most interesting metadata is only available within the metadata@element="xhtml_head_item" field as a separate blob of XML.
XML display
http://dspace.mit.edu/handle/1721.1/50236?XML
This display appears to be somewhat similar to the DRI display.
METS display
http://dspace.mit.edu/metadata/handle/1721.1/50236/mets.xml
METS-format XML information, including a structured presentation of the xhtml_head_item element from above, can be found at these locations.
Open Search
http://dspace.mit.edu/open-search/?query=1721.1/50236
Open Search is a platform for searching DSpace, rather than precisely asking for a single record. However, some interesting results may be found by asking for the handle of a known item - see the above example.
There are probably other ways to get at even more metadata from a DSpace repository? Perhaps?