Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EDM: resolve vocabulary items by id or URI to value #3

Open
twagoo opened this issue Jan 25, 2018 · 0 comments
Open

EDM: resolve vocabulary items by id or URI to value #3

twagoo opened this issue Jan 25, 2018 · 0 comments

Comments

@twagoo
Copy link
Member

twagoo commented Jan 25, 2018

Europeana records from the newspapers collections (possibly also others) use various identifiers for e.g. subject or resource type values that could be resolved to make the metadata better suitable for indexing into the VLO.

  • IDs of the library of congress subject headers appear as subject values, e.g. <dc-subject>sh85091614</dc-subject> (full record), which is a reference to http://id.loc.gov/authorities/subjects/sh85091614 "Newspapers--Sections, columns, etc" (skos RDF)

    • These always take the form of /sh[0-9]+/ as text content within dc:subject elements. The concept URIs don't appear to be used, i.e. no @rdf:resource.
  • Resource types are often encoded with concepts from the Getty Art and Architecture Thesaurus, which are included in expanded form in the RDF/XML representations harvested. Rather than rendering the full content we could also detect these and do a lookup or trim down the provided values to only include the most relevant information.

    • Example from the RDF: <dc:type xmlns:dc="http://purl.org/dc/elements/1.1/" rdf:resource="http://vocab.getty.edu/aat/300026656"/>, which is expanded in the conversion to CMDI with all content found in the concept definition also included in the RDF/XML served by Europeana's OAI provider (in this case ten altLabels/prefLabels in different languages: Tageblätter, tidning, newspaper etc).

An example EDM record and its current CMDI conversion:
BibliographicResource_30001170701972017.xml (RDF/XML)
BibliographicResource_30001170701972017.cmdi (CMDI)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant