Europeana and schema.org
Antoine Isaac
Dublin Core Conference
Schema.org special session
5 September 2013
Europeana Data Model: an example
For a general presentation on Europeana and EDM rationale see http://pro.europeana.eu/edm-documentation
Provided Cultural Heritage Object (CHO) and descriptive metadata
Web Resources – digital representations
Aggregations – Bundling it all together
Why use schema.org?
Europeana tries to disseminate its data to reach as many users as possible
Search engines
• Customization of result lists – rich snippets
• Knowledge Graph
• Search Engine Optimization
Developers are more comfortable with parsing web pages
In fact: schema.org and RDFa
Europeana has been publishing structured metadata via its portal for a while
One application case: customization of public domain pages by
Creative Commons, with details on the work and Europeana
usage guidelines for public domain works
Europeana, Creative Commons and RDFa
http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html
The Creative Commons Public Domain page triggers a script that harvests mark-up on the Europeana object page
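The harvesting step can be pictured with a minimal sketch. This is not the Creative Commons script itself: the HTML fragment, the `PropertyHarvester` class and the choice of attributes read are assumptions made for illustration, and only Python's standard `html.parser` is used rather than a full RDFa processor.

```python
from html.parser import HTMLParser


class PropertyHarvester(HTMLParser):
    """Collect (property, value) pairs from RDFa-style attributes (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.pairs = []
        self._pending = None  # property still waiting for its text content

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        prop = attrs.get("property") or attrs.get("rel")
        if prop is None:
            return
        # A link target or an explicit content attribute carries the value directly.
        value = attrs.get("content") or attrs.get("href") or attrs.get("resource")
        if value is not None:
            self.pairs.append((prop, value))
        else:
            self._pending = prop

    def handle_data(self, data):
        # Otherwise the value is the first non-empty text inside the element.
        if self._pending is not None and data.strip():
            self.pairs.append((self._pending, data.strip()))
            self._pending = None


# Assumed markup, loosely modelled on the object page; not copied from the portal.
SAMPLE = """
<div about="http://example.org/record/1">
  <span property="dc:title">Cofre de base rectangular</span>
  <span property="dc:creator">Desconhecido</span>
  <a rel="xhv:license" href="http://creativecommons.org/publicdomain/mark/1.0/">Public Domain</a>
</div>
"""


def harvest(html):
    """Return every (property, value) pair found in the given HTML string."""
    parser = PropertyHarvester()
    parser.feed(html)
    parser.close()
    return parser.pairs


if __name__ == "__main__":
    for prop, value in harvest(SAMPLE):
        print(prop, "=", value)
```

A real consumer would use a conformant RDFa parser, which also resolves prefixes and subjects; the sketch only shows why a handful of agreed fields in the page mark-up is enough for this kind of reuse.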
Going further
Creative Commons uses 5-6 fields, agreed upon with the developer(s) there
What to publish further?
Schema.org as a standardized way to do more web page-based data exchange
And it has a use case – several, in fact
Still quite prototype-ish
Current EDM – schema.org mapping
EDM element
Schema.org mapping
ProvidedCHO & Proxy
dc:contributor
dc:coverage
dc:creator
dc:date
dc:description
dc:format
dc:identifier
dc:language
dc:publisher
dc:relation
dc:rights
dc:source
dc:subject
dc:title
dc:type
dcterms:alternative
dcterms:conformsTo
dcterms:created
dcterms:extent
dcterms:hasFormat
dcterms:hasPart
dcterms:hasVersion
dcterms:isFormatOf
dcterms:isPartOf
dcterms:isReferencedBy
dcterms:isReplacedBy
dcterms:isRequiredBy
dcterms:issued
dcterms:isVersionOf
dcterms:medium
dcterms:provenance
dcterms:references
dcterms:replaces
dcterms:requires
dcterms:spatial
dcterms:tableOfContents
dcterms:temporal
edm:currentLocation
edm:hasMet
edm:hasType
edm:incorporates
edm:isDerivativeOf
edm:isNextInSequence
edm:isRelatedTo
edm:isRepresentationOf
edm:isSimilarTo
edm:isSuccessorOf
edm:realizes
edm:type
edm:unstored
edm:wasPresentAt
edm:europeanaProxy
edm:userTag
edm:year
ore:proxyFor
ore:proxyIn
owl:sameAs
rdf:type
schema:CreativeWork
schema:contributor
schema:creator
schema:description
schema:inLanguage
schema:publisher
Aggregation and EuropeanaAggregation
ore:aggregates       N/A
edm:aggregatedCHO    N/A
edm:country          schema:addressCountry
edm:dataProvider     schema:provider
edm:hasView          schema:url
edm:isShownAt        schema:url
edm:isShownBy        schema:contentUrl
edm:landingPage      schema:url
edm:language
schema:about
schema:name
schema:alternativeHeadline
schema:dateCreated
edm:object           schema:image, if preview opt-out is NOT activated
edm:preview          schema:thumbnailUrl, if preview opt-out is NOT activated
edm:provider         schema:provider
dc:rights
edm:rights
edm:unstored
N/A
WebResource
schema:WebPage or schema:MediaObject
dc:description
schema:additionalType
edm:rights
schema:about
N/A
N/A
schema:keywords or schema:comment
N/A
N/A
edm:end
schema:deathdate
edm:hasMet
schema:knows
edm:isRelatedTo
edm:wasPresentAt
foaf:name
schema:name
rdaGr2:biographicalInformation schema:description
rdaGr2:dateOfBirth
schema:birthdate
rdaGr2:dateOfDeath
schema:deathdate
rdaGr2:dateOfEstablishment
schema:foundingDate
rdaGr2:gender
schema:gender
rdaGr2:professionOrOccupation schema:jobTitle
owl:sameAs
schema:Place
wgs84_pos:alt
schema:encodesCreativeWork
wgs84_pos:lat_long
skos:prefLabel
schema:dateCreated
schema:name
skos:altLabel skos:hiddenLabel schema:additionalName
skos:note
schema:description
dcterms:hasPart
dcterms:isFormatOf
edm:isNextInSequence
edm:begin
wgs84_pos:long
dcterms:hasPart
dcterms:issued
dc:identifier
wgs84_pos:lat
dcterms:extent
schema:contentLocation
schema:description
dc:date
Place
dcterms:conformsTo
dcterms:created
skos:note
schema:description
dc:rights
dc:source
schema:name
skos:altLabel skos:hiddenLabel schema:additionalName
schema:url
dc:format
schema:mentions
schema:Person or schema:Organization
skos:prefLabel
rdaGr2:dateOfTermination
edm:ugc
schema:datePublished
Agent
schema:datePublished
dcterms:isPartOf
owl:sameAs
schema:containedIn
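To make the mapping concrete, here is a small sketch of how such a table could be applied to one record. It is an illustration, not Europeana's implementation: the dictionary holds only a subset of pairs that can be read off this deck (the mapping slide together with the example record), and the function name `to_schema_org` and the record layout (a dict from property to list of values) are assumptions.

```python
# Subset of the EDM -> schema.org mapping; a sketch, not the complete table.
EDM_TO_SCHEMA = {
    "dc:creator": "schema:creator",
    "dc:description": "schema:description",
    "dc:subject": "schema:about",
    "dc:title": "schema:name",
    "edm:country": "schema:addressCountry",
    "edm:dataProvider": "schema:provider",
    "edm:provider": "schema:provider",
    "edm:isShownAt": "schema:url",
    "edm:isShownBy": "schema:contentUrl",
    "edm:landingPage": "schema:url",
}


def to_schema_org(edm_record):
    """Map an EDM record to schema.org properties (hypothetical helper).

    Returns the mapped properties plus the sorted list of EDM properties
    that have no schema.org counterpart in the table and are dropped.
    """
    mapped = {}
    unmapped = []
    for prop, values in edm_record.items():
        target = EDM_TO_SCHEMA.get(prop)
        if target is None:
            unmapped.append(prop)
            continue
        bucket = mapped.setdefault(target, [])
        for value in values:
            if value not in bucket:  # several EDM fields may share one target
                bucket.append(value)
    return mapped, sorted(unmapped)


if __name__ == "__main__":
    record = {
        "dc:title": ["Cofre de base rectangular com tampa de quatro face..."],
        "dc:format": ["Altura: 9,5 cm; Profundidade: 10 cm; Comprimento: 19,5 cm"],
        "edm:provider": ["Instituto dos Museus e da Conservação"],
        "edm:dataProvider": ["Museu de Alberto Sampaio"],
    }
    mapped, unmapped = to_schema_org(record)
    print(mapped)
    print("not mapped:", unmapped)
```

Note how `edm:provider` and `edm:dataProvider` both end up under `schema:provider`, and how `dc:format` has no target: the distinction between the two roles and the unmapped field are exactly the kind of detail that is lost in the simpler vocabulary.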
Current implementation
A glimpse of an object’s full data
http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html?format=labels
Anatomy of results from an RDFa parser
http://www.w3.org/2012/pyRdfa/
Several flavors of data in it…
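One way to see the different flavors is to group the parser output by vocabulary prefix. The sketch below assumes the statements are already available as (predicate, value) pairs with prefixed names; the `FLAVORS` table and the function `group_by_flavor` are hypothetical and simply mirror the four groups shown on the following slides.

```python
from collections import defaultdict

# Prefix -> flavor label; an assumption that follows the groupings in this deck.
FLAVORS = {
    "schema": "Schema.org",
    "dc11": "DC / EDM",
    "edm": "DC / EDM",
    "og": "FB OpenGraph",
    "cc": "Creative Commons",
    "xhv": "Creative Commons",
}


def group_by_flavor(statements):
    """Group (predicate, value) pairs by the vocabulary their prefix belongs to."""
    groups = defaultdict(list)
    for predicate, value in statements:
        prefix = predicate.split(":", 1)[0]
        groups[FLAVORS.get(prefix, "Other")].append((predicate, value))
    return dict(groups)


if __name__ == "__main__":
    statements = [
        ("schema:creator", "Desconhecido"),
        ("dc11:creator", "Desconhecido"),
        ("edm:country", "Portugal"),
        ("og:site_name", "Europeana"),
        ("xhv:license", "http://creativecommons.org/publicdomain/mark/1.0/"),
    ]
    for flavor, items in group_by_flavor(statements).items():
        print(flavor, "->", [p for p, _ in items])
```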
Schema.org data
<http://www.europeana.eu/resolve/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B> a schema:CreativeWork;
schema:name "Cofre de base rectangular com tampa de quatro face...";
schema:about "Cofre-relicário";
schema:addressCountry "Portugal";
schema:contentUrl <http://www.matriznet.imcip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=2>;
schema:creator "Desconhecido";
schema:description "Cofre de base rectangular com tampa de quatro faces. A urna é decorada
com um friso perlado que rodeia representações da Virgem com o Menino, do Calvário e da
Virgem em Glória, nas quatro faces. As arestas da tampa são emolduradas por pequenas
caneluras; nas duas faces principais um entrelaçado encerra uma arcaria polilobada.";
schema:image <http://www.matriznet.imcip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=1>;
schema:provider "Instituto dos Museus e da Conservação",
"Museu de Alberto Sampaio";
schema:url <http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html>,
<http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
DC / EDM data
dc11:creator "Desconhecido";
dc11:date "XIII";
dc11:description "Cofre de base rectangular com tampa de quatro faces. […].";
dc11:format "Altura: 9,5 cm; Profundidade: 10 cm; Comprimento: 19,5 cm";
dc11:identifier "MAS O 37";
dc11:rights "Copyright © Instituto dos Museus e da Conservação";
dc11:subject "Cofre-relicário";
dc11:title "Cofre de base rectangular com tampa de quatro face...";
dc11:type "Ourivesaria";
edm:country "Portugal";
edm:rights <http://creativecommons.org/publicdomain/mark/1.0/>;
edm:dataProvider "Museu de Alberto Sampaio";
edm:provider "Instituto dos Museus e da Conservação";
edm:isShownAt <http://www.matriznet.imcip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
edm:isShownBy <http://www.matriznet.imcip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=2>;
edm:landingPage <http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html>;
edm:object <http://www.matriznet.imcip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&Thumbnail
FB OpenGraph data
og:description "Cofre de base rectangular com tampa de quatro faces. A urna é decorada […]";
og:image "http://europeanastatic.eu/api/image?type=IMAGE&uri=http://www.matriznet.imcip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=1&size=FULL_DOC";
og:site_name "Europeana";
og:title "Cofre de base rectangular com tampa de quatro face... | Desconhecido";
og:type "website";
og:url "http://preview.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html" .
Creative Commons data
xhv:license <http://creativecommons.org/publicdomain/mark/1.0/>;
cc:attributionURL <http://www.matriznet.imcip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
cc:morePermissions <http://www.matriznet.imcip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
cc:useGuidelines <http://www.europeana.eu/rights/pd-usage-guide/>;
Observations
• Schema.org is simple
• Not everything can be mapped
• We’re losing granularity, including some of the core benefits of Europeana moving to the richer EDM!
• But it’s ok, because it matches needs
• And in fact it’s not entirely because of Schema.org
• And we can publish different flavors of the data in RDFa
Thank you!
Questions?
Antoine Isaac