Europeana and schema.org
Antoine Isaac
Dublin Core Conference, Schema.org special session
5 September 2013

Europeana Data Model: an example
For a general presentation on Europeana and the EDM rationale, see http://pro.europeana.eu/edm-documentation
• Provided Cultural Heritage Object (CHO) and descriptive metadata
• Web Resources – digital representations
• Aggregations – bundling it all together

Why use schema.org?
Europeana tries to disseminate data to reach as many users as possible
Search engines
• Customization of result lists – rich snippets
• Knowledge Graph
• Search Engine Optimization
Developers are more comfortable with parsing web pages

In fact: schema.org and RDFa
Europeana has been publishing structured metadata via its portal for a while
One application case: customization of public domain pages by Creative Commons, with details on the work and Europeana usage guidelines for public domain works

Europeana, Creative Commons and RDFa
http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html
The Creative Commons Public Domain page triggers a script that harvests mark-up on the Europeana object page

Going further
Creative Commons uses 5-6 fields, agreed upon with the developer(s) there
What to publish further?
Schema.org as a standardized form of more web page-based data exchange
And it has a case – several ones in fact
Still quite prototype-ish

Current EDM – schema.org mapping

EDM element                            Schema.org mapping

ProvidedCHO & Proxy                    schema:CreativeWork
  dc:contributor                       schema:contributor
  dc:creator                           schema:creator
  dc:description                       schema:description
  dc:language                          schema:inLanguage
  dc:publisher                         schema:publisher
  dc:subject                           schema:about
  dc:title                             schema:name
  dcterms:alternative                  schema:alternativeHeadline
  dcterms:created                      schema:dateCreated
  Other ProvidedCHO & Proxy elements listed on the slide:
    dc:coverage, dc:date, dc:format, dc:identifier, dc:relation, dc:rights, dc:source, dc:type,
    dcterms:conformsTo, dcterms:extent, dcterms:hasFormat, dcterms:hasPart, dcterms:hasVersion,
    dcterms:isFormatOf, dcterms:isPartOf, dcterms:isReferencedBy, dcterms:isReplacedBy,
    dcterms:isRequiredBy, dcterms:issued, dcterms:isVersionOf, dcterms:medium, dcterms:provenance,
    dcterms:references, dcterms:replaces, dcterms:requires, dcterms:spatial, dcterms:tableOfContents,
    dcterms:temporal, edm:currentLocation, edm:hasMet, edm:hasType, edm:incorporates,
    edm:isDerivativeOf, edm:isNextInSequence, edm:isRelatedTo, edm:isRepresentationOf,
    edm:isSimilarTo, edm:isSuccessorOf, edm:realizes, edm:type, edm:unstored, edm:wasPresentAt,
    edm:europeanaProxy, edm:userTag, edm:year, ore:proxyFor, ore:proxyIn, owl:sameAs, rdf:type

Aggregation and EuropeanaAggregation
  ore:aggregates                       N/A
  edm:aggregatedCHO                    N/A
  edm:country                          schema:addressCountry
  edm:dataProvider                     schema:provider
  edm:hasView                          schema:url
  edm:isShownAt                        schema:url
  edm:isShownBy                        schema:contentUrl
  edm:landingPage                      schema:url
  edm:object                           schema:image, if preview opt-out is NOT activated
  edm:preview                          schema:thumbnailUrl, if preview opt-out is NOT activated
  edm:provider                         schema:provider
  Other Aggregation elements listed on the slide:
    edm:language, dc:rights, edm:rights, edm:unstored, edm:ugc

WebResource                            schema:WebPage or schema:MediaObject

Agent                                  schema:Person or schema:Organization
  foaf:name                            schema:name
  skos:prefLabel                       schema:name
  skos:altLabel, skos:hiddenLabel      schema:additionalName
  skos:note                            schema:description
  rdaGr2:biographicalInformation       schema:description
  rdaGr2:dateOfBirth                   schema:birthdate
  rdaGr2:dateOfDeath                   schema:deathdate
  rdaGr2:dateOfEstablishment           schema:foundingDate
  rdaGr2:gender                        schema:gender
  rdaGr2:professionOrOccupation        schema:jobTitle
  edm:end                              schema:deathdate
  edm:hasMet                           schema:knows

Place                                  schema:Place
  skos:prefLabel                       schema:name
  skos:altLabel, skos:hiddenLabel      schema:additionalName
  skos:note                            schema:description
  dcterms:isPartOf                     schema:containedIn

Remaining WebResource, Agent and Place entries (row pairing lost in the flattened slide layout)
  EDM elements: dc:description, edm:rights, edm:isRelatedTo, edm:wasPresentAt, owl:sameAs,
    wgs84_pos:alt, wgs84_pos:lat_long, dcterms:hasPart, dcterms:isFormatOf, edm:isNextInSequence,
    edm:begin, wgs84_pos:long, dcterms:hasPart, dcterms:issued, dc:identifier, wgs84_pos:lat,
    dcterms:extent, dc:date, dcterms:conformsTo, dcterms:created, dc:rights, dc:source, dc:format,
    rdaGr2:dateOfTermination, owl:sameAs
  Schema.org targets: schema:additionalType, schema:about, schema:keywords or schema:comment,
    schema:encodesCreativeWork, schema:dateCreated, schema:contentLocation, schema:description,
    schema:url, schema:mentions, schema:datePublished (twice), N/A (several)

Current implementation
A glimpse of an object’s full data
http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html?format=labels

Anatomy of results from an RDFa parser
http://www.w3.org/2012/pyRdfa/
Several flavors of data in it…

Schema.org data
<http://www.europeana.eu/resolve/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B> a schema:CreativeWork;
  schema:name "Cofre de base rectangular com tampa de quatro face...";
  schema:about "Cofre-relicário";
  schema:addressCountry "Portugal";
  schema:contentUrl <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=2>;
  schema:creator "Desconhecido";
  schema:description "Cofre de base rectangular com tampa de quatro faces. A urna é decorada com um friso perlado que rodeia representações da Virgem com o Menino, do Calvário e da Virgem em Glória, nas quatro faces.
As arestas da tampa são emolduradas por pequenas caneluras; nas duas faces principais um entrelaçado encerra uma arcaria polilobada.";
  schema:image <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=1>;
  schema:provider "Instituto dos Museus e da Conservação", "Museu de Alberto Sampaio";
  schema:url <http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html>,
    <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;

DC / EDM data
  dc11:creator "Desconhecido";
  dc11:date "XIII";
  dc11:description "Cofre de base rectangular com tampa de quatro faces. […].";
  dc11:format "Altura: 9,5 cm; Profundidade: 10 cm; Comprimento: 19,5 cm";
  dc11:identifier "MAS O 37";
  dc11:rights "Copyright © Instituto dos Museus e da Conservação";
  dc11:subject "Cofre-relicário";
  dc11:title "Cofre de base rectangular com tampa de quatro face...";
  dc11:type "Ourivesaria";
  edm:country "Portugal";
  edm:rights <http://creativecommons.org/publicdomain/mark/1.0/>;
  edm:dataProvider "Museu de Alberto Sampaio";
  edm:provider "Instituto dos Museus e da Conservação";
  edm:isShownAt <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
  edm:isShownBy <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=2>;
  edm:landingPage <http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html>;
  edm:object <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&Thumbnail

FB OpenGraph data
  og:description "Cofre de base rectangular com tampa de quatro faces. A urna é decorada […]";
  og:image "http://europeanastatic.eu/api/image?type=IMAGE&uri=http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=1&size=FULL_DOC";
  og:site_name "Europeana";
  og:title "Cofre de base rectangular com tampa de quatro face... | Desconhecido";
  og:type "website";
  og:url "http://preview.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html" .

Creative Commons data
  xhv:license <http://creativecommons.org/publicdomain/mark/1.0/>;
  cc:attributionURL <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
  cc:morePermissions <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
  cc:useGuidelines <http://www.europeana.eu/rights/pd-usage-guide/>;

Observations
• Schema.org is simple
• Not everything can be mapped
• We’re losing grain, including some of the core benefits of Europeana moving to the richer EDM!
• But it’s ok, because it matches needs
• And in fact it’s not entirely because of Schema.org
• And we can publish different flavors of the data in RDFa

Thank you! Questions?
Antoine Isaac
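
Sketch: applying the mapping to a record
A minimal Python sketch of how the mapping could be applied, assuming a record is held as a dictionary from EDM property names to lists of values. The table EDM_TO_SCHEMA covers only the pairs that can be checked against the example record in these slides, and the names EDM_TO_SCHEMA and to_schema_org are hypothetical, not part of any Europeana code. Fields outside this subset are returned separately, which illustrates the "losing grain" observation.

```python
from collections import defaultdict

# Illustrative subset of the EDM -> schema.org mapping (an assumption: only
# the pairs visible in the example record, not the full Europeana mapping).
EDM_TO_SCHEMA = {
    "dc:title": "schema:name",
    "dc:subject": "schema:about",
    "dc:creator": "schema:creator",
    "dc:description": "schema:description",
    "edm:country": "schema:addressCountry",
    "edm:provider": "schema:provider",
    "edm:dataProvider": "schema:provider",
    "edm:isShownAt": "schema:url",
    "edm:isShownBy": "schema:contentUrl",
    "edm:landingPage": "schema:url",
}


def to_schema_org(edm_record):
    """Return (mapped, unmapped) for a dict of EDM property -> list of values.

    mapped   : schema.org property -> list of values, duplicates removed
    unmapped : EDM properties that have no target in EDM_TO_SCHEMA
    """
    mapped = defaultdict(list)
    unmapped = []
    for edm_property, values in edm_record.items():
        target = EDM_TO_SCHEMA.get(edm_property)
        if target is None:
            unmapped.append(edm_property)
            continue
        for value in values:
            # Several EDM properties can share one schema.org target.
            if value not in mapped[target]:
                mapped[target].append(value)
    return dict(mapped), unmapped


# Values taken from the example record shown in the slides.
record = {
    "dc:title": ["Cofre de base rectangular com tampa de quatro face..."],
    "dc:subject": ["Cofre-relicário"],
    "dc:type": ["Ourivesaria"],
    "dc:identifier": ["MAS O 37"],
    "edm:country": ["Portugal"],
    "edm:provider": ["Instituto dos Museus e da Conservação"],
    "edm:dataProvider": ["Museu de Alberto Sampaio"],
}

mapped, unmapped = to_schema_org(record)
print(mapped["schema:provider"])
print(unmapped)
```

Both edm:provider and edm:dataProvider end up under schema:provider, as in the parser output above, so the distinction between the two roles is no longer visible on the schema.org side.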