{"id":165,"date":"2011-11-16T14:03:01","date_gmt":"2011-11-16T14:03:01","guid":{"rendered":"http:\/\/217.76.133.156\/?p=165"},"modified":"2012-06-01T14:58:35","modified_gmt":"2012-06-01T14:58:35","slug":"product-scheme-classifications","status":"publish","type":"post","link":"https:\/\/www.josemalvarez.es\/?p=165","title":{"rendered":"Product Scheme Classifications"},"content":{"rendered":"<p>Following with the activities performed to promote the CPV as a linked dataset we have finished the first beta release of new product scheme classifications (PSCs) as linked data in the context of e-procurement. Next diagram shows the ongoing work in the transformation of PSCs (gray ones are not yet transformed):<\/p>\n<p><a href=\"http:\/\/217.76.133.156\/wp-content\/uploads\/2011\/11\/graph1.png\"><img decoding=\"async\" class=\"alignnone size-full wp-image-172\" title=\"Ongoing PSCs\" src=\"http:\/\/217.76.133.156\/wp-content\/uploads\/2011\/11\/graph1.png\" alt=\"\" \/><\/a><\/p>\n<p>The process to promote all these PSCs (more information can be found in\u00a0<a title=\"PSCS Catalogue\" href=\"http:\/\/thedatahub.org\/dataset\/pscs-catalogue\">pscs-catalogue<\/a>\u00a0at <a href=\"http:\/\/thedatahub.org\/\">thedatahub.org<\/a>) have been carried out in a stepwise method (similar to <a href=\"http:\/\/www.w3.org\/2011\/gld\/wiki\/Linked_Data_Cookbook\">http:\/\/www.w3.org\/2011\/gld\/wiki\/Linked_Data_Cookbook<\/a>):<\/p>\n<ol>\n<li>Select the PSCs to be transformed and download the datasource (MSExcel in most of cases)<\/li>\n<li>Model the information about a PSC item using existing vocabularies. If it is required new concepts and relations can be defined such as in CPV case. URI design.<\/li>\n<li>Transform the data using Google Refine<\/li>\n<li>Create the mappings between a PSC and the Product Ontology (custom java-based reconciliator adapted to the descriptions of PSCs items)<\/li>\n<li>Create the mappings between a PSC and the CPV 2008 (custom java-based reconciliator between a source PSC and a target PSC)<\/li>\n<li>Validate mappings and links<\/li>\n<li>Add dataset descriptions using VoID vocabulary<\/li>\n<li>Store in Virtuoso and publish data with Pubby<\/li>\n<\/ol>\n<p>The definition of a PSC item (?<em>product<\/em>) is comprised of the next properties:<\/p>\n<ul>\n<li>URI for datasets<em>:\u00a0<\/em>http:\/\/purl.org\/weso\/pscs\/{<strong>psc<\/strong>}\/{<strong>year|version<\/strong>}\/resource\/ds<\/li>\n<li>URI for resource<em>s: <\/em>http:\/\/purl.org\/weso\/pscs\/{<strong>psc<\/strong>}\/{<strong>year|version<\/strong>}\/resource\/{<strong>id<\/strong>}<\/li>\n<li>URI for classes and propertie<em>s:\u00a0<\/em>http:\/\/purl.org\/weso\/pscs\/{<strong>psc<\/strong>}\/{<strong>year|version<\/strong>}\/ontology\/<\/li>\n<li><em>rdf:type<\/em>\u00a0&lt;<em>pscs:PSCConcept&gt;<\/em> (rdf:type skos:Concept)<\/li>\n<li><em>dcterms:identifier<\/em> &#8220;id&#8221; (the id that is part of the URI)<\/li>\n<li><em>skos:notation<\/em> &#8220;raw id&#8221; (the real id that appears in the data source)<\/li>\n<li><em>skos:prefLabel<\/em>, <em>gr:description<\/em> and <em>rdfs:label<\/em> &#8220;description&#8221;<\/li>\n<li><em>skos:inScheme<\/em> &lt;void:Dataset&gt;, &lt;skos:ConceptScheme&gt;<\/li>\n<li><em>skos:broaderTransitive\/skos:narrowerTransitive<\/em> &lt;PSCConcept&gt; (in some cases the broader of an item can not be inferred using the codes, in that case we have defined a custom property called &#8220;<em>pscs:level<\/em>&#8220;)<\/li>\n<li><em>pscs:relatedMatch<\/em> (mapping between \u00a0?<em>product<\/em> and items of ProductOntology). The next release will include a &#8220;confidence&#8221; value to stablish the weight of matchings.<\/li>\n<li><em>skos:exactMatch<\/em>\u00a0&lt;PSCConcept&gt; (some PSCs have already defined mappings among \u00a0them, we reuse this information)<\/li>\n<li><em>skos:closeMatch<\/em>\u00a0&lt;PSCConcept&gt; (mapping between ?<em>product<\/em> and items of CPV 2008).\u00a0The next release will include a &#8220;confidence&#8221; value to stablish the weight of matchings.<\/li>\n<\/ul>\n<div>The whole linkset of PSCs can be found at\u00a0<a href=\"http:\/\/purl.org\/weso\/pscs\/\">http:\/\/purl.org\/weso\/pscs\/<\/a>\u00a0and we have also extracted out some statistics (PSC void:Dataset, IRI graph and triples):<\/div>\n<div>\n<ul>\n<li><a title=\"CN 2012 Dataset\" href=\"http:\/\/purl.org\/weso\/pscs\/cn\/2012\/resource\/ds\">http:\/\/purl.org\/weso\/pscs\/cn\/2012\/resource\/ds<\/a>,\u00a0http:\/\/purl.org\/weso\/pscs\/cn\/2012,\u00a0<strong>137,484<\/strong><\/li>\n<li><a href=\"http:\/\/purl.org\/weso\/pscs\/cpa\/2008\/resource\/ds\">http:\/\/purl.org\/weso\/pscs\/cpa\/2008\/resource\/ds<\/a>,\u00a0http:\/\/purl.org\/weso\/pscs\/cpa\/2008,\u00a0<strong>92,749<\/strong><\/li>\n<li><a href=\"http:\/\/purl.org\/weso\/pscs\/cpc\/2008\/resource\/ds\">http:\/\/purl.org\/weso\/pscs\/cpc\/2008\/resource\/ds<\/a>,\u00a0http:\/\/purl.org\/weso\/pscs\/cpc\/2008,\u00a0<strong>100,819<\/strong><\/li>\n<li><a href=\"http:\/\/purl.org\/weso\/pscs\/cpv\/2003\/resource\/ds\">http:\/\/purl.org\/weso\/pscs\/cpv\/2003\/resource\/ds<\/a>,\u00a0http:\/\/purl.org\/weso\/pscs\/cpv\/2003,\u00a0<strong>546,135<\/strong><\/li>\n<li><a href=\"http:\/\/purl.org\/weso\/pscs\/cpv\/2008\/resource\/ds\">http:\/\/purl.org\/weso\/pscs\/cpv\/2008\/resource\/ds<\/a>,\u00a0http:\/\/purl.org\/weso\/pscs\/cpv\/2008,\u00a0<strong>803,311<br \/>\n<\/strong><\/li>\n<li><a href=\"http:\/\/purl.org\/weso\/pscs\/isic\/v4\/resource\/ds\">http:\/\/purl.org\/weso\/pscs\/isic\/v4\/resource\/ds<\/a>,\u00a0http:\/\/purl.org\/weso\/pscs\/isic\/v4,\u00a0<strong>18,986<\/strong><\/li>\n<li><a href=\"http:\/\/purl.org\/weso\/pscs\/naics\/2007\/resource\/ds\">http:\/\/purl.org\/weso\/pscs\/naics\/2007\/resource\/ds<\/a>,\u00a0http:\/\/purl.org\/weso\/pscs\/naics\/2007, <strong>36,292<\/strong><\/li>\n<li><a href=\"http:\/\/purl.org\/weso\/pscs\/naics\/2012\/resource\/ds\">http:\/\/purl.org\/weso\/pscs\/naics\/2012\/resource\/ds<\/a>,\u00a0http:\/\/purl.org\/weso\/pscs\/naics\/2012, <strong>35,390<\/strong><\/li>\n<li><a href=\"http:\/\/purl.org\/weso\/pscs\/sitc\/v4\/resource\/ds\">http:\/\/purl.org\/weso\/pscs\/sitc\/v4\/resource\/ds<\/a>,\u00a0http:\/\/purl.org\/weso\/sitc\/v4,\u00a0<strong>70,887<\/strong><\/li>\n<\/ul>\n<div>Try this query<strong>: <a title=\"Example query\" href=\"http:\/\/156.35.31.156\/snorql\/?query=SELECT+DISTINCT+*+WHERE{%0D%0A%3Fproduct+%3Chttp:\/\/purl.org\/weso\/pscs\/ontology\/relatedMatch%3E+%3Chttp:\/\/www.productontology.org\/id\/construction%3E+.%0D%0A%3Fproduct+skos:closeMatch+%3Fcpv.%0D%0A%3Fproduct+skos:prefLabel+%3FproductLabel.%0D%0A%3Fcpv+skos:prefLabel+%3FcpvLabel.%0D%0A%3Fproduct+skos:inScheme+%3Fscheme.%0D%0AFILTER+(%3Fscheme+!%3D+%3Chttp:\/\/purl.org\/weso\/pscs\/cpv\/2008\/resource\/ds%3E).%0D%0AFILTER+(lang(%3FcpvLabel)%3D%22en%22)%0D%0A}+limit+100%0D%0A\">&#8220;Give me 100 products or services related to &#8216;construction&#8217; in any PSC that have a mapping with products or services in CPV 2008 (descriptions in English)&#8221;<\/a><\/strong><\/div>\n<\/div>\n<p><!--more--><\/p>\n<p>The definitions have been made using the vocabularies:<\/p>\n<ul>\n<li><a title=\"SKOS W3C Recommendation\" href=\"http:\/\/www.w3.org\/TR\/skos-reference\/\">SKOS\u00a0<\/a>and\u00a0<a title=\"SKOS XL\" href=\"http:\/\/www.w3.org\/TR\/skos-reference\/skos-xl.html\">SKOS-XL<\/a><\/li>\n<li><a title=\"RDFS\" href=\"http:\/\/www.w3.org\/TR\/rdf-schema\/\">RDFS<\/a><\/li>\n<li><a title=\"Dublin Core\" href=\"http:\/\/dublincore.org\/\">Dublin Core<\/a><\/li>\n<li><a title=\"GoodRelations\" href=\"http:\/\/www.heppnetz.de\/projects\/goodrelations\/\">GoodRelations<\/a><\/li>\n<li><a title=\"Product Ontology\" href=\"http:\/\/www.productontology.org\/\">ProductOntology<\/a><\/li>\n<li><a title=\"VoID vocabulary\" href=\"http:\/\/www.w3.org\/TR\/void\/\">VoID vocabulary<\/a><\/li>\n<\/ul>\n<p>The whole linkset uses links to other datasets (<strong>151,102<\/strong>):<\/p>\n<ul>\n<li>GoodRelations \u00a0and Product Ontology products and descriptions<\/li>\n<\/ul>\n<p>In order to create all this data we have used different tools:<\/p>\n<ul>\n<li><a title=\"Google Refine\" href=\"http:\/\/code.google.com\/p\/google-refine\/\">Google Refine<\/a>\u00a0and the\u00a0<a title=\"RDF extension Google Refine\" href=\"http:\/\/lab.linkeddata.deri.ie\/2010\/grefine-rdf-extension\/\">RDF extension<\/a>\u00a0(to produce data)<\/li>\n<li>Apache\u00a0<a title=\"Apache Lucene\" href=\"http:\/\/lucene.apache.org\/\">Lucene<\/a>\u00a0and\u00a0<a title=\"Solr\" href=\"http:\/\/lucene.apache.org\/solr\/\">Solr<\/a>\u00a0to reconciliate concepts<\/li>\n<li><a title=\"Pubby\" href=\"http:\/\/www4.wiwiss.fu-berlin.de\/pubby\/\">Pubby\u00a0<\/a>(to publish data)<\/li>\n<li><a title=\"Virtuoso\" href=\"http:\/\/virtuoso.openlinksw.com\/\">OpenLink Virtuoso<\/a>\u00a0(to store data)<\/li>\n<\/ul>\n<p>Collaborators:<\/p>\n<ul>\n<li><a title=\"Labra's Home Page\" href=\"http:\/\/www.di.uniovi.es\/%7Elabra\">Jos\u00e9 Emilio Labra<\/a>\u00a0(Main Researcher of\u00a0<a title=\"WESO\" href=\"http:\/\/www.weso.es\">WESO Research Group<\/a>\u00a0at the\u00a0<a title=\"University of Oviedo\" href=\"http:\/\/www.uniovi.es\">University of Oviedo<\/a>)<\/li>\n<li>Jose Luis Mar\u00edn (<a title=\"Euroalert.net\" href=\"http:\/\/euroalert.net\/\">Euroalert.net<\/a>)<\/li>\n<li>The first version of the CPV was developed in conjunction with my colleagues of\u00a0<a title=\"CTIC\" href=\"http:\/\/ct.ctic.es\/web\/contenidos\/es\">CTIC<\/a>: Luis Polo and Emilio Rubiera in 2007.<\/li>\n<\/ul>\n<div>Acknowledgements:<\/div>\n<div>This work is part of\u00a0<a title=\"MOLDEAS\" href=\"http:\/\/purl.org\/weso\/moldeas\/\">MOLDEAS<\/a>\u00a0system developed by the\u00a0<a title=\"WESO\" href=\"http:\/\/www.weso.es\">WESO Research Group<\/a>\u00a0in the partnership project\u00a0<a href=\"http:\/\/rd.10ders.net\/\"><strong>10ders Information Services project<\/strong><\/a>\u00a0partially funded by the\u00a0<a href=\"http:\/\/www.mityc.es\/\">Spanish Ministry of Industry, Tourism and Trade<\/a>\u00a0with code TSI-020100-2010-919 and the\u00a0<a href=\"http:\/\/europa.eu\/legislation_summaries\/employment_and_social_policy\/job_creation_measures\/l60015_es.htm\">European Regional Development Fund (EFDR)<\/a>\u00a0according to the\u00a0<a href=\"http:\/\/www.micinn.es\/portal\/site\/MICINN\/\">National Plan of Scientific Research, Development and Technological Innovation 2008-2011<\/a>, leaded by\u00a0<a href=\"http:\/\/gateway-scs.es\/\">Gateway Strategic Consultancy Services<\/a>\u00a0and developed in cooperation with\u00a0<a href=\"http:\/\/www.exis-ti.com\/\">Exis-TI<\/a>.<\/div>\n<p>Note:<\/p>\n<p><em>The initial version of CPV as linked data is available in order to ensure backward compatibility.<\/em><\/p>\n<p>TO DO List<\/p>\n<ul>\n<li>Example of queries<\/li>\n<li>Confidence value in mappings<\/li>\n<li>Check broken links<\/li>\n<li>Link to other datasets, fix names (case sensitive)<\/li>\n<li>Reconciliate all products and services with the DBPedia resources<\/li>\n<li>Update public procurement notices with the new URIs<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<a href=\"https:\/\/www.josemalvarez.es\/?p=165\" rel=\"bookmark\" title=\"Permalink to Product Scheme Classifications\"><p>Following with the activities performed to promote the CPV as a linked dataset we have finished the first beta release of new product scheme classifications (PSCs) as linked data in the context of e-procurement. Next diagram shows the ongoing work in the transformation of PSCs (gray ones are not yet transformed): The process to promote [&hellip;]<\/p>\n<\/a>","protected":false},"author":1,"featured_media":172,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[59,53,58,57,52,51,54,55,62,45,29,60,64,56,63,61],"class_list":{"0":"post-165","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-datasets","8":"tag-cn","9":"tag-common-procurement-vocabulary","10":"tag-cpa","11":"tag-cpc","12":"tag-cpv","13":"tag-dataset","14":"tag-e-procurement","15":"tag-goodrelations","16":"tag-isic","17":"tag-linkeddata","18":"tag-moldeas","19":"tag-naics","20":"tag-product-scheme-classifications","21":"tag-productontology","22":"tag-psc","23":"tag-sitc","24":"h-entry","25":"hentry"},"_links":{"self":[{"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/posts\/165","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=165"}],"version-history":[{"count":25,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/posts\/165\/revisions"}],"predecessor-version":[{"id":190,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/posts\/165\/revisions\/190"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/media\/172"}],"wp:attachment":[{"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=165"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=165"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=165"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}