DBpedia domains and URIs: Difference between revisions

From Mediawiki1
Jump to navigationJump to search
(width = 400 |)
No edit summary
Line 1: Line 1:
== Long-term goals ==
== Long-term goals ==


* All languages should use IRIs
* All languages use IRIs
* DBpedia chapters


{| class="wikitable"
{| class="wikitable"
Line 17: Line 18:
| <nowiki>http://xx.dbpedia.org/resource/Xyz</nowiki>
| <nowiki>http://xx.dbpedia.org/resource/Xyz</nowiki>
|-
|-
| Property extracted by generic template extractor
| Properties extracted by generic template extractor
| <nowiki>http://xx.dbpedia.org/property/xyz</nowiki>
| <nowiki>http://xx.dbpedia.org/property/xyz</nowiki>
|}
|}
Line 33: Line 34:
| <nowiki>http://dbpedia.org/resource/Xyz</nowiki>
| <nowiki>http://dbpedia.org/resource/Xyz</nowiki>
|-
|-
| Property extracted by generic template extractor from <nowiki>http://en.wikipedia.org/</nowiki> pages
| Properties extracted by generic template extractor from <nowiki>http://en.wikipedia.org/</nowiki> pages
| <nowiki>http://dbpedia.org/property/xyz</nowiki>
| <nowiki>http://dbpedia.org/property/xyz</nowiki>
|}
|}


== Notes ==
* We used to map page titles from non-English Wikipedias to IRIs using the inter-language link to the English Wikipedia. If there was no such inter-language, we did not extract any data from the page. That was because we used URIs like <nowiki>http://dbpedia.org/resource/Xyz</nowiki> for ''all'' languages and had to 'normalize' them. We will ''not'' do that anymore - we use <nowiki>http://xx.dbpedia.org/resource/Xyz</nowiki> IRIs.


==== Implementation details ====
==== Implementation details ====

Revision as of 21:36, 14 May 2012

Long-term goals

  • All languages use IRIs
  • DBpedia chapters
Items IRI pattern
Ontology classes and properties http://dbpedia.org/ontology/Xyz
Datatypes http://dbpedia.org/datatype/xyz
Resource IRI for http://xx.wikipedia.org/wiki/Xyz http://xx.dbpedia.org/resource/Xyz
Properties extracted by generic template extractor http://xx.dbpedia.org/property/xyz

Deviations from these rules for backwards compatibility

  • English and French use URIs, not IRIs. DBpedia Berlin will publish additional datasets with IRIs, but the main datasets will use URIs.
Items IRI pattern
Resource URI for http://en.wikipedia.org/wiki/Xyz http://dbpedia.org/resource/Xyz
Properties extracted by generic template extractor from http://en.wikipedia.org/ pages http://dbpedia.org/property/xyz

Notes

  • We used to map page titles from non-English Wikipedias to IRIs using the inter-language link to the English Wikipedia. If there was no such inter-language, we did not extract any data from the page. That was because we used URIs like http://dbpedia.org/resource/Xyz for all languages and had to 'normalize' them. We will not do that anymore - we use http://xx.dbpedia.org/resource/Xyz IRIs.

Implementation details

  • During the extraction, the framework will use http://xx.dbpedia.org/ IRIs for all languages. Different serializers will serializes them differently, according to some rules
    • convert en.dbpedia.org to dbpedia.org
    • convert IRIs to URIs for some languages (in subjects, predicates and objects)