DBpedia domains and URIs: Difference between revisions
From Mediawiki1
Jump to navigationJump to search
(width = 400 |) |
No edit summary |
||
Line 1: | Line 1: | ||
== Long-term goals == | == Long-term goals == | ||
* All languages | * All languages use IRIs | ||
* DBpedia chapters | |||
{| class="wikitable" | {| class="wikitable" | ||
Line 17: | Line 18: | ||
| <nowiki>http://xx.dbpedia.org/resource/Xyz</nowiki> | | <nowiki>http://xx.dbpedia.org/resource/Xyz</nowiki> | ||
|- | |- | ||
| | | Properties extracted by generic template extractor | ||
| <nowiki>http://xx.dbpedia.org/property/xyz</nowiki> | | <nowiki>http://xx.dbpedia.org/property/xyz</nowiki> | ||
|} | |} | ||
Line 33: | Line 34: | ||
| <nowiki>http://dbpedia.org/resource/Xyz</nowiki> | | <nowiki>http://dbpedia.org/resource/Xyz</nowiki> | ||
|- | |- | ||
| | | Properties extracted by generic template extractor from <nowiki>http://en.wikipedia.org/</nowiki> pages | ||
| <nowiki>http://dbpedia.org/property/xyz</nowiki> | | <nowiki>http://dbpedia.org/property/xyz</nowiki> | ||
|} | |} | ||
== Notes == | |||
* We used to map page titles from non-English Wikipedias to IRIs using the inter-language link to the English Wikipedia. If there was no such inter-language, we did not extract any data from the page. That was because we used URIs like <nowiki>http://dbpedia.org/resource/Xyz</nowiki> for ''all'' languages and had to 'normalize' them. We will ''not'' do that anymore - we use <nowiki>http://xx.dbpedia.org/resource/Xyz</nowiki> IRIs. | |||
==== Implementation details ==== | ==== Implementation details ==== |
Revision as of 21:36, 14 May 2012
Long-term goals
- All languages use IRIs
- DBpedia chapters
Items | IRI pattern |
---|---|
Ontology classes and properties | http://dbpedia.org/ontology/Xyz |
Datatypes | http://dbpedia.org/datatype/xyz |
Resource IRI for http://xx.wikipedia.org/wiki/Xyz | http://xx.dbpedia.org/resource/Xyz |
Properties extracted by generic template extractor | http://xx.dbpedia.org/property/xyz |
Deviations from these rules for backwards compatibility
- English and French use URIs, not IRIs. DBpedia Berlin will publish additional datasets with IRIs, but the main datasets will use URIs.
Items | IRI pattern |
---|---|
Resource URI for http://en.wikipedia.org/wiki/Xyz | http://dbpedia.org/resource/Xyz |
Properties extracted by generic template extractor from http://en.wikipedia.org/ pages | http://dbpedia.org/property/xyz |
Notes
- We used to map page titles from non-English Wikipedias to IRIs using the inter-language link to the English Wikipedia. If there was no such inter-language, we did not extract any data from the page. That was because we used URIs like http://dbpedia.org/resource/Xyz for all languages and had to 'normalize' them. We will not do that anymore - we use http://xx.dbpedia.org/resource/Xyz IRIs.
Implementation details
- During the extraction, the framework will use http://xx.dbpedia.org/ IRIs for all languages. Different serializers will serializes them differently, according to some rules
- convert en.dbpedia.org to dbpedia.org
- convert IRIs to URIs for some languages (in subjects, predicates and objects)