DBpedia datasets: Difference between revisions
No edit summary |
No edit summary |
||
(38 intermediate revisions by 4 users not shown) | |||
Line 1: | Line 1: | ||
<!-- | |||
!!!!!! ATTENTION !!!!!! | |||
TO EDIT THIS PAGE, PLEASE COPY IT TO A TEXT EDITOR | |||
AND USE A FIXED-WIDTH FONT. THE TABLE ROWS BELOW | |||
ARE MUCH MORE READABLE WITH A FIXED-WIDTH FONT. | |||
!!!!!! THANK YOU !!!!!! | |||
--> | |||
In the main DBpedia release, which extractors run for which language? | |||
Combinations that are not explicitly set to '''no''' may be useful but are currently not used in the main DBpedia release. | |||
You can add settings here, but be aware that they will '''not''' automatically have an effect on any extraction process. The real settings are in the repository in [http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/dump/dump/extraction.default.properties dump/extraction.default.properties]. One fine day, the extraction framework will read configuration pages similar to this one from the mappings wiki, but not yet. Not for a while. If you add or change settings here, please also send a message to [https://lists.sourceforge.net/lists/listinfo/dbpedia-developers dbpedia-developers@lists.sourceforge.net]. | |||
This matrix was mostly generated from [http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/dump/dump/extraction.default.properties dump/extraction.default.properties], with some manual additions. | |||
Similar info is available at http://wiki.dbpedia.org/Downloads2014, where '''no''' is represented as --- | |||
{| class="wikitable" | {| class="wikitable" | ||
|- | |- | ||
!Extractor | !Extractor | ||
!Files | |||
!ar!!bg!!bn!!ca!!cs!!de!!el!!en!!es!!et!!eu!!fr!!ga!!hi!!hr!!hu!!it!!id!!ja!!ko!!nl!!pl!!pt!!ru!!sk!!sl!!tr!!ur!!all | |||
|- | |- | ||
| | |[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/AbstractExtractor.scala AbstractExtractor] | ||
|long-abstracts<br/>short-abstracts<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |- | ||
| | |[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/ArticleCategoriesExtractor.scala ArticleCategoriesExtractor] | ||
|article-categories<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |- | ||
| | |[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/CategoryLabelExtractor.scala CategoryLabelExtractor] | ||
|category-labels<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |- | ||
| | |[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/DisambiguationExtractor.scala DisambiguationExtractor] | ||
|disambiguation-links<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
| || || ||yes|| ||yes||yes||yes||yes|| ||yes||yes|| || || || || ||yes|| ||yes|| ||yes||yes||yes|| || || || || | |||
|- | |- | ||
| | |[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/ExternalLinksExtractor.scala ExternalLinksExtractor] | ||
|external-links<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |- | ||
|WikiPageExtractor | |[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/GeoExtractor.scala GeoExtractor] | ||
|geo-coordinates<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/HomepageExtractor.scala HomepageExtractor] | |||
|homepages<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
| || || ||yes|| ||yes||yes||yes||yes|| ||yes||yes||yes|| || || || ||yes|| || || ||yes||yes||yes|| || || || || | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/ImageExtractor.scala ImageExtractor] | |||
|images<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
| || || || || ||yes||yes||yes||yes|| ||yes||yes|| || || || || ||yes|| || || ||yes||yes||yes|| || || || || | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/InfoboxExtractor.scala InfoboxExtractor] | |||
|infoboxes<br/>infobox-properties<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/InterLanguageLinksExtractor.scala InterLanguageLinksExtractor] | |||
|interlanguage-links<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/LabelExtractor.scala LabelExtractor] | |||
|labels<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/MappingExtractor.scala MappingExtractor] | |||
|ontology-types<br/>ontology-properties<br/>specific-properties<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||'''no''' | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/PageIdExtractor.scala PageIdExtractor] | |||
|page-links<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/PageLinksExtractor.scala PageLinksExtractor] | |||
|page-links<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/PersondataExtractor.scala PersondataExtractor] | |||
|persondata<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
| || || || || ||yes|| ||yes|| || || ||yes|| || || || || || || || || || || || || || || || || | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/PndExtractor.scala PndExtractor] | |||
|pnd<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
| || || || || ||yes|| ||yes|| || || ||yes|| || || || || || || || || || || || || || || || || | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/RedirectExtractor.scala RedirectExtractor] | |||
|redirects<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/RevisionIdExtractor.scala RevisionIdExtractor] | |||
|revisions<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/SkosCategoriesExtractor.scala SkosCategoriesExtractor] | |||
|skos-categories<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/TopicalConceptsExtractor.scala TopicalConceptsExtractor] | |||
|topical-concepts<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
| || || ||yes|| || || ||yes||yes|| ||yes||yes|| || || || || || || || || || ||yes||yes|| || || || || | |||
|- | |||
|[https://github.com/dbpedia/extraction-framework/tree/master/core/src/main/scala/org/dbpedia/extraction/mappings/WikiPageExtractor.scala WikiPageExtractor] | |||
|links-to-wikipedia-article<!-- | |||
ar bg bn ca cs de el en es et eu fr ga hi hr hu id it ja ko nl pl pt ru sk sl tr ur all --> | |||
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes | |||
|} | |} |
Latest revision as of 11:29, 25 February 2015
In the main DBpedia release, which extractors run for which language?
Combinations that are not explicitly set to no may be useful but are currently not used in the main DBpedia release.
You can add settings here, but be aware that they will not automatically have an effect on any extraction process. The real settings are in the repository in dump/extraction.default.properties. One fine day, the extraction framework will read configuration pages similar to this one from the mappings wiki, but not yet. Not for a while. If you add or change settings here, please also send a message to dbpedia-developers@lists.sourceforge.net.
This matrix was mostly generated from dump/extraction.default.properties, with some manual additions. Similar info is available at http://wiki.dbpedia.org/Downloads2014, where no is represented as ---
Extractor | Files | ar | bg | bn | ca | cs | de | el | en | es | et | eu | fr | ga | hi | hr | hu | it | id | ja | ko | nl | pl | pt | ru | sk | sl | tr | ur | all |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
AbstractExtractor | long-abstracts short-abstracts |
yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
ArticleCategoriesExtractor | article-categories | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
CategoryLabelExtractor | category-labels | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
DisambiguationExtractor | disambiguation-links | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | |||||||||||||||||
ExternalLinksExtractor | external-links | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
GeoExtractor | geo-coordinates | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
HomepageExtractor | homepages | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | |||||||||||||||||
ImageExtractor | images | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | |||||||||||||||||||
InfoboxExtractor | infoboxes infobox-properties |
yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
InterLanguageLinksExtractor | interlanguage-links | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
LabelExtractor | labels | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
MappingExtractor | ontology-types ontology-properties specific-properties |
yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | no |
PageIdExtractor | page-links | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
PageLinksExtractor | page-links | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
PersondataExtractor | persondata | yes | yes | yes | ||||||||||||||||||||||||||
PndExtractor | pnd | yes | yes | yes | ||||||||||||||||||||||||||
RedirectExtractor | redirects | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
RevisionIdExtractor | revisions | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
SkosCategoriesExtractor | skos-categories | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
TopicalConceptsExtractor | topical-concepts | yes | yes | yes | yes | yes | yes | yes | ||||||||||||||||||||||
WikiPageExtractor | links-to-wikipedia-article | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |