DBpedia datasets

From Mediawiki1
Jump to navigationJump to search

In the main DBpedia release, which extractors run for which language?

Combinations that are not explicitly set to no may be useful but are currently not used in the main DBpedia release.

You can add settings here, but be aware that they will not automatically have an effect on any extraction process. The real settings are in the repository in dump/extraction.default.properties. One fine day, the extraction framework will read configuration pages similar to this one from the mappings wiki, but not yet. Not for a while. If you add or change settings here, please also send a message to dbpedia-developers@lists.sourceforge.net.

This matrix was mostly generated from dump/extraction.default.properties, with some manual additions. Similar info is available at http://wiki.dbpedia.org/Downloads2014, where no is represented as ---

Extractor Files ar bg bn ca cs de el en es et eu fr ga hi hr hu it id ja ko nl pl pt ru sk sl tr ur all
AbstractExtractor long-abstracts
short-abstracts
yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
ArticleCategoriesExtractor article-categories yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
CategoryLabelExtractor category-labels yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
DisambiguationExtractor disambiguation-links yes yes yes yes yes yes yes yes yes yes yes yes
ExternalLinksExtractor external-links yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
GeoExtractor geo-coordinates yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
HomepageExtractor homepages yes yes yes yes yes yes yes yes yes yes yes yes
ImageExtractor images yes yes yes yes yes yes yes yes yes yes
InfoboxExtractor infoboxes
infobox-properties
yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
InterLanguageLinksExtractor interlanguage-links yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
LabelExtractor labels yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
MappingExtractor ontology-types
ontology-properties
specific-properties
yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes no
PageIdExtractor page-links yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
PageLinksExtractor page-links yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
PersondataExtractor persondata yes yes yes
PndExtractor pnd yes yes yes
RedirectExtractor redirects yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
RevisionIdExtractor revisions yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
SkosCategoriesExtractor skos-categories yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
TopicalConceptsExtractor topical-concepts yes yes yes yes yes yes yes
WikiPageExtractor links-to-wikipedia-article yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes