DBpedia datasets

From Mediawiki1
Revision as of 15:37, 15 May 2012 by Chrisahn (talk | contribs)
Jump to navigationJump to search

In the main DBpedia release, which extractors run for which language?

Combinations that are not explicitly set to no may be useful but are currently not used in the main DBpedia release.

You can add settings here, but be aware that they will not automatically have an effect on any extraction process. The real settings are in the repository in dump/extract.default.properties. One fine day, the extraction framework will read configuration pages similar to this one from the mappings wiki, but not yet. Not for a while. If you add or change settings here, please also send a message to dbpedia-developers@lists.sourceforge.net.

This matrix was mostly generated from dump/extract.default.properties, with some manual additions.

Extractor Files ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all
AbstractExtractor long-abstracts
short-abstracts
yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
ArticleCategoriesExtractor article-categories yes
CategoryLabelExtractor category-labels yes
DisambiguationExtractor disambiguation-links yes yes yes yes yes yes yes yes yes
ExternalLinksExtractor external-links yes
GeoExtractor geo-coordinates yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
HomepageExtractor homepages yes yes yes yes yes yes yes yes yes yes yes
ImageExtractor images yes yes yes yes yes yes
InfoboxExtractor infoboxes
infobox-properties
yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
InterLanguageLinksExtractor same-as yes yes yes yes yes
LabelExtractor labels yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
MappingExtractor ontology-types
ontology-properties
specific-properties
yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes no
PageIdExtractor page-links yes
PageLinksExtractor page-links yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
PersondataExtractor persondata yes yes
PndExtractor pnd yes yes
RedirectExtractor redirects yes
RevisionIdExtractor revisions yes
SkosCategoriesExtractor skos-categories yes
WikiPageExtractor links-to-wikipedia-article yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes