DBpedia datasets: Difference between revisions

From Mediawiki1
Jump to navigationJump to search
(AbstractExtractor)
No edit summary
Line 14: Line 14:
|long-abstracts<br/>short-abstracts<!--
|long-abstracts<br/>short-abstracts<!--
  ar  bg  bn  ca  cs  de  el  en  es  eu  fr  ga  hi  hr  hu  it  ja  ko  nl  pl  pt  ru  sl  tr  all  -->
  ar  bg  bn  ca  cs  de  el  en  es  eu  fr  ga  hi  hr  hu  it  ja  ko  nl  pl  pt  ru  sl  tr  all  -->
|   ||   ||   ||   ||   ||   ||   ||yes||   ||   ||   ||   ||   ||   ||   ||   ||   ||   ||   ||   ||   ||   ||   ||   ||  
|yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes||yes
|-
|-
|[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/ArticleCategoriesExtractor.scala ArticleCategoriesExtractor]
|[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/ArticleCategoriesExtractor.scala ArticleCategoriesExtractor]

Revision as of 15:20, 15 May 2012

Which extractors run for which language?

You can add settings here, but be aware that they will not have any effect on any extraction process. The real settings are in the repository in dump/extract.default.properties. One fine day, the extraction framework will read configuration pages similar to this one from the mappings wiki, but not yet. Not for a while.

This matrix was mostly generated from dump/extract.default.properties with some manual additions.

Extractor Files ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all
AbstractExtractor long-abstracts
short-abstracts
yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
ArticleCategoriesExtractor article-categories yes
CategoryLabelExtractor category-labels yes
DisambiguationExtractor disambiguation-links yes yes yes yes yes yes yes yes yes
ExternalLinksExtractor external-links yes
GeoExtractor geo-coordinates yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
HomepageExtractor homepages yes yes yes yes yes yes yes yes yes yes yes
ImageExtractor images yes yes yes yes yes yes
InfoboxExtractor infoboxes
infobox-properties
yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
InterLanguageLinksExtractor same-as yes yes yes yes yes
LabelExtractor labels yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
MappingExtractor ontology-types
ontology-properties
specific-properties
yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
PageIdExtractor page-links yes
PageLinksExtractor page-links yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes
PersondataExtractor persondata yes yes
PndExtractor pnd yes yes
RedirectExtractor redirects yes
RevisionIdExtractor revisions yes
SkosCategoriesExtractor skos-categories yes
WikiPageExtractor links-to-wikipedia-article yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes yes