DBpedia datasets: Difference between revisions
(use some extractors for all languages, not just en)
(fr uses DisambiguationExtractor, HomepageExtractor, ImageExtractor, InterLanguageLinksExtractor, PersondataExtractor, PndExtractor)
Line 31: | Line 31: | ||
|disambiguation-links<!-- | |disambiguation-links<!-- | ||
ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ||
| || || ||yes|| ||yes||yes||yes||yes|| || | | || || ||yes|| ||yes||yes||yes||yes|| ||yes|| || || || ||yes|| || || ||yes||yes||yes|| || || | ||
|- | |- | ||
|[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/ExternalLinksExtractor.scala ExternalLinksExtractor] | |[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/ExternalLinksExtractor.scala ExternalLinksExtractor] | ||
Line 51: | Line 51: | ||
|images<!-- | |images<!-- | ||
ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ||
| || || || || ||yes||yes||yes||yes|| || | | || || || || ||yes||yes||yes||yes|| ||yes|| || || || || || || || || ||yes||yes|| || || | ||
|- | |- | ||
|[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/InfoboxExtractor.scala InfoboxExtractor] | |[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/InfoboxExtractor.scala InfoboxExtractor] | ||
Line 61: | Line 61: | ||
|same-as<!-- | |same-as<!-- | ||
ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ||
| || || || || ||yes||yes||yes|| || || | | || || || || ||yes||yes||yes|| || ||yes|| || || || ||yes|| || || || || ||yes|| || || | ||
|- | |- | ||
|[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/LabelExtractor.scala LabelExtractor] | |[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/LabelExtractor.scala LabelExtractor] | ||
Line 86: | Line 86: | ||
|persondata<!-- | |persondata<!-- | ||
ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ||
| || || || || ||yes|| ||yes|| || || | | || || || || ||yes|| ||yes|| || ||yes|| || || || || || || || || || || || || || | ||
|- | |- | ||
|[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/PndExtractor.scala PndExtractor] | |[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/PndExtractor.scala PndExtractor] | ||
|pnd<!-- | |pnd<!-- | ||
ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | ||
| || || || || ||yes|| ||yes|| || || | | || || || || ||yes|| ||yes|| || ||yes|| || || || || || || || || || || || || || | ||
|- | |- | ||
|[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/RedirectExtractor.scala RedirectExtractor] | |[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/RedirectExtractor.scala RedirectExtractor] |
Revision as of 19:28, 21 May 2012
In the main DBpedia release, which extractors run for which language?
Blank cells mark combinations that are set to neither yes nor no. They may be useful, but they are not currently used in the main DBpedia release.
You can add settings here, but be aware that they will not automatically have an effect on any extraction process. The real settings are in the repository in dump/extract.default.properties. One fine day, the extraction framework will read configuration pages similar to this one from the mappings wiki, but not yet. Not for a while. If you add or change settings here, please also send a message to dbpedia-developers@lists.sourceforge.net.
This matrix was mostly generated from dump/extract.default.properties, with some manual additions.
Extractor | Files | ar | bg | bn | ca | cs | de | el | en | es | eu | fr | ga | hi | hr | hu | it | ja | ko | nl | pl | pt | ru | sl | tr | all |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
AbstractExtractor | long-abstracts, short-abstracts | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
ArticleCategoriesExtractor | article-categories | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
CategoryLabelExtractor | category-labels | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
DisambiguationExtractor | disambiguation-links | | | | yes | | yes | yes | yes | yes | | yes | | | | | yes | | | | yes | yes | yes | | | |
ExternalLinksExtractor | external-links | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
GeoExtractor | geo-coordinates | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
HomepageExtractor | homepages | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | ||||||||||||||
ImageExtractor | images | | | | | | yes | yes | yes | yes | | yes | | | | | | | | | | yes | yes | | | |
InfoboxExtractor | infoboxes, infobox-properties | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
InterLanguageLinksExtractor | same-as | | | | | | yes | yes | yes | | | yes | | | | | yes | | | | | | yes | | | |
LabelExtractor | labels | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
MappingExtractor | ontology-types, ontology-properties, specific-properties | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | no |
PageIdExtractor | page-ids | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
PageLinksExtractor | page-links | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
PersondataExtractor | persondata | | | | | | yes | | yes | | | yes | | | | | | | | | | | | | | |
PndExtractor | pnd | | | | | | yes | | yes | | | yes | | | | | | | | | | | | | | |
RedirectExtractor | redirects | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
RevisionIdExtractor | revisions | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
SkosCategoriesExtractor | skos-categories | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
WikiPageExtractor | links-to-wikipedia-article | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
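
The page says this matrix was mostly generated from dump/extract.default.properties. A minimal sketch of how such a generator might work is shown here. The key layout (`extractors=` for all languages, `extractors.<lang>=` for per-language additions), the sample values, and the function names `parse_extractors`, `build_matrix` and `format_row` are all assumptions for illustration; they are not copied from the repository and may not match the real file.

```python
# Hypothetical sample input. The key names and layout are assumed for
# illustration and are not taken from dump/extract.default.properties.
SAMPLE = """\
# extractors that run for every language
extractors=LabelExtractor,PageLinksExtractor
# additional extractors for single languages
extractors.de=PersondataExtractor,PndExtractor
extractors.fr=PersondataExtractor
"""


def parse_extractors(text):
    """Split properties-style text into (common, per_language)."""
    common, per_language = [], {}
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and anything that is not key=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        names = [n.strip() for n in value.split(",") if n.strip()]
        if key == "extractors":
            common.extend(names)
        elif key.startswith("extractors."):
            lang = key[len("extractors."):]
            per_language.setdefault(lang, []).extend(names)
    return common, per_language


def build_matrix(common, per_language, languages):
    """Map each extractor name to {language: "yes" or ""}."""
    matrix = {name: {lang: "yes" for lang in languages} for name in common}
    for lang, names in per_language.items():
        if lang not in languages:
            continue
        for name in names:
            row = matrix.setdefault(name, {l: "" for l in languages})
            row[lang] = "yes"
    return matrix


def format_row(row, languages):
    """Render one matrix row as a wikitext table line, blank cells as a space."""
    return "|" + "||".join(row[lang] or " " for lang in languages)


if __name__ == "__main__":
    languages = ["de", "en", "fr"]
    common, per_language = parse_extractors(SAMPLE)
    matrix = build_matrix(common, per_language, languages)
    for name in sorted(matrix):
        print(name)
        print(format_row(matrix[name], languages))
```

A script like this only regenerates the wiki table. As noted above, editing the table has no effect on any extraction process.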