DBpedia datasets: Difference between revisions
From Mediawiki1
Jump to navigationJump to search
No edit summary |
(AbstractExtractor) |
||
Line 3: | Line 3: | ||
You can add settings here, but be aware that they will '''not''' have any effect on any extraction process. The real settings are in the repository in [http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/dump/extract.default.properties dump/extract.default.properties]. One fine day, the extraction framework will read configuration pages similar to this one from the mappings wiki, but not yet. Not for a while. | You can add settings here, but be aware that they will '''not''' have any effect on any extraction process. The real settings are in the repository in [http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/dump/extract.default.properties dump/extract.default.properties]. One fine day, the extraction framework will read configuration pages similar to this one from the mappings wiki, but not yet. Not for a while. | ||
This matrix was generated from [http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/dump/extract.default.properties dump/extract.default.properties]. | This matrix was mostly generated from [http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/dump/extract.default.properties dump/extract.default.properties] with some manual additions. | ||
{| class="wikitable" | {| class="wikitable" | ||
Line 10: | Line 10: | ||
!Files | !Files | ||
!ar!!bg!!bn!!ca!!cs!!de!!el!!en!!es!!eu!!fr!!ga!!hi!!hr!!hu!!it!!ja!!ko!!nl!!pl!!pt!!ru!!sl!!tr!!all | !ar!!bg!!bn!!ca!!cs!!de!!el!!en!!es!!eu!!fr!!ga!!hi!!hr!!hu!!it!!ja!!ko!!nl!!pl!!pt!!ru!!sl!!tr!!all | ||
|- | |||
|[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/AbstractExtractor.scala AbstractExtractor] | |||
|long-abstracts<br/>short-abstracts<!-- | |||
ar bg bn ca cs de el en es eu fr ga hi hr hu it ja ko nl pl pt ru sl tr all --> | |||
| || || || || || || ||yes|| || || || || || || || || || || || || || || || || | |||
|- | |- | ||
|[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/ArticleCategoriesExtractor.scala ArticleCategoriesExtractor] | |[http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/default/core/src/main/scala/org/dbpedia/extraction/mappings/ArticleCategoriesExtractor.scala ArticleCategoriesExtractor] |
Revision as of 15:20, 15 May 2012
Which extractors run for which language?
You can add settings here, but be aware that they will not have any effect on any extraction process. The real settings are in the repository in dump/extract.default.properties. One fine day, the extraction framework will read configuration pages similar to this one from the mappings wiki, but not yet. Not for a while.
This matrix was mostly generated from dump/extract.default.properties with some manual additions.
Extractor | Files | ar | bg | bn | ca | cs | de | el | en | es | eu | fr | ga | hi | hr | hu | it | ja | ko | nl | pl | pt | ru | sl | tr | all |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
AbstractExtractor | long-abstracts short-abstracts |
yes | ||||||||||||||||||||||||
ArticleCategoriesExtractor | article-categories | yes | ||||||||||||||||||||||||
CategoryLabelExtractor | category-labels | yes | ||||||||||||||||||||||||
DisambiguationExtractor | disambiguation-links | yes | yes | yes | yes | yes | yes | yes | yes | yes | ||||||||||||||||
ExternalLinksExtractor | external-links | yes | ||||||||||||||||||||||||
GeoExtractor | geo-coordinates | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
HomepageExtractor | homepages | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | ||||||||||||||
ImageExtractor | images | yes | yes | yes | yes | yes | yes | |||||||||||||||||||
InfoboxExtractor | infoboxes infobox-properties |
yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
InterLanguageLinksExtractor | same-as | yes | yes | yes | yes | yes | ||||||||||||||||||||
LabelExtractor | labels | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
MappingExtractor | ontology-types ontology-properties specific-properties |
yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | |
PageIdExtractor | page-links | yes | ||||||||||||||||||||||||
PageLinksExtractor | page-links | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |
PersondataExtractor | persondata | yes | yes | |||||||||||||||||||||||
PndExtractor | pnd | yes | yes | |||||||||||||||||||||||
RedirectExtractor | redirects | yes | ||||||||||||||||||||||||
RevisionIdExtractor | revisions | yes | ||||||||||||||||||||||||
SkosCategoriesExtractor | skos-categories | yes | ||||||||||||||||||||||||
WikiPageExtractor | links-to-wikipedia-article | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes |