Use the DBpedia Extraction Framework
Latest revision as of 11:33, 7 July 2011
Once there are infobox and/or table mappings for a language, you can run the DBpedia extraction. Several things have to be installed and configured first, as documented at
http://dbpedia.org/documentation
- Section 1 describes what has to be installed to run the DBpedia extraction framework.
- Section 4.1 lists everything that must be specified before starting the extraction from a dump file. In the file "dump/config.properties" (created using the file "dump/config.properties.default" as a template), you can specify the languages to extract and the extractors to use. For example, to run the HomepageExtractor and the MappingExtractor for Maltese, specify:
    languages=mt
    extractors.mt=org.dbpedia.extraction.mappings.HomepageExtractor \
    org.dbpedia.extraction.mappings.MappingExtractor
- When you run the extraction (see section 4.2), the MappingExtractor extracts the information from the infoboxes for which you created a mapping. The extracted triples are saved in a file named "mappingbased_properties_mt.nt" (for Maltese) in the output directory you specified.
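
To check that your mappings produced output, it can help to count how often each property occurs in the resulting file. The following Python sketch is not part of the DBpedia framework. It assumes the output is line-based N-Triples in which every subject and predicate is an IRI in angle brackets, and the function names are hypothetical helpers chosen for illustration.

```python
import re
from collections import Counter

# Matches one N-Triples statement: <subject> <predicate> object .
# Assumption: subjects and predicates are IRIs in angle brackets
# (blank-node subjects are not handled by this simple pattern).
TRIPLE = re.compile(r'^<([^>]*)>\s+<([^>]*)>\s+(.*?)\s*\.\s*$')


def count_predicates(lines):
    """Count how often each predicate IRI occurs in an iterable of N-Triples lines."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        # Skip empty lines and comment lines.
        if not line or line.startswith('#'):
            continue
        match = TRIPLE.match(line)
        if match:
            counts[match.group(2)] += 1
    return counts


def count_predicates_in_file(path):
    """Read an .nt file (assumed UTF-8 or ASCII) and count its predicates."""
    with open(path, encoding='utf-8') as handle:
        return count_predicates(handle)
```

For example, calling count_predicates_in_file with the path to "mappingbased_properties_mt.nt" in your output directory returns a Counter keyed by property IRI. An empty result suggests that no infobox on the dumped pages matched one of your mappings.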