Use the DBpedia Extraction Framework

Revision as of 11:32, 7 July 2011

Once infobox and/or table mappings exist for a language, you can run the DBpedia extraction. Several things have to be installed and configured first; this is documented at http://dbpedia.org/documentation

  • Section 1 describes what has to be installed to run the DBpedia extraction framework.
  • Section 4.1 lists everything that must be specified before starting the extraction from a dump file. In the file "dump/config.properties" (using the file "dump/config.properties.default" as a template), you specify the languages to extract and the extractors to use for each of them. For example, to run the HomepageExtractor and the MappingExtractor for Maltese, specify
languages=mt
extractors.mt=org.dbpedia.extraction.mappings.HomepageExtractor \
              org.dbpedia.extraction.mappings.MappingExtractor
  • When you run the extraction (see section 4.2), the MappingExtractor extracts the information from the infoboxes for which you created a mapping. The extracted triples are saved in a file named "mappingbased_properties_mt.nt" (for Maltese) in the output directory you specified.
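
Before starting a run, it can help to check what the configuration in the list above actually says. The following Python script is a minimal sketch and is not part of the DBpedia extraction framework: the function names parse_properties and extractors_by_language are hypothetical, the parser covers only a small subset of the Java properties format (key=value pairs, "#" comments, backslash line continuations), and it assumes that several languages would be separated by commas and that extractor class names are separated by whitespace, as in the Maltese example.

```python
def parse_properties(text):
    """Parse a small subset of the Java .properties format.

    Supported: key=value pairs, '#' comment lines, and a trailing
    backslash that continues the value on the next line.
    """
    props = {}
    logical = ""  # accumulates one logical line across continuations
    for raw in text.splitlines():
        line = raw.lstrip()
        # Skip blank lines and comments, unless we are inside a continuation.
        if not logical and (not line.strip() or line.startswith("#")):
            continue
        if line.endswith("\\"):
            logical += line[:-1]  # drop the backslash, keep reading
            continue
        logical += line
        key, _, value = logical.partition("=")
        props[key.strip()] = value.strip()
        logical = ""
    return props


def extractors_by_language(props):
    """Map each configured language code to its list of extractor classes.

    Assumption: multiple languages are comma-separated, and extractor
    class names are whitespace-separated.
    """
    codes = [c.strip() for c in props.get("languages", "").split(",") if c.strip()]
    return {code: props.get("extractors." + code, "").split() for code in codes}


# The Maltese example from the list above.
SAMPLE = """\
languages=mt
extractors.mt=org.dbpedia.extraction.mappings.HomepageExtractor \\
              org.dbpedia.extraction.mappings.MappingExtractor
"""

if __name__ == "__main__":
    for code, extractors in extractors_by_language(parse_properties(SAMPLE)).items():
        print(code, extractors)
```

To check a real configuration, read "dump/config.properties" into a string and pass it to parse_properties in place of SAMPLE.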