Use the DBpedia Extraction Framework

From Mediawiki1
Revision as of 11:32, 7 July 2011 by Maxjakob (talk | contribs)
Jump to navigationJump to search

Once that there are infobox and/or table mappings for a language, you can run the DBpedia extraction. Several things have to be installed and configured, which is documented at http://dbpedia.org/documentation

  • Section 1 describes what has to be installed to run the DBpedia extraction framework.
  • In section 4.1., all things that must be specified before starting the extraction from a dump file are listed. In the file "dump/config.properties" (using the file "dump/config.properties.default" as a template), you can specify the languages for which you want to extract, and which extractors should be used. For example, to run the HomepageExtractor and the MappingExtractor for Maltese, specify
languages=mt
extractors.mt=org.dbpedia.extraction.mappings.HomepageExtractor \
              org.dbpedia.extraction.mappings.MappingExtractor
  • When you run the extraction (see section 4.2.), the MappingExtractor will extract the information from the infoboxes that you created a mapping for. The extracted triples will be saved in a file named "mappingbased_properties_mt.nt" (for Maltese) in the output directory you specified.