How to add a mapping namespace: Difference between revisions
No edit summary |
No edit summary |
||
Line 1: | Line 1: | ||
= How to create a mapping namespace for a new language = | |||
As an example, we use a fictitious language with code "xx" and Wikipedia rank 44. | As an example, we use a fictitious language with code "xx" and Wikipedia rank 44. | ||
Line 23: | Line 23: | ||
==== edit core/org.dbpedia.extraction.wikiparser.Namespace.scala ==== | ==== edit core/org.dbpedia.extraction.wikiparser.Namespace.scala ==== | ||
add something like this at the appropriate | add something like this at the appropriate position in the code | ||
<pre> | <pre> |
Revision as of 01:11, 16 May 2012
How to create a mapping namespace for a new language
As an example, we use a fictitious language with code "xx" and Wikipedia rank 44.
Note: more code changes will be needed for the first language code that contains a dash "-".
Get language code and rank
Get the wiki language code and rank from http://s23.org/wikistats/wikipedias_html.php
namespace number: multiply the rank by 2 and add 200
talk namespace number: add 1 to the namespace number
Example: language code "xx", rank 44, namespace number 288, talk namespace number 289
CAUTION: If the calculated namespace number already exists for another language (because the ranking has changed) do NOT change the existing namespace number. Please find a neighboring or close enough number that works.
If 288 is in use, we choose other numbers, let’s say 298 and 299
Update the extraction framework
edit core/org.dbpedia.extraction.wikiparser.Namespace.scala
add something like this at the appropriate position in the code
"xx"->288
Commit and push the changes to default branch.
Update and restart the process that's serving http://mappings.dbpedia.org/server/ URLs
Log onto the appropriate machine.
Stop the server:
ps axfu | grep java
Look for class ...server.Server, and then:
kill <process id>
Then update, compile and start the server:
cd /home/dbpedia-server/dbpedia/extraction_framework hg pull hg update mvn clean install --projects core,server cd server ../run server &>server-<YYYY>-<MM>-<DD>.01.log &
update mappings wiki
TODO: add LocalSettings.php to some source repo!!! a) log onto www5 (160.45.137.86)
with user name "Administrator" open C:\Program Files (x86)\Apache Software Foundation\Apache2.2\htdocs\mappings\LocalSettings.php add the following lines at the right position in the code “xx"=> 288 restart the Apache server b) edit http://mappings.dbpedia.org/index.php/MediaWiki:Sidebar
add a link at the right position in the ranking with the right number and language code c) edit http://mappings.dbpedia.org/index.php/Template:Class, http://mappings.dbpedia.org/index.php/Template:Datatype, http://mappings.dbpedia.org/index.php/Template:DatatypeProperty, http://mappings.dbpedia.org/index.php/Template:ObjectProperty
add two lines for label@xx
d) edit http://mappings.dbpedia.org/index.php/Mapping_Statistics and http://mappings.dbpedia.org/index.php/DBpedia_datasets
e) generate statistics for new language run RedirectExtractor, InfoboxExtractor, TemplateParameterExtractor (see dump/extraction.server.properties) run CreateMappingStats (launcher ‘stats’ in server/pom.xml) copy src/main/statistics/mappingstatistics_bg.txt to same folder on server Update and deploy sprint stuff.