Hi,

The problem is not the XML namespace.

> Caused by: java.util.NoSuchElementException: key not found: 710
>         at scala.collection.MapLike$class.default(MapLike.scala:225)
>         at scala.collection.immutable.HashMap.default(HashMap.scala:38)
>         at scala.collection.MapLike$class.apply(MapLike.scala:135)
>         at scala.collection.immutable.HashMap.apply(HashMap.scala:38)
>         at org.dbpedia.extraction.sources.WikipediaDumpParser.readPage(WikipediaDumpParser.java:224)

The problem is that en.wiki recently added MediaWiki namespace code
710 [1], which is not included in our configuration [2] yet. At line 224
of WikipediaDumpParser.java [3], we take the namespace number found in
the Wikipedia XML dump and look it up in our configuration. That lookup
throws the NoSuchElementException shown above if the namespace code is
unknown.
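
To illustrate the failure mode, here is a minimal sketch in Java. It is
not the actual DBpedia code: the class NamespaceLookupSketch, its methods
and the small KNOWN table are hypothetical, and a java.util.Map stands in
for the Scala HashMap seen in the stack trace. The strict lookup fails on
an unknown code the same way the trace does; the lenient variant shows
one way a caller could detect an unknown code instead of crashing.

```java
import java.util.Map;
import java.util.NoSuchElementException;
import java.util.Optional;

// Hypothetical stand-in for the namespace lookup; not the actual DBpedia code.
final class NamespaceLookupSketch {

    // Illustrative subset of namespace codes. The real table is generated
    // into Namespaces.scala [2]; code 710 is deliberately missing here.
    static final Map<Integer, String> KNOWN =
            Map.of(0, "Main", 1, "Talk", 10, "Template", 14, "Category");

    // Strict lookup: behaves like calling apply() on a Scala Map,
    // which throws NoSuchElementException for a missing key.
    static String lookupStrict(int code) {
        String name = KNOWN.get(code);
        if (name == null) {
            throw new NoSuchElementException("key not found: " + code);
        }
        return name;
    }

    // Lenient lookup: returns an empty Optional for an unknown code,
    // leaving it to the caller to skip the page or report the problem.
    static Optional<String> lookupLenient(int code) {
        return Optional.ofNullable(KNOWN.get(code));
    }

    public static void main(String[] args) {
        System.out.println(lookupLenient(0));
        System.out.println(lookupLenient(710));
        try {
            lookupStrict(710);
        } catch (NoSuchElementException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Regenerating the configuration as described below is still the actual
fix; the lenient variant only shows how an unknown code could be
detected without aborting the whole run.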

To update the configuration, please cd to core/ and run

../clean-install-run generate-settings

This will download the current MediaWiki namespace settings from all
Wikipedia editions, which may take a few minutes. Then run the
extraction again.

Hope this helps!

I'll try to update the configuration in the Mercurial repository soon.

Cheers,
Christopher

[1] http://en.wikipedia.org/w/api.php?action=query&meta=siteinfo&siprop=namespaces
[2] http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/331f8aba5f7d/core/src/main/scala/org/dbpedia/extraction/wikiparser/impl/wikipedia/Namespaces.scala
[3] http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/331f8aba5f7d/core/src/main/java/org/dbpedia/extraction/sources/WikipediaDumpParser.java#l224

On Mon, Nov 19, 2012 at 3:50 PM, Batica Dzonic <[email protected]> wrote:
>
> Wikipedia again changed XML scheme from
> http://www.mediawiki.org/xml/export-0.7/ to 0.8...
> I had the following problem with the dump extraction Loading Redirect From
> source (en)
>
> found by dbpedia:   [title=Girls' Generation II ~Girls & Peace~;ns=0/Main/;language:wiki=en,locale=en].
> found by wikipedia: [title=Girls' Generation II: Girls & Peace;ns=0/Main/;language:wiki=en,locale=en]
> Nov 19, 2012 2:14:46 PM org.dbpedia.extraction.mappings.Redirects$RedirectFinder apply
> WARNING: wrong redirect. page: [title=萬琲臻;ns=0/Main/;language:wiki=en,locale=en].
> found by dbpedia:   [title=Sharon Wan;ns=0/Main/;language:wiki=en,locale=en].
> found by wikipedia: [null]
> Nov 19, 2012 2:14:46 PM org.dbpedia.extraction.mappings.Redirects$RedirectFinder apply
> WARNING: wrong redirect. page: [title=John Courthope (disambiguation);ns=0/Main/;language:wiki=en,locale=en].
> found by dbpedia:   [title=John Courthope;ns=0/Main/;language:wiki=en,locale=en].
> found by wikipedia: [null]
> java.lang.reflect.InvocationTargetException
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:601)
>         at org_scala_tools_maven_executions.MainHelper.runMain(MainHelper.java:161)
>         at org_scala_tools_maven_executions.MainWithArgsInFile.main(MainWithArgsInFile.java:26)
> Caused by: java.util.NoSuchElementException: key not found: 710
>         at scala.collection.MapLike$class.default(MapLike.scala:225)
>         at scala.collection.immutable.HashMap.default(HashMap.scala:38)
>         at scala.collection.MapLike$class.apply(MapLike.scala:135)
>         at scala.collection.immutable.HashMap.apply(HashMap.scala:38)
>         at org.dbpedia.extraction.sources.WikipediaDumpParser.readPage(WikipediaDumpParser.java:224)
>         at org.dbpedia.extraction.sources.WikipediaDumpParser.readPages(WikipediaDumpParser.java:185)
>         at org.dbpedia.extraction.sources.WikipediaDumpParser.readDump(WikipediaDumpParser.java:143)
>         at org.dbpedia.extraction.sources.WikipediaDumpParser.run(WikipediaDumpParser.java:114)
>         at org.dbpedia.extraction.sources.XMLReaderSource.foreach(XMLSource.scala:57)
>         at scala.collection.TraversableLike$class.flatMap(TraversableLike.scala:239)
>         at org.dbpedia.extraction.sources.XMLReaderSource.flatMap(XMLSource.scala:53)
>         at org.dbpedia.extraction.mappings.Redirects$.loadFromSource(Redirects.scala:165)
>         at org.dbpedia.extraction.mappings.Redirects$.load(Redirects.scala:116)
>         at org.dbpedia.extraction.dump.extract.ConfigLoader$$anon$1.<init>(ConfigLoader.scala:100)
>         at org.dbpedia.extraction.dump.extract.ConfigLoader.org$dbpedia$extraction$dump$extract$ConfigLoader$$createExtractionJob(ConfigLoader.scala:55)
>         at org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:40)
>         at org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:40)
>         at scala.collection.Iterator$$anon$19.next(Iterator.scala:401)
>         at scala.collection.Iterator$class.foreach(Iterator.scala:772)
>         at scala.collection.Iterator$$anon$19.foreach(Iterator.scala:399)
>         at scala.collection.IterableViewLike$Transformed$class.foreach(IterableViewLike.scala:41)
>         at scala.collection.IterableViewLike$$anon$3.foreach(IterableViewLike.scala:80)
>         at org.dbpedia.extraction.dump.extract.Extraction$.main(Extraction.scala:27)
>         at org.dbpedia.extraction.dump.extract.Extraction.main(Extraction.scala)
>         ... 6 more
> [INFO] ------------------------------------------------------------------------
> [ERROR] BUILD ERROR
> [INFO] ------------------------------------------------------------------------
> [INFO] wrap: org.apache.commons.exec.ExecuteException: Process exited with an error: 240(Exit value: 240)
>
> [INFO] ------------------------------------------------------------------------
> [INFO] Trace
> org.apache.maven.lifecycle.LifecycleExecutionException: wrap: org.apache.commons.exec.ExecuteException: Process exited with an error: 240(Exit value: 240)
>         at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:719)
>         at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeStandaloneGoal(DefaultLifecycleExecutor.java:569)
>         at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoal(DefaultLifecycleExecutor.java:539)
>         at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoalAndHandleFailures(DefaultLifecycleExecutor.java:387)
>         at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeTaskSegments(DefaultLifecycleExecutor.java:348)
>         at org.apache.maven.lifecycle.DefaultLifecycleExecutor.execute(DefaultLifecycleExecutor.java:180)
>         at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:328)
>         at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:138)
>         at org.apache.maven.cli.MavenCli.main(MavenCli.java:362)
>         at org.apache.maven.cli.compat.CompatibleMain.main(CompatibleMain.java:60)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:601)
>         at org.codehaus.classworlds.Launcher.launchEnhanced(Launcher.java:315)
>         at org.codehaus.classworlds.Launcher.launch(Launcher.java:255)
>         at org.codehaus.classworlds.Launcher.mainWithExitCode(Launcher.java:430)
>         at org.codehaus.classworlds.Launcher.main(Launcher.java:375)
> Caused by: org.apache.maven.plugin.MojoExecutionException: wrap: org.apache.commons.exec.ExecuteException: Process exited with an error: 240(Exit value: 240)
>         at org_scala_tools_maven.ScalaMojoSupport.execute(ScalaMojoSupport.java:350)
>         at org.apache.maven.plugin.DefaultPluginManager.executeMojo(DefaultPluginManager.java:490)
>         at org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:694)
>         ... 17 more
> Caused by: org.apache.commons.exec.ExecuteException: Process exited with an error: 240(Exit value: 240)
>         at org.apache.commons.exec.DefaultExecutor.executeInternal(DefaultExecutor.java:346)
>         at org.apache.commons.exec.DefaultExecutor.execute(DefaultExecutor.java:149)
>         at org.apache.commons.exec.DefaultExecutor.execute(DefaultExecutor.java:136)
>         at org_scala_tools_maven_executions.JavaMainCallerByFork.run(JavaMainCallerByFork.java:80)
>         at org_scala_tools_maven_executions.JavaMainCallerSupport.run(JavaMainCallerSupport.java:96)
>         at org_scala_tools_maven.ScalaRunMojo.doExecute(ScalaRunMojo.java:105)
>         at org_scala_tools_maven.ScalaMojoSupport.execute(ScalaMojoSupport.java:342)
>         ... 19 more
> [INFO] ------------------------------------------------------------------------
> [INFO] Total time: 37 minutes 58 seconds
> [INFO] Finished at: Mon Nov 19 14:14:46 UTC 2012
> [INFO] Final Memory: 19M/46M
> [INFO] ------------------------------------------------------------------------
>
>
>
>
> Has anyone encountered this problem or know how to skip this?
>
>
> ------------------------------------------------------------------------------
> Monitor your physical, virtual and cloud infrastructure from a single
> web console. Get in-depth insight into apps, servers, databases, vmware,
> SAP, cloud infrastructure, etc. Download 30-day Free Trial.
> Pricing starts from $795 for 25 servers or applications!
> http://p.sf.net/sfu/zoho_dev2dev_nov
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
