Hi Jona, Batica,
I ran into the same "*java.util.NoSuchElementException: key not found: 710*"
problem.
As Jona suggested, I tried running "../clean-install-run generate-settings"
from ".../core" but the 710 still raises an exception when running "../run
extraction"
(I'm running on the English wikipedia dump, with
"extractors=RedirectExtractor,InterLanguageLinksExtractor,MappingExtractor")
BTW, the 710 exception raises after 70 minutes of processing! (i.e. it's
not something that happens in the first stages of extraction)
Have any of you managed to solve it yet?
Any idea what to try next?
Thanks,
Omri
p.s. this is the output:
[...]
Nov 22, 2012 7:24:50 PM
org.dbpedia.extraction.mappings.Redirects$RedirectFinder apply
WARNING: wrong redirect. page: [title=John Courthope
(disambiguation);ns=0/Main/;language:wiki=en,locale=en].
found by dbpedia: [title=John
Courthope;ns=0/Main/;language:wiki=en,locale=en].
found by wikipedia: [null]
java.lang.reflect.InvocationTargetException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at
org_scala_tools_maven_executions.MainHelper.runMain(MainHelper.java:161)
at
org_scala_tools_maven_executions.MainWithArgsInFile.main(MainWithArgsInFile.java:26)
Caused by: java.util.NoSuchElementException: key not found: 710
at scala.collection.MapLike$class.default(MapLike.scala:225)
at scala.collection.immutable.HashMap.default(HashMap.scala:38)
at scala.collection.MapLike$class.apply(MapLike.scala:135)
at scala.collection.immutable.HashMap.apply(HashMap.scala:38)
at
org.dbpedia.extraction.sources.WikipediaDumpParser.readPage(WikipediaDumpParser.java:224)
at
org.dbpedia.extraction.sources.WikipediaDumpParser.readPages(WikipediaDumpParser.java:185)
at
org.dbpedia.extraction.sources.WikipediaDumpParser.readDump(WikipediaDumpParser.java:143)
at
org.dbpedia.extraction.sources.WikipediaDumpParser.run(WikipediaDumpParser.java:114)
at
org.dbpedia.extraction.sources.XMLReaderSource.foreach(XMLSource.scala:64)
at
scala.collection.TraversableLike$class.flatMap(TraversableLike.scala:239)
at
org.dbpedia.extraction.sources.XMLReaderSource.flatMap(XMLSource.scala:60)
at
org.dbpedia.extraction.mappings.Redirects$.loadFromSource(Redirects.scala:165)
at
org.dbpedia.extraction.mappings.Redirects$.load(Redirects.scala:116)
at
org.dbpedia.extraction.dump.extract.ConfigLoader$$anon$1.<init>(ConfigLoader.scala:96)
at org.dbpedia.extraction.dump.extract.ConfigLoader.org
$dbpedia$extraction$dump$extract$ConfigLoader$$createExtractionJob(ConfigLoader.scala:51)
at
org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:36)
at
org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:36)
at scala.collection.Iterator$$anon$19.next(Iterator.scala:401)
at scala.collection.Iterator$class.foreach(Iterator.scala:772)
at scala.collection.Iterator$$anon$19.foreach(Iterator.scala:399)
at
scala.collection.IterableViewLike$Transformed$class.foreach(IterableViewLike.scala:41)
at
scala.collection.IterableViewLike$$anon$3.foreach(IterableViewLike.scala:80)
at
org.dbpedia.extraction.dump.extract.Extraction$.main(Extraction.scala:29)
at
org.dbpedia.extraction.dump.extract.Extraction.main(Extraction.scala)
... 6 more
[INFO]
------------------------------------------------------------------------
[ERROR] BUILD ERROR
[INFO]
------------------------------------------------------------------------
[INFO] wrap: org.apache.commons.exec.ExecuteException: Process exited with
an error: 240(Exit value: 240)
On Thu, Nov 22, 2012 at 11:00 AM, Jona Christopher Sahnwaldt <
[email protected]> wrote:
> Hi,
>
> the problem is not the XML namespace.
>
> > Caused by: java.util.NoSuchElementException: key not found: 710
> > at scala.collection.MapLike$class.default(MapLike.scala:225)
> > at scala.collection.immutable.HashMap.default(HashMap.scala:38)
> > at scala.collection.MapLike$class.apply(MapLike.scala:135)
> > at scala.collection.immutable.HashMap.apply(HashMap.scala:38)
> > at
> org.dbpedia.extraction.sources.WikipediaDumpParser.readPage(WikipediaDumpParser.java:224)
>
> The problem is that en.wiki recently added MediaWiki namespace code
> 710 [1] that's not included in our configuration [2] yet. In line 224
> in WikipediaDumpParser.java [3] we look up the namespace number that
> we found in the Wikipedia XML dump in our configuration. That lookup
> crashes if the namespace code is unknown.
>
> To update the configuration, please cd to core/ and run
>
> ../clean-install-run generate-settings
>
> This will download the current MediaWiki namespace settings from all
> Wikipedia editions. May take a few minutes. Then run the extraction
> again.
>
> Hope this helps!
>
> I'll try to update the configuration in the Mercurial repository soon.
>
> Cheers,
> Christopher
>
> [1]
> http://en.wikipedia.org/w/api.php?action=query&meta=siteinfo&siprop=namespaces
> [2]
> http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/331f8aba5f7d/core/src/main/scala/org/dbpedia/extraction/wikiparser/impl/wikipedia/Namespaces.scala
> [3]
> http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/331f8aba5f7d/core/src/main/java/org/dbpedia/extraction/sources/WikipediaDumpParser.java#l224
>
> On Mon, Nov 19, 2012 at 3:50 PM, Batica Dzonic <[email protected]> wrote:
> >
> > Wikipedia again changed XML scheme from
> > http://www.mediawiki.org/xml/export-0.7/ to 0.8...
> > I had the following problem with the dump extraction Loading Redirect
> From
> > source (en)
> >
> > found by dbpedia: [title=Girls' Generation II ~Girls &
> > Peace~;ns=0/Main/;language:wiki=en,locale=en].
> > found by wikipedia: [title=Girls' Generation II: Girls &
> > Peace;ns=0/Main/;language:wiki=en,locale=en]
> > Nov 19, 2012 2:14:46 PM
> > org.dbpedia.extraction.mappings.Redirects$RedirectFinder apply
> > WARNING: wrong redirect. page:
> > [title=萬琲臻;ns=0/Main/;language:wiki=en,locale=en].
> > found by dbpedia: [title=Sharon
> > Wan;ns=0/Main/;language:wiki=en,locale=en].
> > found by wikipedia: [null]
> > Nov 19, 2012 2:14:46 PM
> > org.dbpedia.extraction.mappings.Redirects$RedirectFinder apply
> > WARNING: wrong redirect. page: [title=John Courthope
> > (disambiguation);ns=0/Main/;language:wiki=en,locale=en].
> > found by dbpedia: [title=John
> > Courthope;ns=0/Main/;language:wiki=en,locale=en].
> > found by wikipedia: [null]
> > java.lang.reflect.InvocationTargetException
> > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > at
> >
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> > at
> >
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> > at java.lang.reflect.Method.invoke(Method.java:601)
> > at
> > org_scala_tools_maven_executions.MainHelper.runMain(MainHelper.java:161)
> > at
> >
> org_scala_tools_maven_executions.MainWithArgsInFile.main(MainWithArgsInFile.java:26)
> > Caused by: java.util.NoSuchElementException: key not found: 710
> > at scala.collection.MapLike$class.default(MapLike.scala:225)
> > at scala.collection.immutable.HashMap.default(HashMap.scala:38)
> > at scala.collection.MapLike$class.apply(MapLike.scala:135)
> > at scala.collection.immutable.HashMap.apply(HashMap.scala:38)
> > at
> >
> org.dbpedia.extraction.sources.WikipediaDumpParser.readPage(WikipediaDumpParser.java:224)
> > at
> >
> org.dbpedia.extraction.sources.WikipediaDumpParser.readPages(WikipediaDumpParser.java:185)
> > at
> >
> org.dbpedia.extraction.sources.WikipediaDumpParser.readDump(WikipediaDumpParser.java:143)
> > at
> >
> org.dbpedia.extraction.sources.WikipediaDumpParser.run(WikipediaDumpParser.java:114)
> > at
> >
> org.dbpedia.extraction.sources.XMLReaderSource.foreach(XMLSource.scala:57)
> > at
> > scala.collection.TraversableLike$class.flatMap(TraversableLike.scala:239)
> > at
> >
> org.dbpedia.extraction.sources.XMLReaderSource.flatMap(XMLSource.scala:53)
> > at
> >
> org.dbpedia.extraction.mappings.Redirects$.loadFromSource(Redirects.scala:165)
> > at
> > org.dbpedia.extraction.mappings.Redirects$.load(Redirects.scala:116)
> > at
> >
> org.dbpedia.extraction.dump.extract.ConfigLoader$$anon$1.<init>(ConfigLoader.scala:100)
> > at
> > org.dbpedia.extraction.dump.extract.ConfigLoader.org
> $dbpedia$extraction$dump$extract$ConfigLoader$$createExtractionJob(ConfigLoader.scala:55)
> > at
> >
> org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:40)
> > at
> >
> org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:40)
> > at scala.collection.Iterator$$anon$19.next(Iterator.scala:401)
> > at scala.collection.Iterator$class.foreach(Iterator.scala:772)
> > at scala.collection.Iterator$$anon$19.foreach(Iterator.scala:399)
> > at
> >
> scala.collection.IterableViewLike$Transformed$class.foreach(IterableViewLike.scala:41)
> > at
> >
> scala.collection.IterableViewLike$$anon$3.foreach(IterableViewLike.scala:80)
> > at
> > org.dbpedia.extraction.dump.extract.Extraction$.main(Extraction.scala:27)
> > at
> > org.dbpedia.extraction.dump.extract.Extraction.main(Extraction.scala)
> > ... 6 more
> > [INFO]
> > ------------------------------------------------------------------------
> > [ERROR] BUILD ERROR
> > [INFO]
> > ------------------------------------------------------------------------
> > [INFO] wrap: org.apache.commons.exec.ExecuteException: Process exited
> with
> > an error: 240(Exit value: 240)
> >
> > [INFO]
> > ------------------------------------------------------------------------
> > [INFO] Trace
> > org.apache.maven.lifecycle.LifecycleExecutionException: wrap:
> > org.apache.commons.exec.ExecuteException: Process exited with an error:
> > 240(Exit value: 240)
> > at
> >
> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:719)
> > at
> >
> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeStandaloneGoal(DefaultLifecycleExecutor.java:569)
> > at
> >
> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoal(DefaultLifecycleExecutor.java:539)
> > at
> >
> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoalAndHandleFailures(DefaultLifecycleExecutor.java:387)
> > at
> >
> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeTaskSegments(DefaultLifecycleExecutor.java:348)
> > at
> >
> org.apache.maven.lifecycle.DefaultLifecycleExecutor.execute(DefaultLifecycleExecutor.java:180)
> > at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:328)
> > at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:138)
> > at org.apache.maven.cli.MavenCli.main(MavenCli.java:362)
> > at
> > org.apache.maven.cli.compat.CompatibleMain.main(CompatibleMain.java:60)
> > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > at
> >
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> > at
> >
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> > at java.lang.reflect.Method.invoke(Method.java:601)
> > at
> > org.codehaus.classworlds.Launcher.launchEnhanced(Launcher.java:315)
> > at org.codehaus.classworlds.Launcher.launch(Launcher.java:255)
> > at
> > org.codehaus.classworlds.Launcher.mainWithExitCode(Launcher.java:430)
> > at org.codehaus.classworlds.Launcher.main(Launcher.java:375)
> > Caused by: org.apache.maven.plugin.MojoExecutionException: wrap:
> > org.apache.commons.exec.ExecuteException: Process exited with an error:
> > 240(Exit value: 240)
> > at
> > org_scala_tools_maven.ScalaMojoSupport.execute(ScalaMojoSupport.java:350)
> > at
> >
> org.apache.maven.plugin.DefaultPluginManager.executeMojo(DefaultPluginManager.java:490)
> > at
> >
> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:694)
> > ... 17 more
> > Caused by: org.apache.commons.exec.ExecuteException: Process exited with
> > an error: 240(Exit value: 240)
> > at
> >
> org.apache.commons.exec.DefaultExecutor.executeInternal(DefaultExecutor.java:346)
> > at
> > org.apache.commons.exec.DefaultExecutor.execute(DefaultExecutor.java:149)
> > at
> > org.apache.commons.exec.DefaultExecutor.execute(DefaultExecutor.java:136)
> > at
> >
> org_scala_tools_maven_executions.JavaMainCallerByFork.run(JavaMainCallerByFork.java:80)
> > at
> >
> org_scala_tools_maven_executions.JavaMainCallerSupport.run(JavaMainCallerSupport.java:96)
> > at
> > org_scala_tools_maven.ScalaRunMojo.doExecute(ScalaRunMojo.java:105)
> > at
> > org_scala_tools_maven.ScalaMojoSupport.execute(ScalaMojoSupport.java:342)
> > ... 19 more
> > [INFO]
> > ------------------------------------------------------------------------
> > [INFO] Total time: 37 minutes 58 seconds
> > [INFO] Finished at: Mon Nov 19 14:14:46 UTC 2012
> > [INFO] Final Memory: 19M/46M
> > [INFO]
> > ------------------------------------------------------------------------
> >
> >
> >
> >
> > Has anyone encountered this problem or know how to skip this?
> >
> >
> >
> ------------------------------------------------------------------------------
> > Monitor your physical, virtual and cloud infrastructure from a single
> > web console. Get in-depth insight into apps, servers, databases, vmware,
> > SAP, cloud infrastructure, etc. Download 30-day Free Trial.
> > Pricing starts from $795 for 25 servers or applications!
> > http://p.sf.net/sfu/zoho_dev2dev_nov
> > _______________________________________________
> > Dbpedia-discussion mailing list
> > [email protected]
> > https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
> >
>
>
> ------------------------------------------------------------------------------
> Monitor your physical, virtual and cloud infrastructure from a single
> web console. Get in-depth insight into apps, servers, databases, vmware,
> SAP, cloud infrastructure, etc. Download 30-day Free Trial.
> Pricing starts from $795 for 25 servers or applications!
> http://p.sf.net/sfu/zoho_dev2dev_nov
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion