Hi all,
I have been running the DBpedia extraction framework on Wikipedia dumps. It was
working fine, but recently it started failing with an exception. I'm using the
following version of the code:
--------------
changeset: 1610:59dda670016e
branch: wiktionary
tag: tip
parent: 1609:a71b7d4bf8d1
parent: 1607:540017622ed2
user: Jonas Brekle <[email protected]>
date: Wed Aug 29 17:04:41 2012 +0200
summary: merge default into wiktionary again
-------------------
You can find the error output below. As far as I understand, the problem lies in
the Namespace file:
http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/tip/core/src/main/scala/org/dbpedia/extraction/wikiparser/Namespace.scala
My guess is that Wikipedia recently added a new namespace that has not yet been
added to the above file, and that this is the reason for the exception. See the
list of namespaces at http://en.wikipedia.org/wiki/Wikipedia:Namespace
The missing key, 710, corresponds to the TimedText namespace
(http://en.wikipedia.org/wiki/Wikipedia:TimedText_namespace).
Please let me know if I'm correct, and whether anyone has seen or fixed the same issue.
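To illustrate what I think is happening, here is a minimal sketch in Java. It is
not the framework's code: the class name, the method names, and the three-entry
table are hypothetical stand-ins for the namespace table in Namespace.scala. The
strict lookup mimics Scala's Map.apply, which throws NoSuchElementException for
a missing key (as in the trace below); the lenient lookup is one possible way a
parser could skip pages from namespaces it does not know.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.NoSuchElementException;

// Hypothetical sketch; none of these names exist in the extraction framework.
class NamespaceLookupSketch {
    // Stand-in for the code -> name table; 710 (TimedText) is deliberately absent.
    static final Map<Integer, String> KNOWN = new HashMap<>();
    static {
        KNOWN.put(0, "Main");
        KNOWN.put(1, "Talk");
        KNOWN.put(14, "Category");
    }

    // Mimics Scala's Map.apply: a missing key raises NoSuchElementException.
    static String strictLookup(int code) {
        String name = KNOWN.get(code);
        if (name == null) {
            throw new NoSuchElementException("key not found: " + code);
        }
        return name;
    }

    // Tolerant variant: returns null so the caller can skip the page instead.
    static String lenientLookup(int code) {
        return KNOWN.get(code);
    }

    public static void main(String[] args) {
        System.out.println(strictLookup(0));
        System.out.println(lenientLookup(710));
        try {
            strictLookup(710);
        } catch (NoSuchElementException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

If this guess is right, either adding 710 to the namespace table or making the
lookup tolerant of unknown codes should get past the exception; I have not
verified which of the two the maintainers would prefer.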
---------------------
Dec 4, 2012 10:44:53 AM org.dbpedia.extraction.mappings.Redirects$RedirectFinder apply
WARNING: wrong redirect. page: [title=VTV2;ns=0/Main/;language:wiki=en,locale=en].
found by dbpedia: [title=Virtual University of Pakistan;ns=0/Main/;language:wiki=en,locale=en;fragment='Distant teaching'].
found by wikipedia: [null]
Exception in thread "main" java.util.NoSuchElementException: key not found: 710
at scala.collection.MapLike$class.default(MapLike.scala:225)
at scala.collection.immutable.HashMap.default(HashMap.scala:38)
at scala.collection.MapLike$class.apply(MapLike.scala:135)
at scala.collection.immutable.HashMap.apply(HashMap.scala:38)
at org.dbpedia.extraction.sources.WikipediaDumpParser.readPage(WikipediaDumpParser.java:227)
at org.dbpedia.extraction.sources.WikipediaDumpParser.readPages(WikipediaDumpParser.java:188)
at org.dbpedia.extraction.sources.WikipediaDumpParser.readDump(WikipediaDumpParser.java:146)
at org.dbpedia.extraction.sources.WikipediaDumpParser.run(WikipediaDumpParser.java:117)
at org.dbpedia.extraction.sources.XMLReaderSource.foreach(XMLSource.scala:64)
at scala.collection.TraversableLike$class.flatMap(TraversableLike.scala:239)
at org.dbpedia.extraction.sources.XMLReaderSource.flatMap(XMLSource.scala:60)
at org.dbpedia.extraction.mappings.Redirects$.loadFromSource(Redirects.scala:165)
at org.dbpedia.extraction.mappings.Redirects$.load(Redirects.scala:116)
at org.dbpedia.extraction.dump.extract.ConfigLoader$$anon$1.<init>(ConfigLoader.scala:109)
at org.dbpedia.extraction.dump.extract.ConfigLoader.org$dbpedia$extraction$dump$extract$ConfigLoader$$createExtractionJob(ConfigLoader.scala:64)
at org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:48)
at org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:48)
at scala.collection.Iterator$$anon$19.next(Iterator.scala:401)
at scala.collection.Iterator$class.foreach(Iterator.scala:772)
at scala.collection.Iterator$$anon$19.foreach(Iterator.scala:399)
at scala.collection.IterableViewLike$Transformed$class.foreach(IterableViewLike.scala:41)
at scala.collection.IterableViewLike$$anon$3.foreach(IterableViewLike.scala:80)
at org.dbpedia.extraction.dump.extract.Extraction$.main(Extraction.scala:36)
at org.dbpedia.extraction.dump.extract.Extraction.main(Extraction.scala)
-----------------
Regards
Amit
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion