Hi Dimitris,
Thanks for your reply. I see that Namespace.scala is indeed fixed in the
dump branch. Actually, I'm a bit confused about all the branches that are
available at the site. Is there a place that describes all the branches in
the Mercurial repository of the extraction_framework? Which one should I use? I
had checked out the default branch and did an hg update/pull some time back. I'm
used to working with SVN, which has the concept of a stable trunk. This Mercurial
business seems to get confusing.
Regards
Amit
From: Dimitris Kontokostas <[email protected]>
Date: Wednesday, December 5, 2012 1:53 PM
To: Amit Kumar <[email protected]>
Cc: "[email protected]" <[email protected]>
Subject: Re: [Dbpedia-discussion] Error in running the Dbpedia Extraction
Framework on Oct dump of Enwiki
Hi Amit,
you can look at the following thread for the error details.
http://sourceforge.net/mailarchive/message.php?msg_id=30145224
If you switch to the dump branch things should work fine. However, if you need
the wiktionary branch, we were planning to deploy Jona's patch to all branches
anyway, so we might speed up the procedure :)
Best,
Dimitris
On Wed, Dec 5, 2012 at 9:59 AM, Amit Kumar <[email protected]> wrote:
Hi all,
I have been running the DBpedia extraction framework on Wikipedia dumps. It was
working fine, but recently it has started giving an error. I'm using the following
version of the code:
--------------
changeset: 1610:59dda670016e
branch: wiktionary
tag: tip
parent: 1609:a71b7d4bf8d1
parent: 1607:540017622ed2
user: Jonas Brekle <[email protected]>
date: Wed Aug 29 17:04:41 2012 +0200
summary: merge default into wiktionary again
-------------------
You can find the error output below. From my understanding, the problem lies in
the Namespace.scala file:
http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/tip/core/src/main/scala/org/dbpedia/extraction/wikiparser/Namespace.scala
My guess is that Wikipedia recently added a new namespace that has not yet been
added to the above file, and that this is the reason for the exception. You can
check the list of Wikipedia namespaces at
http://en.wikipedia.org/wiki/Wikipedia:Namespace
710 corresponds to the TimedText namespace:
http://en.wikipedia.org/wiki/Wikipedia:TimedText_namespace
Please let me know if I'm correct, and whether anyone has seen or fixed the same
issue.
---------------------
Dec 4, 2012 10:44:53 AM org.dbpedia.extraction.mappings.Redirects$RedirectFinder apply
WARNING: wrong redirect. page: [title=VTV2;ns=0/Main/;language:wiki=en,locale=en].
found by dbpedia: [title=Virtual University of Pakistan;ns=0/Main/;language:wiki=en,locale=en;fragment='Distant teaching'].
found by wikipedia: [null]
Exception in thread "main" java.util.NoSuchElementException: key not found: 710
at scala.collection.MapLike$class.default(MapLike.scala:225)
at scala.collection.immutable.HashMap.default(HashMap.scala:38)
at scala.collection.MapLike$class.apply(MapLike.scala:135)
at scala.collection.immutable.HashMap.apply(HashMap.scala:38)
at org.dbpedia.extraction.sources.WikipediaDumpParser.readPage(WikipediaDumpParser.java:227)
at org.dbpedia.extraction.sources.WikipediaDumpParser.readPages(WikipediaDumpParser.java:188)
at org.dbpedia.extraction.sources.WikipediaDumpParser.readDump(WikipediaDumpParser.java:146)
at org.dbpedia.extraction.sources.WikipediaDumpParser.run(WikipediaDumpParser.java:117)
at org.dbpedia.extraction.sources.XMLReaderSource.foreach(XMLSource.scala:64)
at scala.collection.TraversableLike$class.flatMap(TraversableLike.scala:239)
at org.dbpedia.extraction.sources.XMLReaderSource.flatMap(XMLSource.scala:60)
at org.dbpedia.extraction.mappings.Redirects$.loadFromSource(Redirects.scala:165)
at org.dbpedia.extraction.mappings.Redirects$.load(Redirects.scala:116)
at org.dbpedia.extraction.dump.extract.ConfigLoader$$anon$1.<init>(ConfigLoader.scala:109)
at org.dbpedia.extraction.dump.extract.ConfigLoader.org$dbpedia$extraction$dump$extract$ConfigLoader$$createExtractionJob(ConfigLoader.scala:64)
at org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:48)
at org.dbpedia.extraction.dump.extract.ConfigLoader$$anonfun$getExtractionJobs$1.apply(ConfigLoader.scala:48)
at scala.collection.Iterator$$anon$19.next(Iterator.scala:401)
at scala.collection.Iterator$class.foreach(Iterator.scala:772)
at scala.collection.Iterator$$anon$19.foreach(Iterator.scala:399)
at scala.collection.IterableViewLike$Transformed$class.foreach(IterableViewLike.scala:41)
at scala.collection.IterableViewLike$$anon$3.foreach(IterableViewLike.scala:80)
at org.dbpedia.extraction.dump.extract.Extraction$.main(Extraction.scala:36)
at org.dbpedia.extraction.dump.extract.Extraction.main(Extraction.scala)
-----------------
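For readers following the trace above, here is a minimal sketch of why a namespace code missing from a lookup table would produce this exact exception message. It is not the framework's actual Namespace.scala: the object name, the method names, and the four-entry table are all hypothetical and chosen only for illustration. The one fact it relies on is that calling apply on a Scala immutable Map with an absent key throws java.util.NoSuchElementException with the message "key not found: <key>".

```scala
// Hypothetical sketch, NOT the real Namespace.scala from the extraction framework.
object NamespaceLookupSketch {

  // Illustrative subset of MediaWiki namespace codes.
  // Code 710 (TimedText) is deliberately absent, mirroring the suspected gap.
  val namespaces: Map[Int, String] = Map(
    0  -> "Main",
    1  -> "Talk",
    10 -> "Template",
    14 -> "Category"
  )

  // Map.apply throws NoSuchElementException("key not found: <code>")
  // when the code is not in the table.
  def strictLookup(code: Int): String = namespaces(code)

  // Map.get returns an Option, so the caller decides what to do
  // with a code the table does not know about.
  def safeLookup(code: Int): Option[String] = namespaces.get(code)

  def main(args: Array[String]): Unit = {
    println(safeLookup(0))
    println(safeLookup(710))
    try {
      println(strictLookup(710))
    } catch {
      case e: NoSuchElementException =>
        println("lookup failed: " + e.getMessage)
    }
  }
}
```

Whether the framework should skip pages in unknown namespaces or fail fast is a design decision for its maintainers; the sketch only shows the two lookup styles side by side.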
Regards
Amit
------------------------------------------------------------------------------
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
--
Kontokostas Dimitris