Thanks for your suggestions.

I do not know whether attachments are allowed, so I paste the
extraction.properties below. (apparantly I already tried adding the full 
path:dbpedia.extraction.mappings.DisambiguationExtractor,
but without, the same error message follows).

extractors.nl<http://extractors.nl>=MappingExtractor,DisambiguationExtractor
and
extractors.nl<http://extractors.nl>=MappingExtractor,org.dbpedia.extraction.mappings.DisambiguationExtractor,HomepageExtractor
give the same error.

If I only have

extractors.nl<http://extractors.nl>=MappingExtractor

the extraction process takes about 30 minutes and seems to end without problems
(I haven't looked at generated outputfiles yet though).

(I started without commenting out the other extractors, but then despite having
languages=nl, other languages were being textracted)

Kind regards,

Lourens

DETAILS

The error I get, again:
error
...
Caused by: java.util.NoSuchElementException: key not found: nl
..
at 
org.dbpedia.extraction.mappings.DisambiguationExtractor.<init>(DisambiguationExtractor.scala:22)
=================extraction.properties========
dir=/home/lourens/spotlight/wikipedia
source=pages-articles.xml.bz2
require-download-complete=true
languages=nl
extractors=ArticleCategoriesExtractor,CategoryLabelExtractor,ExternalLinksExtractor,\
GeoExtractor,InfoboxExtractor,LabelExtractor,PageIdExtractor,PageLinksExtractor,\
RedirectExtractor,RevisionIdExtractor,SkosCategoriesExtractor,WikiPageExtractor
extractors.nl<http://extractors.nl>=MappingExtractor,org.dbpedia.extraction.mappings.DisambiguationExtractor,HomepageExtractor,ImageExtractor,\
InterLanguageLinksExtractor
#extractors.nl=MappingExtractor
ontology=../ontology.xml
mappings=../mappings
uri-policy.uri=uri:en; generic:en; xml-safe-predicates:*
uri-policy.iri=generic:en; xml-safe-predicates:*
format.nt.gz=n-triples;uri-policy.uri
format.nq.gz=n-quads;uri-policy.uri
format.ttl.gz=turtle-triples;uri-policy.iri
format.tql.gz=turtle-quads;uri-policy.iri

On Jun 15, 2012, at 16:41 PM, Pablo Mendes wrote:

Can you show us your extraction.properties? I suspect you forgot the line below?

languages=nl

I am also not sure if you should have fully qualified 
(org.dbpedia.extraction.mappings.DisambiguationExtractor) or just the class 
name (DisambiguationExtractor).

Cheers,
Pablo

On Fri, Jun 15, 2012 at 1:43 PM, Meij, L.K. van der 
<[email protected]<mailto:[email protected]>> wrote:

I am trying to set up dbpedia-spotlight for the Dutch language. There are some 
datasets available for Dutch (nl),
but I expect to at least need the dbpedia "disambiguation" dataset, which is 
not available for download.

After setting up extraction_framework for "nl" and doing :
editing extraction.properties: reoving all extractors.** except:
extractors.nl<http://extractors.nl/>=MappingExtractor,org.dbpedia.extraction.mappings.DisambiguationExtractor,HomepageExtractor,ImageExtractor,\
InterLanguageLinksExtractor)

$ cd dump;mvn scala:run
I get an error message:
..
INFO: Mappings loaded (nl)
java.lang.reflect.InvocationTargetException
..
Caused by: java.util.NoSuchElementException: key not found: nl
        at scala.collection.MapLike$class.default(MapLike.scala:225)
        at scala.collection.immutable.HashMap.default(HashMap.scala:38)
        at scala.collection.MapLike$class.apply(MapLike.scala:135)
        at scala.collection.immutable.HashMap.apply(HashMap.scala:38)
        at 
org.dbpedia.extraction.mappings.DisambiguationExtractor.<init>(DisambiguationExtractor.scala:22)

I assume this means that some classes have not been implemented for "nl"?

If so, I would like to know if such an effort is on the way or whether it would 
be feasible for me to give it a try?
Is there some pointer/documentation on how to get started?

Thanks,

Lourens


==================
DETAILS OF WHAT I DID


I managed to install the extraction_framework. The documentation seems a bit 
out of date though so it could be I
did things wrong.
I managed to download the "nl" wikipedia input by editing dump.properties and 
doing
$ cd dump; mvn scala:run -Dlauncher=download

Extraction started when commenting out all
other "extraction.**=" entries in dump/extraction.properties leaving only

extractors.nl<http://extractors.nl/>=MappingExtractor

and running

$ mvn scala:run
The output indicates that extraction proceeds nicely.

But I expect that in the result the "disambiguation" result will be missing.

When I replace extractors.nl<http://extractors.nl/> (analoguous to other 
languages):

extractors.nl<http://extractors.nl/>=MappingExtractor,org.dbpedia.extraction.mappings.DisambiguationExtractor,HomepageExtractor,ImageExtractor,\
InterLanguageLinksExtractor

I get error messages mentioned above.


------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Dbpedia-discussion mailing list
[email protected]<mailto:[email protected]>
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion



------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to