Hi dbpedia-community!

I'm experiencing heavy problems trying to get the extraction framework 
to run. The step I'm stuck at is downloading the dumps. My config-file 
seems to be correct as the download is started by the framework when 
running "mvn scala:run". Nevertheless the download times-out at a random 
state of data downloaded.

Downloading this file 
http://dumps.wikimedia.org/enwiki/20120307/enwiki-20120307-pages-articles.xml.bz2
 
with my browser is 10x slower than by downloading it with the framework. 
Downloading it with the browser results in the supposedly completely 
downloaded archive which is corrupted everytime since the download times 
out or else (The browser shows the download as completed though).

At the moment it's impossible for me to get the dumps. I hope someone 
can please help me out since I need the most recent data at hand!

Regards,
David

My config-file:

dumpDir=K:/Work/Eclipse Workspace/DBpedia_Dumps/to_update
outputDir=K:/Work/Eclipse Workspace/DBpedia_Dumps/updated
updateDumps=true

extractors=org.dbpedia.extraction.mappings.LabelExtractor \
            org.dbpedia.extraction.mappings.WikiPageExtractor \
            org.dbpedia.extraction.mappings.InfoboxExtractor \
            org.dbpedia.extraction.mappings.PageLinksExtractor \
            org.dbpedia.extraction.mappings.GeoExtractor

extractors.en=org.dbpedia.extraction.mappings.CategoryLabelExtractor \
               org.dbpedia.extraction.mappings.ArticleCategoriesExtractor \
               org.dbpedia.extraction.mappings.ExternalLinksExtractor \
               org.dbpedia.extraction.mappings.HomepageExtractor \
               org.dbpedia.extraction.mappings.DisambiguationExtractor \
               org.dbpedia.extraction.mappings.PersondataExtractor \
               org.dbpedia.extraction.mappings.PndExtractor \
               org.dbpedia.extraction.mappings.SkosCategoriesExtractor \
               org.dbpedia.extraction.mappings.RedirectExtractor \
               org.dbpedia.extraction.mappings.MappingExtractor \
               org.dbpedia.extraction.mappings.PageIdExtractor \
               org.dbpedia.extraction.mappings.AbstractExtractor \
               org.dbpedia.extraction.mappings.RevisionIdExtractor

languages=en



------------------------------------------------------------------------------
This SF email is sponsosred by:
Try Windows Azure free for 90 days Click Here 
http://p.sf.net/sfu/sfd2d-msazure
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to