Hi DBpedia community! I'm having serious trouble getting the extraction framework to run. The step I'm stuck at is downloading the dumps. My config file seems to be correct, since the framework starts the download when I run "mvn scala:run". Nevertheless, the download times out after a random amount of data has been downloaded.
Downloading this file http://dumps.wikimedia.org/enwiki/20120307/enwiki-20120307-pages-articles.xml.bz2 with my browser is 10x slower than downloading it with the framework. The browser shows the download as completed, but the resulting archive is corrupted every time, since the download times out or fails in some other way. At the moment it's impossible for me to get the dumps. I hope someone can help me out, since I need the most recent data at hand!

Regards,
David

My config file:

dumpDir=K:/Work/Eclipse Workspace/DBpedia_Dumps/to_update
outputDir=K:/Work/Eclipse Workspace/DBpedia_Dumps/updated
updateDumps=true
extractors=org.dbpedia.extraction.mappings.LabelExtractor \
org.dbpedia.extraction.mappings.WikiPageExtractor \
org.dbpedia.extraction.mappings.InfoboxExtractor \
org.dbpedia.extraction.mappings.PageLinksExtractor \
org.dbpedia.extraction.mappings.GeoExtractor
extractors.en=org.dbpedia.extraction.mappings.CategoryLabelExtractor \
org.dbpedia.extraction.mappings.ArticleCategoriesExtractor \
org.dbpedia.extraction.mappings.ExternalLinksExtractor \
org.dbpedia.extraction.mappings.HomepageExtractor \
org.dbpedia.extraction.mappings.DisambiguationExtractor \
org.dbpedia.extraction.mappings.PersondataExtractor \
org.dbpedia.extraction.mappings.PndExtractor \
org.dbpedia.extraction.mappings.SkosCategoriesExtractor \
org.dbpedia.extraction.mappings.RedirectExtractor \
org.dbpedia.extraction.mappings.MappingExtractor \
org.dbpedia.extraction.mappings.PageIdExtractor \
org.dbpedia.extraction.mappings.AbstractExtractor \
org.dbpedia.extraction.mappings.RevisionIdExtractor
languages=en
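
In case it helps with diagnosing this: the following is a minimal sketch, not part of the extraction framework, of how a downloaded archive could be checked for truncation or corruption before it is handed to the extractors. It uses only Python's standard bz2 module. The function name is_complete_bz2 and the file path in the usage note are hypothetical and chosen only for illustration.

```python
import bz2


def is_complete_bz2(path, chunk_size=1024 * 1024):
    """Return True if the bz2 file at `path` decompresses to its end.

    A truncated download raises EOFError while reading, and damaged
    data raises OSError; both are reported as False.
    """
    try:
        with bz2.open(path, "rb") as archive:
            # Read and discard the decompressed data chunk by chunk,
            # so memory use stays small even for very large dumps.
            while archive.read(chunk_size):
                pass
    except (OSError, EOFError):
        return False
    return True
```

It could be called as is_complete_bz2("enwiki-20120307-pages-articles.xml.bz2"), assuming the file sits in the current directory. Note that the check decompresses the whole archive, so it will take a while on a full dump.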
