Hi Andrea,

Thank you for your reply.

So, I should leave out the 'srwiki-20131009-' prefix, even though it's part of the actual filename (I already tried that, but perhaps the problem lies elsewhere)?

I run the extractor from the framework dump dir (where the configuration files are):

/home/uros/dbpedia/extraction-framework/dump/

by invoking

../run extract config=sr.test.properties

Best,
Uros

> Hi Uros,
>
> as far as I know the source parameter should be
> pages-articles.xml.bz2
>
> Could you please tell us what command you are launching and from which
> directory?
> This
>
> [WARNING] Not mainClass or valid launcher found/define
>
> suggests you are probably invoking the wrong command.
>
> Cheers
> Andrea
>
>
> 2013/10/21 Uros Milosevic <[email protected]>
>
>> Hi Dimitris, hi all,
>>
>> I've been quite busy the last few months, so now that I've finally found
>> some spare time, I'm taking another shot at DBpedia extraction for the
>> Serbian language.
>>
>> I'm making some progress with the latest version of the extraction
>> framework (I was pretty much stuck at the beginning with the previous
>> one). I've successfully downloaded the latest Wikipedia dump, but I'm
>> having problems extracting any triples from it.
>> Here's what I have in my extraction config file (comments removed for
>> clarity):
>>
>> -------------------------------------------------------------
>> base-dir=/home/uros/dbpedia/dumps
>> source=srwiki/20131009/srwiki-20131009-pages-articles.xml.bz2
>> languages=sr
>> extractors.sr=LabelExtractor
>> ontology=../ontology.xml
>> mappings=../mappings
>> uri-policy.uri=uri:en; generic:en; xml-safe-predicates:*
>> uri-policy.iri=generic:en; xml-safe-predicates:*
>> format.nt.gz=n-triples;uri-policy.uri
>> format.nq.gz=n-quads;uri-policy.uri
>> format.ttl.gz=turtle-triples;uri-policy.iri
>> format.tql.gz=turtle-quads;uri-policy.iri
>> --------------------------------------------------------------
>>
>> The extraction framework is located in
>> /home/uros/dbpedia/extraction-framework.
>> The dump is inside /home/uros/dbpedia/dumps/srwiki/20131009.
>>
>> As for the 'source' parameter, I've tried including both the absolute path
>> and the one relative to the base-dir (but also leaving it out completely),
>> and I've tried with and without the srwiki-yyyymmdd and sr: prefix, but
>> all to no avail. Also, I'm trying a single extractor for starters, hoping
>> I'll get at least something (I'll fine tune it later).
>>
>> When I run the extraction script with the above config file, I get the
>> following:
>>
>> [INFO] Scanning for projects...
>> [INFO] ------------------------------------------------------------------------
>> [INFO] Building DBpedia Dump Extraction
>> [INFO] task-segment: [scala:run]
>> [INFO] ------------------------------------------------------------------------
>> [INFO] Preparing scala:run
>> [INFO] [resources:resources {execution: default-resources}]
>> [INFO] Using 'UTF-8' encoding to copy filtered resources.
>> [INFO] skip non existing resourceDirectory /home/uros/dbpedia/extraction-framework/dump/src/main/resources
>> [INFO] [scala:compile {execution: process-resources}]
>> [INFO] Checking for multiple versions of scala
>> [INFO] includes = [**/*.scala,**/*.java,]
>> [INFO] excludes = []
>> [INFO] Nothing to compile - all classes are up to date
>> [INFO] [compiler:compile {execution: default-compile}]
>> [INFO] Nothing to compile - all classes are up to date
>> [INFO] [scala:compile {execution: compile}]
>> [INFO] Checking for multiple versions of scala
>> [INFO] includes = [**/*.scala,**/*.java,]
>> [INFO] excludes = []
>> [INFO] Nothing to compile - all classes are up to date
>> [INFO] [resources:testResources {execution: default-testResources}]
>> [INFO] Using 'UTF-8' encoding to copy filtered resources.
>> [INFO] skip non existing resourceDirectory /home/uros/dbpedia/extraction-framework/dump/src/test/resources
>> [INFO] [compiler:testCompile {execution: default-testCompile}]
>> [INFO] No sources to compile
>> [INFO] [scala:testCompile {execution: test-compile}]
>> [INFO] Checking for multiple versions of scala
>> [INFO] includes = [**/*.scala,**/*.java,]
>> [INFO] excludes = []
>> [WARNING] No source files found.
>> [INFO] [scala:run {execution: default-cli}]
>> [INFO] Checking for multiple versions of scala
>> [WARNING] Not mainClass or valid launcher found/define
>> [INFO] ------------------------------------------------------------------------
>> [INFO] BUILD SUCCESSFUL
>> [INFO] ------------------------------------------------------------------------
>> [INFO] Total time: 2 seconds
>> [INFO] Finished at: Fri Oct 18 14:12:49 CEST 2013
>> [INFO] Final Memory: 25M/455M
>> [INFO] ------------------------------------------------------------------------
>>
>>
>> No changes in the target extraction dir, though. Any help would be much
>> appreciated!
>>
>> Best regards,
>> Uroš
>>
>> _______________________________________________
>> Dbpedia-developers mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dbpedia-developers
