Hi Uros,
yes you should leave the prefix out.
Your problem lies in the command you are executing:
../run *extract* config=sr.test.properties
If you check the "run" script you will see that the first parameter must be
the name of the scala launcher you want to invoke.
This is failing because there is no launcher named "extract" in the
dump/pom.xml.
The extraction laucher is called "extraction".
Can you retry launching:
../run *extraction* config=sr.test.properties
Cheers
Andrea
2013/10/21 Uros Milosevic <[email protected]>
> Hi Andrea,
>
> Thank you for your reply.
>
> So, I should leave out the 'srwiki-20131009-' prefix, even though it's
> part of the actual filename (I already tried that, but perhaps the problem
> lies elsewhere)?
>
> I run the extractor from the framework dump dir (where the configuration
> files are):
>
> /home/uros/dbpedia/extraction-framework/dump/
>
> by invoking
>
> ../run extract config=sr.test.properties
>
> Best,
> Uros
>
>
> > Hi Uros,
> >
> > as far as I know the source parameter should be
> > pages-articles.xml.bz2
> >
> > Could you please tell us what command you are launching and from which
> > directory?
> > This
> >
> > [WARNING] Not mainClass or valid launcher found/define
> >
> > suggests you are probably invoking the wrong command.
> >
> > Cheers
> > Andrea
> >
> >
> > 2013/10/21 Uros Milosevic <[email protected]>
> >
> >> Hi Dimitris, hi all,
> >>
> >> I’ve been quite busy the last few months, so now that I’ve finally
> >> found
> >> some spare time, I’m taking another shot at DBpedia extraction for the
> >> Serbian language.
> >>
> >> I’m making some progress with the latest version of the extraction
> >> framework (I was pretty much stuck at the beginning with the previous
> >> one). I’ve successfully downloaded the latest Wikipedia dump, but
> >> I’m
> >> having problems extracting any triples from it. Here’s what I have in
> >> my
> >> extraction config file (comments removed for clarity):
> >>
> >> -------------------------------------------------------------
> >> base-dir=/home/uros/dbpedia/dumps
> >> source=srwiki/20131009/srwiki-20131009-pages-articles.xml.bz2
> >> languages=sr
> >> extractors.sr=LabelExtractor
> >> ontology=../ontology.xml
> >> mappings=../mappings
> >> uri-policy.uri=uri:en; generic:en; xml-safe-predicates:*
> >> uri-policy.iri=generic:en; xml-safe-predicates:*
> >> format.nt.gz=n-triples;uri-policy.uri
> >> format.nq.gz=n-quads;uri-policy.uri
> >> format.ttl.gz=turtle-triples;uri-policy.iri
> >> format.tql.gz=turtle-quads;uri-policy.iri
> >> --------------------------------------------------------------
> >>
> >> The extraction framework is located in
> >> /home/uros/dbpedia/extraction-framework.
> >> The dump is inside /home/uros/dbpedia/dumps/srwiki/20131009.
> >>
> >> As for the 'source' parameter, I’ve tried including both the absolute
> >> path
> >> and the one relative to the base-dir (but also leaving it out
> >> completely),
> >> and I’ve tried with and without the srwiki-yyyymmdd and sr: prefix,
> >> but
> >> all to no avail. Also, I’m trying a single extractor for starters,
> >> hoping
> >> I’ll get at least something (I’ll fine tune it later).
> >>
> >> When I run the extraction script with the above config file, I get the
> >> following:
> >>
> >> [INFO] Scanning for projects...
> >> [INFO]
> >> ------------------------------------------------------------------------
> >> [INFO] Building DBpedia Dump Extraction
> >> [INFO] task-segment: [scala:run]
> >> [INFO]
> >> ------------------------------------------------------------------------
> >> [INFO] Preparing scala:run
> >> [INFO] [resources:resources {execution: default-resources}]
> >> [INFO] Using 'UTF-8' encoding to copy filtered resources.
> >> [INFO] skip non existing resourceDirectory
> >> /home/uros/dbpedia/extraction-framework/dump/src/main/resources
> >> [INFO] [scala:compile {execution: process-resources}]
> >> [INFO] Checking for multiple versions of scala
> >> [INFO] includes = [**/*.scala,**/*.java,]
> >> [INFO] excludes = []
> >> [INFO] Nothing to compile - all classes are up to date
> >> [INFO] [compiler:compile {execution: default-compile}]
> >> [INFO] Nothing to compile - all classes are up to date
> >> [INFO] [scala:compile {execution: compile}]
> >> [INFO] Checking for multiple versions of scala
> >> [INFO] includes = [**/*.scala,**/*.java,]
> >> [INFO] excludes = []
> >> [INFO] Nothing to compile - all classes are up to date
> >> [INFO] [resources:testResources {execution: default-testResources}]
> >> [INFO] Using 'UTF-8' encoding to copy filtered resources.
> >> [INFO] skip non existing resourceDirectory
> >> /home/uros/dbpedia/extraction-framework/dump/src/test/resources
> >> [INFO] [compiler:testCompile {execution: default-testCompile}]
> >> [INFO] No sources to compile
> >> [INFO] [scala:testCompile {execution: test-compile}]
> >> [INFO] Checking for multiple versions of scala
> >> [INFO] includes = [**/*.scala,**/*.java,]
> >> [INFO] excludes = []
> >> [WARNING] No source files found.
> >> [INFO] [scala:run {execution: default-cli}]
> >> [INFO] Checking for multiple versions of scala
> >> [WARNING] Not mainClass or valid launcher found/define
> >> [INFO]
> >> ------------------------------------------------------------------------
> >> [INFO] BUILD SUCCESSFUL
> >> [INFO]
> >> ------------------------------------------------------------------------
> >> [INFO] Total time: 2 seconds
> >> [INFO] Finished at: Fri Oct 18 14:12:49 CEST 2013
> >> [INFO] Final Memory: 25M/455M
> >> [INFO]
> >> ------------------------------------------------------------------------
> >>
> >>
> >> No changes in the target extraction dir, though. Any help would be much
> >> appreciated!
> >>
> >> Best regards,
> >> Uroš
> >>
> >>
> >>
> >>
> >>
> >>
> ------------------------------------------------------------------------------
> >> October Webinars: Code for Performance
> >> Free Intel webinars can help you accelerate application performance.
> >> Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most
> >> from
> >> the latest Intel processors and coprocessors. See abstracts and register
> >> >
> >>
> http://pubads.g.doubleclick.net/gampad/clk?id=60135031&iu=/4140/ostg.clktrk
> >> _______________________________________________
> >> Dbpedia-developers mailing list
> >> [email protected]
> >> https://lists.sourceforge.net/lists/listinfo/dbpedia-developers
> >>
> >
>
>
------------------------------------------------------------------------------
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from
the latest Intel processors and coprocessors. See abstracts and register >
http://pubads.g.doubleclick.net/gampad/clk?id=60135031&iu=/4140/ostg.clktrk
_______________________________________________
Dbpedia-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-developers