[Dbpedia-discussion] dbpedia extraction framework on windows problems

Adrian Brasoveanu Thu, 04 Jul 2013 01:58:01 -0700

Hello all,

Sorry for re-posting this. First time I got an error message because I was
not subscribed to this list.


I tried running the DBPedia Extraction Framework on Windows.
I used these settings in the pom.xml:

                         <launcher>
                            <id>import</id>

<mainClass>org.dbpedia.extraction.dump.sql.Import</mainClass>
                            <jvmArgs>
                                <jvmArg>-server</jvmArg>
                            </jvmArgs>
                            <args>
                                <!-- base folder of downloaded dumps -->
                                <arg>/home/release/wikipedia</arg>
                                <!-- location of SQL file containing
MediaWiki table definitions  -->

<arg>/home/release/data/projects/mediawiki/core/maintenance/tables.sql</arg>
                                <!-- JDBC URL of MySQL server. Import
creates a new database for each wiki. -->

<arg>jdbc:mysql://localhost/?characterEncoding=UTF-8</arg>
                                <!-- require-download-complete -->
                                <arg>true</arg>
                                <!-- file name:
pages-articles.xml{,.bz2,.gz} -->
                                <arg>pages-articles.xml.bz2</arg>
                                <!-- languages and article count ranges,
comma-separated, e.g. "de,en" or "@mappings" etc. -->
                                <arg>en</arg>
                            </args>
                        </launcher>

The error I got was this:
The error that I get is this:

[INFO] launcher 'import' selected => org.dbpedia.extraction.dump.sql.Import
java.lang.reflect.InvocationTargetException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org_scala_tools_maven_executions.MainHelper.runMain(MainHelper.java:161)
at
org_scala_tools_maven_executions.MainWithArgsInFile.main(MainWithArgsInFile.java:26)
Caused by: java.io.FileNotFoundException:
\home\release\data\projects\mediawiki\core\maintenance\tables.sql (The
system cannot find the path specified)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:138)
at scala.io.Source$.fromFile(Source.scala:91)
at scala.io.Source$.fromFile(Source.scala:76)
at org.dbpedia.extraction.dump.sql.Import$.main(Import.scala:32)
at org.dbpedia.extraction.dump.sql.Import.main(Import.scala)
... 6 more

So it appears that I need to have mediawiki even though I don't want to
extract the abstracts...

My questions are this:
1) assuming that I do not want to generate the abstracts yet (I just want
to see how it works and how to create custom dumps),
do I still need a copy of the next things:
local MediaWiki and Wikipedia (http://wiki.dbpedia.org/Documentation;
https://github.com/dbpedia/extraction-framework/wiki/Extraction-Instructions
 and
http://wiki.dbpedia.org/Documentation/ExtractionConfiguration?v=17gm - do
not mention that I need a MediaWiki - and Wikipedia mirror except if I want
to extract abstracts);

2) Does this process works on Windows? Do I still need to provide old dumps
in order to run this framework?

3) Where can I setup the default configuration file that I will use? There
is no default configuration specified in the pom file...  so that when I
run the scala plugin it will automatically use that config file...


Best regards,
Adrian

------------------------------------------------------------------------------
This SF.net email is sponsored by Windows:

Build for Windows Store.

http://p.sf.net/sfu/windows-dev2dev

_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

[Dbpedia-discussion] dbpedia extraction framework on windows problems

Reply via email to