Hello all,
Sorry for re-posting this. First time I got an error message because I was
not subscribed to this list.
I tried running the DBPedia Extraction Framework on Windows.
I used these settings in the pom.xml:
<launcher>
<id>import</id>
<mainClass>org.dbpedia.extraction.dump.sql.Import</mainClass>
<jvmArgs>
<jvmArg>-server</jvmArg>
</jvmArgs>
<args>
<!-- base folder of downloaded dumps -->
<arg>/home/release/wikipedia</arg>
<!-- location of SQL file containing
MediaWiki table definitions -->
<arg>/home/release/data/projects/mediawiki/core/maintenance/tables.sql</arg>
<!-- JDBC URL of MySQL server. Import
creates a new database for each wiki. -->
<arg>jdbc:mysql://localhost/?characterEncoding=UTF-8</arg>
<!-- require-download-complete -->
<arg>true</arg>
<!-- file name:
pages-articles.xml{,.bz2,.gz} -->
<arg>pages-articles.xml.bz2</arg>
<!-- languages and article count ranges,
comma-separated, e.g. "de,en" or "@mappings" etc. -->
<arg>en</arg>
</args>
</launcher>
The error I got was this:
The error that I get is this:
[INFO] launcher 'import' selected => org.dbpedia.extraction.dump.sql.Import
java.lang.reflect.InvocationTargetException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org_scala_tools_maven_executions.MainHelper.runMain(MainHelper.java:161)
at
org_scala_tools_maven_executions.MainWithArgsInFile.main(MainWithArgsInFile.java:26)
Caused by: java.io.FileNotFoundException:
\home\release\data\projects\mediawiki\core\maintenance\tables.sql (The
system cannot find the path specified)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:138)
at scala.io.Source$.fromFile(Source.scala:91)
at scala.io.Source$.fromFile(Source.scala:76)
at org.dbpedia.extraction.dump.sql.Import$.main(Import.scala:32)
at org.dbpedia.extraction.dump.sql.Import.main(Import.scala)
... 6 more
So it appears that I need to have mediawiki even though I don't want to
extract the abstracts...
My questions are this:
1) assuming that I do not want to generate the abstracts yet (I just want
to see how it works and how to create custom dumps),
do I still need a copy of the next things:
local MediaWiki and Wikipedia (http://wiki.dbpedia.org/Documentation;
https://github.com/dbpedia/extraction-framework/wiki/Extraction-Instructions
and
http://wiki.dbpedia.org/Documentation/ExtractionConfiguration?v=17gm - do
not mention that I need a MediaWiki - and Wikipedia mirror except if I want
to extract abstracts);
2) Does this process works on Windows? Do I still need to provide old dumps
in order to run this framework?
3) Where can I setup the default configuration file that I will use? There
is no default configuration specified in the pom file... so that when I
run the scala plugin it will automatically use that config file...
Best regards,
Adrian
------------------------------------------------------------------------------
This SF.net email is sponsored by Windows:
Build for Windows Store.
http://p.sf.net/sfu/windows-dev2dev
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion