Hi everybody,
I am a new Spotlight user and I would like to ask for help with the process of
creating a model from my own data. I have followed this guide
https://github.com/dbpedia-spotlight/dbpedia-spotlight/wiki/Data-generation-manual
and I have successfully generated the DBpedia dataset from a wiki dump with the
extraction framework. When I launch the first command of index.sh, which
extracts URIs and surface forms, I receive only some warnings in the output and
no errors, but conceptURIs.list, redirects_tc.tsv and surfaceForms-fromOccs.tsv
are empty. The output of my command follows. I hope that someone can help me.
michele@michele:~/dbpedia_spotlight/dbpedia-spotlight/index$ mvn -e
-Dmaven.wagon.http.ssl.insecure=true -Dmaven.wagon.http.ssl.allowall=true
-Dmaven.wagon.http.ssl.ignore.validity.dates=true scala:run
-Dlauncher=ExtractCandidateMap "-DjavaOpts.Xmx=$JAVA_XMX"
"-DaddArgs=$INDEX_CONFIG_FILE"
[INFO] Error stacktraces are turned on.
[INFO] Scanning for projects...
[INFO]
[INFO] ------------------------------------------------------------------------
[INFO] Building DBpedia Spotlight Indexing 0.7
[INFO] ------------------------------------------------------------------------
[INFO]
[INFO] >>> scala-maven-plugin:3.1.5:run (default-cli) @ index >>>
Downloading:
http://scala-tools.org/repo-releases/edu/umd/cloud9/SNAPSHOT/maven-metadata.xml
[WARNING] Could not transfer metadata
edu.umd:cloud9:SNAPSHOT/maven-metadata.xml from/to scala-tools.org
(http://scala-tools.org/repo-releases/): Received fatal alert: protocol_version
[WARNING] Failure to transfer edu.umd:cloud9:SNAPSHOT/maven-metadata.xml from
http://scala-tools.org/repo-releases/ was cached in the local repository,
resolution will not be reattempted until the update interval of scala-tools.org
has elapsed or updates are forced. Original error: Could not transfer metadata
edu.umd:cloud9:SNAPSHOT/maven-metadata.xml from/to scala-tools.org
(http://scala-tools.org/repo-releases/): Received fatal alert: protocol_version
Downloading:
http://scala-tools.org/repo-releases/weka/kea-goss-weka/5.0-SNAPSHOT/maven-metadata.xml
[WARNING] Could not transfer metadata
weka:kea-goss-weka:5.0-SNAPSHOT/maven-metadata.xml from/to scala-tools.org
(http://scala-tools.org/repo-releases/): Received fatal alert: protocol_version
[WARNING] Failure to transfer
weka:kea-goss-weka:5.0-SNAPSHOT/maven-metadata.xml from
http://scala-tools.org/repo-releases/ was cached in the local repository,
resolution will not be reattempted until the update interval of scala-tools.org
has elapsed or updates are forced. Original error: Could not transfer metadata
weka:kea-goss-weka:5.0-SNAPSHOT/maven-metadata.xml from/to scala-tools.org
(http://scala-tools.org/repo-releases/): Received fatal alert: protocol_version
Downloading:
http://scala-tools.org/repo-releases/jdbm/jdbm/3.0-SNAPSHOT/maven-metadata.xml
[WARNING] Could not transfer metadata jdbm:jdbm:3.0-SNAPSHOT/maven-metadata.xml
from/to scala-tools.org (http://scala-tools.org/repo-releases/): Received fatal
alert: protocol_version
[WARNING] Failure to transfer jdbm:jdbm:3.0-SNAPSHOT/maven-metadata.xml from
http://scala-tools.org/repo-releases/ was cached in the local repository,
resolution will not be reattempted until the update interval of scala-tools.org
has elapsed or updates are forced. Original error: Could not transfer metadata
jdbm:jdbm:3.0-SNAPSHOT/maven-metadata.xml from/to scala-tools.org
(http://scala-tools.org/repo-releases/): Received fatal alert: protocol_version
[INFO]
[INFO] --- maven-enforcer-plugin:1.2:enforce (enforce-maven-3) @ index ---
[INFO]
[INFO] --- maven-enforcer-plugin:1.2:enforce (enforce-java-newer-6) @ index ---
[INFO]
[INFO] --- maven-resources-plugin:2.3:resources (default-resources) @ index ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] skip non existing resourceDirectory
/home/michele/dbpedia_spotlight/dbpedia-spotlight/index/src/main/resources
[INFO]
[INFO] --- scala-maven-plugin:3.1.5:add-source (scala-compile-first) @ index ---
[INFO] Add Source directory:
/home/michele/dbpedia_spotlight/dbpedia-spotlight/index/src/main/scala
[INFO] Add Test Source directory:
/home/michele/dbpedia_spotlight/dbpedia-spotlight/index/src/test/scala
[INFO]
[INFO] --- scala-maven-plugin:3.1.5:compile (scala-compile-first) @ index ---
[WARNING] Expected all dependencies to require Scala version: 2.9.2
[WARNING] net.liftweb:lift-json_2.9.2:2.5-M1 requires scala version: 2.9.2
[WARNING] net.liftweb:lift-json_2.9.2:2.5-M1 requires scala version: 2.9.2
[WARNING] org.scala-lang:scalap:2.9.2 requires scala version: 2.9.2
[WARNING] org.scalatest:scalatest_2.9.2:2.0.M4 requires scala version: 2.9.2
[WARNING] org.scalaz:scalaz-core_2.9.2:6.0.4 requires scala version: 2.9.2
[WARNING] org.scalanlp:breeze-math_2.9.2:0.1 requires scala version: 2.9.2
[WARNING] org.scalanlp:breeze-process_2.9.2:0.1 requires scala version: 2.9.2
[WARNING] org.scalanlp:breeze-learn_2.9.2:0.1 requires scala version: 2.9.2
[WARNING] com.codecommit:anti-xml_2.9.1:0.3 requires scala version: 2.9.1
[WARNING] Multiple versions of scala libraries detected!
[INFO] Nothing to compile - all classes are up to date
[INFO]
[INFO] --- maven-compiler-plugin:3.1:compile (default-compile) @ index ---
[INFO] Nothing to compile - all classes are up to date
[INFO]
[INFO] --- maven-resources-plugin:2.3:testResources (default-testResources) @
index ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] skip non existing resourceDirectory
/home/michele/dbpedia_spotlight/dbpedia-spotlight/index/src/test/resources
[INFO]
[INFO] --- maven-compiler-plugin:3.1:testCompile (default-testCompile) @ index
---
[INFO] Nothing to compile - all classes are up to date
[INFO]
[INFO] <<< scala-maven-plugin:3.1.5:run (default-cli) @ index <<<
[INFO]
[INFO] --- scala-maven-plugin:3.1.5:run (default-cli) @ index ---
[WARNING] Expected all dependencies to require Scala version: 2.9.2
[WARNING] net.liftweb:lift-json_2.9.2:2.5-M1 requires scala version: 2.9.2
[WARNING] net.liftweb:lift-json_2.9.2:2.5-M1 requires scala version: 2.9.2
[WARNING] org.scala-lang:scalap:2.9.2 requires scala version: 2.9.2
[WARNING] org.scalatest:scalatest_2.9.2:2.0.M4 requires scala version: 2.9.2
[WARNING] org.scalaz:scalaz-core_2.9.2:6.0.4 requires scala version: 2.9.2
[WARNING] org.scalanlp:breeze-math_2.9.2:0.1 requires scala version: 2.9.2
[WARNING] org.scalanlp:breeze-process_2.9.2:0.1 requires scala version: 2.9.2
[WARNING] org.scalanlp:breeze-learn_2.9.2:0.1 requires scala version: 2.9.2
[WARNING] com.codecommit:anti-xml_2.9.1:0.3 requires scala version: 2.9.1
[WARNING] Multiple versions of scala libraries detected!
[INFO] launcher 'ExtractCandidateMap' selected =>
org.dbpedia.spotlight.util.ExtractCandidateMap
INFO 2016-12-14 12:11:41,342 main [IndexingConfiguration] - Loading
configuration file
/home/michele/dbpedia_spotlight/dbpedia-spotlight/conf/indexing.properties
INFO 2016-12-14 12:11:41,383 main [ExtractCandidateMap$] - Creating concept
URIs file
/home/michele/dbpedia_spotlight/extraction-framework/dump/itwiki/data/conceptURIs.list
...
INFO 2016-12-14 12:11:41,384 main [ExtractCandidateMap$] - collecting bad
URIs from redirects in
/home/michele/dbpedia_spotlight/extraction-framework/dump/itwiki/20161201/itwiki-20161201-redirects.nt.bz2
and disambiguations in
/home/michele/dbpedia_spotlight/extraction-framework/dump/itwiki/20161201/itwiki-20161201-disambiguations-unredirected.nq.bz2
...
warning on line 1 # started 2016-12-12T15:37:32Z : cannot parse 0th element: #
started 2016-12-12T15:37:32Z
warning in NqParser.next on line 649341 # completed 2016-12-12T19:03:28Z :
cannot parse 0th element: # completed 2016-12-12T19:03:28Z
warning on line 1 # started 2016-12-12T15:37:33Z : cannot parse 0th element: #
started 2016-12-12T15:37:33Z
warning in NqParser.next on line 362519 # completed 2016-12-12T19:03:30Z :
cannot parse 0th element: # completed 2016-12-12T19:03:30Z
INFO 2016-12-14 12:11:55,158 main [ExtractCandidateMap$] - collecting
concept URIs from titles in
/home/michele/dbpedia_spotlight/extraction-framework/dump/itwiki/20161201/itwiki-20161201-labels.nt.bz2,
without redirects and disambiguations...
warning on line 1 # started 2016-12-12T15:37:32Z : cannot parse 0th element: #
started 2016-12-12T15:37:32Z
warning in NqParser.next on line 1959649 # completed 2016-12-12T19:03:27Z :
cannot parse 0th element: # completed 2016-12-12T19:03:27Z
INFO 2016-12-14 12:12:11,445 main [ExtractCandidateMap$] - Done.
INFO 2016-12-14 12:12:11,446 main [ExtractCandidateMap$] - Creating redirects
transitive closure file
/home/michele/dbpedia_spotlight/extraction-framework/dump/itwiki/20161201/itwiki-20161201-redirects.nt.bz2
...
INFO 2016-12-14 12:12:11,446 main [ExtractCandidateMap$] - loading concept
URIs from
/home/michele/dbpedia_spotlight/extraction-framework/dump/itwiki/data/conceptURIs.list...
INFO 2016-12-14 12:12:11,446 main [ExtractCandidateMap$] - loading redirects
from
/home/michele/dbpedia_spotlight/extraction-framework/dump/itwiki/20161201/itwiki-20161201-redirects.nt.bz2...
warning on line 1 # started 2016-12-12T15:37:32Z : cannot parse 0th element: #
started 2016-12-12T15:37:32Z
warning in NqParser.next on line 649341 # completed 2016-12-12T19:03:28Z :
cannot parse 0th element: # completed 2016-12-12T19:03:28Z
INFO 2016-12-14 12:12:19,058 main [ExtractCandidateMap$] - collecting
redirects transitive closure...
INFO 2016-12-14 12:12:19,534 main [ExtractCandidateMap$] - Done.
INFO 2016-12-14 12:12:19,535 main [ExtractCandidateMap$] - Creating surface
forms file
/home/michele/dbpedia_spotlight/extraction-framework/dump/itwiki/data/surfaceForms-fromOccs.tsv
...
INFO 2016-12-14 12:12:19,535 main [ExtractCandidateMap$] - storing titles of
concept URIs...
INFO 2016-12-14 12:12:19,549 main [ExtractCandidateMap$] - storing titles of
redirect and disambiguation URIs...
warning on line 1 # started 2016-12-12T15:37:32Z : cannot parse 0th element: #
started 2016-12-12T15:37:32Z
warning in NqParser.next on line 649341 # completed 2016-12-12T19:03:28Z :
cannot parse 0th element: # completed 2016-12-12T19:03:28Z
warning on line 1 # started 2016-12-12T15:37:33Z : cannot parse 0th element: #
started 2016-12-12T15:37:33Z
warning in NqParser.next on line 362519 # completed 2016-12-12T19:03:30Z :
cannot parse 0th element: # completed 2016-12-12T19:03:30Z
INFO 2016-12-14 12:12:30,936 main [ExtractCandidateMap$] - Done.
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 58.923s
[INFO] Finished at: Wed Dec 14 12:12:31 CET 2016
[INFO] Final Memory: 17M/219M
[INFO] ------------------------------------------------------------------------
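To narrow down where the data is lost, it may help to check how many real
data lines the input dumps contain and which output files are actually empty.
The following is only a minimal sketch, not part of Spotlight: the function
names count_data_lines and empty_files are hypothetical, and it assumes the
inputs are bzip2-compressed text files in which comment lines start with "#"
(as the "# started" and "# completed" lines in the warnings above suggest).

```python
import bz2
import os
import sys


def count_data_lines(path):
    """Count lines in a .bz2 text file that are neither blank nor '#' comments."""
    count = 0
    with bz2.open(path, "rt", encoding="utf-8", errors="replace") as handle:
        for line in handle:
            stripped = line.strip()
            if stripped and not stripped.startswith("#"):
                count += 1
    return count


def empty_files(paths):
    """Return the paths that are missing or have a size of zero bytes."""
    return [p for p in paths if not os.path.isfile(p) or os.path.getsize(p) == 0]


if __name__ == "__main__":
    # Hypothetical usage: pass any mix of input dumps (.bz2) and output files.
    for path in sys.argv[1:]:
        if path.endswith(".bz2") and os.path.isfile(path):
            print(path, "data lines:", count_data_lines(path))
    for path in empty_files(sys.argv[1:]):
        print(path, "is missing or empty")
```

If the inputs report many data lines while the outputs are still empty, the
triples are probably being read but then filtered out, which would point at the
settings in indexing.properties rather than at the dumps themselves.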
Thanks in advance,
Have a good day
Michele