[
https://issues.apache.org/jira/browse/SOLR-16010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17492631#comment-17492631
]
Jan Høydahl commented on SOLR-16010:
------------------------------------
Currently these jars are packaged in {{{}langid/lib{}}}:
{code:java}
jsonic-1.2.7.jar
langdetect-1.1-20120112.jar
opennlp-tools-1.9.1.jar
solr-langid-10.0.0-SNAPSHOT.jar
tika-core-1.27.jar {code}
I think this is enough, as tika 1.x has not yet split language detection out
from core, like 2.x has?
> langid should include all required Tika dependencies
> ----------------------------------------------------
>
> Key: SOLR-16010
> URL: https://issues.apache.org/jira/browse/SOLR-16010
> Project: Solr
> Issue Type: Task
> Security Level: Public(Default Security Level. Issues are Public)
> Components: contrib - LangId
> Reporter: Kevin Risden
> Priority: Major
>
> Currently, the langid module requires that extraction module to be loaded for
> langid to work. It isn't clear if what is included in the extraction module
> will even meet the langid needs (ie: tika-langdetect isn't included in
> extraction module)
> {code:java}
> ➜ solr git:(SOLR-15989) find solr/packaging/build/solr-10.0.0-SNAPSHOT/
> -name '*tika*.jar'
> solr/packaging/build/solr-10.0.0-SNAPSHOT/modules/langid/lib/tika-core-1.27.jar
> solr/packaging/build/solr-10.0.0-SNAPSHOT/modules/extraction/lib/tika-parsers-1.27.jar
> solr/packaging/build/solr-10.0.0-SNAPSHOT/modules/extraction/lib/tika-java7-1.27.jar
> solr/packaging/build/solr-10.0.0-SNAPSHOT/modules/extraction/lib/tika-xmp-1.27.jar
> solr/packaging/build/solr-10.0.0-SNAPSHOT/modules/extraction/lib/vorbis-java-tika-0.8.jar
> solr/packaging/build/solr-10.0.0-SNAPSHOT/modules/extraction/lib/tika-core-1.27.jar
> {code}
> This came out of a discussion in SOLR-15989 -
> https://github.com/apache/solr/pull/621#discussion_r806083202
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]