[
https://issues.apache.org/jira/browse/SOLR-14801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Eric Pugh resolved SOLR-14801.
------------------------------
Resolution: Won't Fix
In Solr 10 we are leveraging either Tika Server (running in it's own seperate
server process) or maybe Tika Pipes (again, running in a seperate JVM).
Please revalidate your issue against Solr 10 with one of those options, and if
it is still present need, happy to work with you on a fix using the new
approach for Tika.
> Multiple Language Detection is not reflecting properly with apache Tika/Solr
> Jar ()
> -----------------------------------------------------------------------------------
>
> Key: SOLR-14801
> URL: https://issues.apache.org/jira/browse/SOLR-14801
> Project: Solr
> Issue Type: Bug
> Components: contrib - LangId
> Reporter: Navodit Bansod
> Priority: Major
>
> Hi Team,
> Please find the following issues occurring in case of multiple lang
> detection in apache Solr :
> # Primary and Secondary language is not getting detected using separate
> fields/attributes for each. The language is getting generalized with the
> language having major chunk of data and thus reflect as same is both fields -
> "lang and langs" (attribute primary and secondary language)
> # The Distance(or length) setting parameter in solrconfig.xml is properly
> SET in our cluster but still it seems this parameter is not showing any
> difference with change of values. (
> <str name="langid.threshold">0.2</str>)
> # Following Versions are being used in our solr cloud setup:
> # tika-core-1.24.1.jar
> # tika-parsers-1.24.1.jar
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]