[
https://issues.apache.org/jira/browse/SOLR-17958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jan Høydahl resolved SOLR-17958.
--------------------------------
Fix Version/s: 9.10
Resolution: Fixed
> Deprecate TikaLanguageIdentifierUpdateProcessor in v9.10
> --------------------------------------------------------
>
> Key: SOLR-17958
> URL: https://issues.apache.org/jira/browse/SOLR-17958
> Project: Solr
> Issue Type: Improvement
> Components: contrib - LangId
> Reporter: Jan Høydahl
> Assignee: Jan Høydahl
> Priority: Major
> Labels: pull-request-available
> Fix For: 9.10
>
> Time Spent: 40m
> Remaining Estimate: 0h
>
> In the 'langid' module, we have three implementations of language detectors.
> The oldest one is TikaLanguageIdentifierUpdateProcessor, but we also have
> LangDetectLanguageIdentifierUpdateProcessor and
> OpenNLPLangDetectUpdateProcessor.
> This JIRA will deprecate TikaLanguageIdentifierUpdateProcessor.
> Reasons are:
> - The others are proably better
> - We want to remove Tika as a direct Solr dependency
> - The tika identifier is based on a Tika 1.x API that has been removed (they
> are now "Detectors" instead)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]