[ https://issues.apache.org/jira/browse/SOLR-3107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202613#comment-13202613 ]
Christian Moen commented on SOLR-3107:
--------------------------------------
Attached a trivial patch tested on {{trunk}}.
> Disable random sampling in LangDetectLanguageIdentifierUpdateProcessor
> ----------------------------------------------------------------------
>
> Key: SOLR-3107
> URL: https://issues.apache.org/jira/browse/SOLR-3107
> Project: Solr
> Issue Type: Improvement
> Components: contrib - LangId
> Affects Versions: 3.6, 4.0
> Reporter: Christian Moen
> Priority: Minor
> Attachments: SOLR-3107.patch
>
>
> The {{language-detection}} library used by
> {{LangDetectLanguageIdentifierUpdateProcessor}} uses a random sampling
> feature enabled by default as a means of avoiding local noise in input. The
> feature has its merits, but it can also be confusing to users who aren't
> aware of it since it may give different results on the same input. I recommend
> turning it off to prevent confusion.
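
To illustrate the behaviour described above, here is a minimal sketch of why randomly sampled input can give different results from run to run, and why fixing the seed makes the outcome repeatable. It does not call the {{language-detection}} library and does not reproduce the attached patch; the class {{SeededSamplingSketch}} and the method {{samplePositions}} are hypothetical names used only for this example.

```java
import java.util.Arrays;
import java.util.Random;

public class SeededSamplingSketch {

    // Toy stand-in for a detector that inspects a random sample of its input.
    // Assumption: this is NOT the language-detection library's algorithm; it
    // only shows how the source of randomness affects repeatability.
    static int[] samplePositions(String text, int sampleSize, Random random) {
        int[] positions = new int[sampleSize];
        for (int i = 0; i < sampleSize; i++) {
            // Pick a random character offset within the input.
            positions[i] = random.nextInt(text.length());
        }
        return positions;
    }

    public static void main(String[] args) {
        String input = "The quick brown fox jumps over the lazy dog";

        // Fixed seed: both runs draw exactly the same positions.
        int[] first = samplePositions(input, 5, new Random(0L));
        int[] second = samplePositions(input, 5, new Random(0L));
        System.out.println(Arrays.equals(first, second)); // prints "true"

        // No explicit seed: the positions may differ between runs, which is
        // the kind of behaviour that can confuse users on identical input.
        int[] third = samplePositions(input, 5, new Random());
        System.out.println(Arrays.toString(third));
    }
}
```

With a fixed seed, identical input yields an identical sample, so any result derived from that sample is the same on every run.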