[
https://issues.apache.org/jira/browse/SOLR-11231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16124669#comment-16124669
]
ASF GitHub Bot commented on SOLR-11231:
---------------------------------------
GitHub user cbeer opened a pull request:
https://github.com/apache/lucene-solr/pull/232
SOLR-11231 Guard against unset fields when performing language detection
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/cbeer/lucene-solr solr-11231
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/lucene-solr/pull/232.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #232
----
commit 3d944e0edf3283b55faab8201d9a636d76ff4bd4
Author: Chris Beer <[email protected]>
Date: 2017-08-12T17:32:25Z
SOLR-11231 Guard against unset fields when performing language detection
commit 5a0106b5fecdb3b9cbeb3cda7e641cffd05b1694
Author: Chris Beer <[email protected]>
Date: 2017-08-12T17:36:14Z
Short-circuit expected size calculation when it exceeds the maximum length
----
> NullPointerException in TikaLanguageIdentifierUpdateProcessor.getExpectedSize
> when performing language identification using unset fields
> ----------------------------------------------------------------------------------------------------------------------------------------
>
> Key: SOLR-11231
> URL: https://issues.apache.org/jira/browse/SOLR-11231
> Project: Solr
> Issue Type: Bug
> Security Level: Public(Default Security Level. Issues are Public)
> Components: contrib - LangId
> Affects Versions: 6.6, 7.0
> Reporter: Chris Beer
> Priority: Minor
>
> If I configure the TikaLanguageIdentifierUpdateProcessorFactory that includes
> a `langid.fl` that isn't set on the document, I get a NPE:
> {{java.lang.NullPointerException
> at
> org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessor.getExpectedSize(TikaLanguageIdentifierUpdateProcessor.java:116)
> at
> org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessor.concatFields(TikaLanguageIdentifierUpdateProcessor.java:76)
> at
> org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessor.detectLanguage(TikaLanguageIdentifierUpdateProcessor.java:53)
> at
> org.apache.solr.update.processor.LanguageIdentifierUpdateProcessor.process(LanguageIdentifierUpdateProcessor.java:219)}}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]