Uwe Schindler created SOLR-4275:
-----------------------------------

             Summary: TrieTokenizer causes StringIOOBE when input is empty instead of returning no token
                 Key: SOLR-4275
                 URL: https://issues.apache.org/jira/browse/SOLR-4275
             Project: Solr
          Issue Type: Bug
    Affects Versions: 4.0
            Reporter: Uwe Schindler
            Assignee: Uwe Schindler
             Fix For: 4.1, 5.0


When you use the admin interface, select a trie field (e.g. tint), and enter 
nothing into the field, the tokenizer should return no tokens. 
TrieTokenizer instead throws a StringIndexOutOfBoundsException (SIOOBE) because 
read() into the char buffer returns -1 (end of stream), and this value is then 
used to initialize the string's length.
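
The failure mode can be reproduced outside Solr with plain JDK classes. The following is a minimal sketch, not the actual TrieTokenizer code: the class EmptyReadDemo and the method readUnchecked are hypothetical names, and the 32-char buffer size is an assumption chosen for illustration.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

public class EmptyReadDemo {
    // Hypothetical helper: reads once and uses the return value of read()
    // directly as the string length, without checking for end of stream.
    static String readUnchecked(Reader in) {
        char[] buf = new char[32];           // assumed buffer size
        int len;
        try {
            len = in.read(buf);              // returns -1 when the stream is empty
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return new String(buf, 0, len);      // throws SIOOBE when len == -1
    }

    public static void main(String[] args) {
        System.out.println(readUnchecked(new StringReader("42")));
        try {
            readUnchecked(new StringReader(""));
        } catch (StringIndexOutOfBoundsException e) {
            System.out.println("empty input: " + e.getClass().getSimpleName());
        }
    }
}
```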

The fix is to modify the tokenizer to behave like a real tokenizer:
- after reading the input, check whether it is empty (read < 0) and, if so, use 
0 as the length
- if the resulting string is empty (total len == 0), set a boolean flag to false 
so that the incrementToken/end methods do not delegate and incrementToken 
returns false.
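
The two steps above can be sketched with a simplified stand-in class. This is an illustration under stated assumptions, not the committed patch: SingleValueTokenizerSketch, its fields, and its method signatures are hypothetical, and the real tokenizer would delegate to an underlying numeric token stream where this sketch just returns the raw string.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

public class SingleValueTokenizerSketch {
    private String value = "";
    private boolean hasValue = false;  // false when the input was empty
    private boolean emitted = false;   // true once the single token was returned

    // Hypothetical reset(): reads the (short) input once.
    public void reset(Reader in) {
        char[] buf = new char[32];     // assumed buffer size
        int len;
        try {
            len = in.read(buf);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        if (len < 0) {
            len = 0;                   // end of stream: treat as empty input
        }
        value = new String(buf, 0, len);
        hasValue = len > 0;            // the boolean described in the second step
        emitted = false;
    }

    // Returns true exactly once for non-empty input, never for empty input.
    public boolean incrementToken() {
        if (!hasValue || emitted) {
            return false;
        }
        emitted = true;
        return true;
    }

    public String term() {
        return value;
    }

    public static void main(String[] args) {
        SingleValueTokenizerSketch t = new SingleValueTokenizerSketch();
        t.reset(new StringReader(""));
        System.out.println("empty input yields token: " + t.incrementToken());
        t.reset(new StringReader("42"));
        System.out.println("non-empty input yields token: " + t.incrementToken());
    }
}
```

Guarding on the flag before doing anything else keeps the empty case out of the delegate entirely, so no downstream code ever sees a zero-length value.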

