[ https://issues.apache.org/jira/browse/LUCENENET-119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Digy updated LUCENENET-119: --------------------------- Attachment: CharTokenizer.patch > CharTokenizer can not detect the end of stream > ---------------------------------------------- > > Key: LUCENENET-119 > URL: https://issues.apache.org/jira/browse/LUCENENET-119 > Project: Lucene.Net > Issue Type: Bug > Environment: Lucene.Net 2.3.1 > Reporter: Digy > Attachments: CharTokenizer.patch > > > CharTokenizer can not detect the end of stream since input stream does not > return -1 on EOS, instead it returns 0. > {code} > dataLen = input.Read(....) > if (dataLen == -1 ) > {code} > Analyzers that use that class (like WhitespaceAnalyzer, SimpleAnalyzer etc.) > return tokens with length 255 at the end (only last token). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.