[jira] [Updated] (LUCENE-6435) java.util.ConcurrentModificationException: Removal from the cache failed error in SimpleNaiveBayesClassifier

2015-09-16 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chang KaiShin updated LUCENE-6435: -- Attachment: patch.rtf > java.util.ConcurrentModificationException: Removal from the cache

[jira] [Commented] (LUCENE-6435) java.util.ConcurrentModificationException: Removal from the cache failed error in SimpleNaiveBayesClassifier

2015-09-16 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14747126#comment-14747126 ] Chang KaiShin commented on LUCENE-6435: --- By running the JUnit test in debug mode, I get the

[jira] [Commented] (LUCENE-6435) java.util.ConcurrentModificationException: Removal from the cache failed error in SimpleNaiveBayesClassifier

2015-09-16 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14747081#comment-14747081 ] Chang KaiShin commented on LUCENE-6435: --- The hashcode of class TermQuery doesn't remain consistent

[jira] [Commented] (LUCENE-6435) java.util.ConcurrentModificationException: Removal from the cache failed error in SimpleNaiveBayesClassifier

2015-10-02 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14940825#comment-14940825 ] Chang KaiShin commented on LUCENE-6435: --- Good to hear the problem is solved! >

[jira] [Updated] (LUCENE-7508) [smartcn] tokens are not correctly created if text length > 1024

2016-12-12 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chang KaiShin updated LUCENE-7508: -- Attachment: lucene-7508.patch A hard-coded buffer size of 1024 exists in

[jira] [Commented] (LUCENE-7508) [smartcn] tokens are not correctly created if text length > 1024

2016-12-12 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15744174#comment-15744174 ] Chang KaiShin commented on LUCENE-7508: --- Theoretically, the text size could be unbounded. The ideal

[jira] [Updated] (LUCENE-7508) [smartcn] tokens are not correctly created if text length > 1024

2016-12-12 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chang KaiShin updated LUCENE-7508: -- Attachment: lucene-7508-test.patch failing test case > [smartcn] tokens are not correctly

[jira] [Commented] (LUCENE-7508) [smartcn] tokens are not correctly created if text length > 1024

2016-12-15 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753733#comment-15753733 ] Chang KaiShin commented on LUCENE-7508: --- After I looked into the internal handling of the input

[jira] [Comment Edited] (LUCENE-7509) [smartcn] Some chinese text is not tokenized correctly with Chinese punctuation marks appended

2016-12-01 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15714052#comment-15714052 ] Chang KaiShin edited comment on LUCENE-7509 at 12/2/16 7:56 AM: This is

[jira] [Commented] (LUCENE-7509) [smartcn] Some chinese text is not tokenized correctly with Chinese punctuation marks appended

2016-12-01 Thread Chang KaiShin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15714052#comment-15714052 ] Chang KaiShin commented on LUCENE-7509: --- This is not a bug. The underlying Viterbi algorithm