[jira] [Commented] (LUCENE-8526) StandardTokenizer doesn't separate hangul characters from other non-CJK chars

2018-10-05 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16640282#comment-16640282 ] Jim Ferenczi commented on LUCENE-8526: -- Sounds great [~steve_rowe]. I'll prepare a patch. >

[jira] [Commented] (LUCENE-8526) StandardTokenizer doesn't separate hangul characters from other non-CJK chars

2018-10-05 Thread Steve Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16640279#comment-16640279 ] Steve Rowe commented on LUCENE-8526: bq. We can maybe add a note in the CJKBigram filter regarding

[jira] [Commented] (LUCENE-8526) StandardTokenizer doesn't separate hangul characters from other non-CJK chars

2018-10-05 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16640245#comment-16640245 ] Jim Ferenczi commented on LUCENE-8526: -- Ok thanks for explaining [~steve_rowe]. I thought that