peina created LUCENE-7509:
-
Summary: [smartcn] Some chinese text is not tokenized correctly
with Chinese punctuation marks appended
Key: LUCENE-7509
URL: https://issues.apache.org/jira/browse/LUCENE-7509
peina created LUCENE-7508:
-
Summary: [smartcn] tokens are not correctly created if text length
> 1024
Key: LUCENE-7508
URL: https://issues.apache.org/jira/browse/LUCENE-7508
Project: Lucene - Core
[
https://issues.apache.org/jira/browse/LUCENE-7508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15744568#comment-15744568
]
peina commented on LUCENE-7508:
---
Great, thanks!
> [smartcn] tokens are not correctly created if text
[
https://issues.apache.org/jira/browse/LUCENE-7508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15744140#comment-15744140
]
peina commented on LUCENE-7508:
---
Hi Chang KaiShin,
Thanks for your patch, but I don't think it makes any
[
https://issues.apache.org/jira/browse/LUCENE-7508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753817#comment-15753817
]
peina commented on LUCENE-7508:
---
glad to know my previous fix gave you at least some hint :)
> [smartcn]
[
https://issues.apache.org/jira/browse/LUCENE-7509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15721497#comment-15721497
]
peina commented on LUCENE-7509:
---
BTW, is there any chance that
[
https://issues.apache.org/jira/browse/LUCENE-7509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15721489#comment-15721489
]
peina commented on LUCENE-7509:
---
Thanks. Make sense to me.
> [smartcn] Some chinese text is not tokenized