Munkyu Im created LUCENE-8784:
---------------------------------
Summary: Nori(Korean) tokenizer removes the decimal point.
Key: LUCENE-8784
URL: https://issues.apache.org/jira/browse/LUCENE-8784
Project: Lucene - Core
Issue Type: Improvement
Reporter: Munkyu Im
This is the same issue that I mentioned in
[https://github.com/elastic/elasticsearch/issues/41401#event-2293189367].
Unlike the standard analyzer, the nori analyzer removes the decimal point:
the nori tokenizer discards the "." character by default.
As a result, it is difficult to index keywords that include a decimal point.
It would be nice to have an option that controls whether the decimal point is kept,
as the Japanese tokenizer does.
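
To illustrate the requested behavior, here is a toy sketch in Python. It is not Nori's or the Japanese tokenizer's actual implementation; the function name `toy_tokenize` and the flag `discard_punctuation` are hypothetical and only model the idea of an on/off switch for dropping punctuation.

```python
import re

def toy_tokenize(text, discard_punctuation=True):
    """Toy model only: split text into word runs and single punctuation marks.

    With discard_punctuation=True (modeling the default described above),
    punctuation tokens such as "." are dropped from the output.
    """
    # Runs of word characters, or any single non-word, non-space character.
    tokens = re.findall(r"\w+|[^\w\s]", text)
    if discard_punctuation:
        # Keep only tokens that start with a word character.
        tokens = [t for t in tokens if re.match(r"\w", t)]
    return tokens

print(toy_tokenize("10.1"))                             # ['10', '1']
print(toy_tokenize("10.1", discard_punctuation=False))  # ['10', '.', '1']
```

With the flag off, the "." survives as its own token, so a later filter could in principle rejoin "10", ".", "1" into a single keyword.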