Munkyu Im created LUCENE-8784:
---------------------------------

             Summary:  Nori(Korean) tokenizer removes the decimal point. 
                 Key: LUCENE-8784
                 URL: https://issues.apache.org/jira/browse/LUCENE-8784
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Munkyu Im


This is the same issue that I mentioned to 
[https://github.com/elastic/elasticsearch/issues/41401#event-2293189367]

unlike standard analyzer, nori analyzer removes the decimal point.

nori tokenizer removes "." character by default.
In this case, it is difficult to index the keywords including the decimal point.

It would be nice if there had the option whether add a decimal point or not 
like Japanese tokenizer

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to