[ https://issues.apache.org/jira/browse/LUCENE-8817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16859982#comment-16859982 ]
Tomoko Uchida commented on LUCENE-8817: --------------------------------------- Sorry, Elasticseach analysis plugins heavily use "kuromoji" or "nori" in their naming so the change I wrote affects Elasticsearch users. (I feel like that it should use "japanese" or "korean" instead of "kuromoji" or "nori"...) > Combine Nori and Kuromoji DictionaryBuilder > ------------------------------------------- > > Key: LUCENE-8817 > URL: https://issues.apache.org/jira/browse/LUCENE-8817 > Project: Lucene - Core > Issue Type: New Feature > Reporter: Namgyu Kim > Priority: Major > > This issue is related to LUCENE-8816. > Currently Nori and Kuromoji Analyzer use the same dictionary structure. > (MeCab) > If we make combine DictionaryBuilder, we can reduce the code size. > But this task may have a dependency on the language. > (like HEADER string in BinaryDictionary and CharacterDefinition, methods in > BinaryDictionaryWriter, ...) > On the other hand, there are many overlapped classes. > The purpose of this patch is to provide users of Nori and Kuromoji with the > same system dictionary generator. > It may take some time because there is a little workload. > The work will be based on the latest master, and if the LUCENE-8816 is > finished first, I will pull the latest code and proceed. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org