[ https://issues.apache.org/jira/browse/LUCENE-8904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namgyu Kim updated LUCENE-8904:
-------------------------------
Fix Version/s: master (9.0), 8.x
> Enhance Nori DictionaryBuilder tool
> -----------------------------------
>
> Key: LUCENE-8904
> URL: https://issues.apache.org/jira/browse/LUCENE-8904
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Namgyu Kim
> Assignee: Namgyu Kim
> Priority: Major
> Fix For: 8.x, master (9.0)
>
> Time Spent: 1h 50m
> Remaining Estimate: 0h
>
> This is the Nori counterpart of [~sokolov]'s LUCENE-8863.
> The patch makes two changes:
> 1) Improve exception handling
> 2) Enable an external dictionary for testing
> Overall, it is the same as LUCENE-8863,
> but Nori and Kuromoji differ in a few ways,
> so the code differs slightly in these places:
> 1) CSV field size
> Nori : 12
> Kuromoji : 13
> 2) left context ID == right context ID
> Nori : can be different
> Kuromoji : always same
> 3) Dictionary Type
> Nori : just one type
> Kuromoji : IPADIC, UNIDIC
> After this issue is resolved, I'll apply LUCENE-8866 and LUCENE-8871 to Nori.
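
To make the differences listed above concrete, here is a minimal sketch of the kind of per-entry validation they imply. It is not the code from the patch: the class name DictionaryEntryCheck, the method parseEntry, and the requireSameContextIds flag are hypothetical, and the column positions (surface form in column 0, left context ID in column 1, right context ID in column 2) are an assumption based on the usual MeCab-style CSV layout. Only the field counts (12 for Nori, 13 for Kuromoji) and the context ID rule come from the description. The naive split on commas is a simplification and does not handle quoted fields.

```java
// Hypothetical helper, for illustration only; not part of Lucene.
class DictionaryEntryCheck {
    // Field counts as stated in the issue description.
    static final int NORI_FIELDS = 12;
    static final int KUROMOJI_FIELDS = 13;

    /**
     * Splits one CSV dictionary line and validates it.
     *
     * @param line                  raw CSV line (no quoted commas; simplification)
     * @param expectedFields        required number of columns
     * @param requireSameContextIds true if left and right context IDs must match
     * @return the split fields if the entry is valid
     * @throws IllegalArgumentException with a descriptive message otherwise
     */
    static String[] parseEntry(String line, int expectedFields, boolean requireSameContextIds) {
        // Limit -1 keeps trailing empty columns so the count is accurate.
        String[] fields = line.split(",", -1);
        if (fields.length != expectedFields) {
            throw new IllegalArgumentException(
                "Entry in CSV is not valid (" + expectedFields + " fields required, got "
                    + fields.length + "): " + line);
        }
        int leftId;
        int rightId;
        try {
            // Assumed layout: column 1 = left context ID, column 2 = right context ID.
            leftId = Integer.parseInt(fields[1].trim());
            rightId = Integer.parseInt(fields[2].trim());
        } catch (NumberFormatException e) {
            throw new IllegalArgumentException("Context IDs must be integers: " + line, e);
        }
        if (requireSameContextIds && leftId != rightId) {
            throw new IllegalArgumentException(
                "Left and right context IDs differ (" + leftId + " vs " + rightId + "): " + line);
        }
        return fields;
    }
}
```

Under these assumptions, a Nori-style caller would use parseEntry(line, DictionaryEntryCheck.NORI_FIELDS, false), while a Kuromoji-style caller would use parseEntry(line, DictionaryEntryCheck.KUROMOJI_FIELDS, true).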
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)