[
https://issues.apache.org/jira/browse/LUCENE-2522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
David Smiley updated LUCENE-2522:
---------------------------------
Fix Version/s: (was: 4.7)
4.8
> add simple japanese tokenizer, based on tinysegmenter
> -----------------------------------------------------
>
> Key: LUCENE-2522
> URL: https://issues.apache.org/jira/browse/LUCENE-2522
> Project: Lucene - Core
> Issue Type: New Feature
> Components: modules/analysis
> Reporter: Robert Muir
> Priority: Minor
> Fix For: 4.8
>
> Attachments: LUCENE-2522.patch, LUCENE-2522.patch, LUCENE-2522.patch
>
>
> TinySegmenter (http://www.chasen.org/~taku/software/TinySegmenter/) is a tiny
> japanese segmenter.
> It was ported to java/lucene by Kohei TAKETA <[email protected]>,
> and is under friendly license terms (BSD, some files explicitly disclaim
> copyright to the source code, giving a blessing instead)
> Koji knows the author, and already contacted about incorporating into lucene:
> {noformat}
> I've contacted Takeda-san who is the creater of Java version of
> TinySegmenter. He said he is happy if his program is part of Lucene.
> He is a co-author of my book about Solr published in Japan, BTW. ;-)
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]