Hello, Have you tried cmecab-java? This library provides tokenizers using MeCab (Japanese dependency structure analyzer). http://code.google.com/p/cmecab-java/
Here is a sample schema.xml. http://code.google.com/p/cmecab-java/wiki/HowToUse On 2011/04/04, at 20:16, Erik Hatcher wrote: > I'm trying to cull together the best practices for indexing/searching > Japanese text. > > For those of you using Solr, what analyzer/field-type definition do you have > for Japanese? > > Thanks for sharing! > Erik -- Kosuke Tanabe
