Hello,

Have you tried cmecab-java? This library provides tokenizers using  
MeCab (Japanese dependency structure analyzer).
http://code.google.com/p/cmecab-java/

Here is a sample schema.xml.
http://code.google.com/p/cmecab-java/wiki/HowToUse

On 2011/04/04, at 20:16, Erik Hatcher wrote:

> I'm trying to cull together the best practices for indexing/searching 
> Japanese text.
> 
> For those of you using Solr, what analyzer/field-type definition do you have 
> for Japanese?
> 
> Thanks for sharing!
>       Erik


-- 
Kosuke Tanabe

Reply via email to