As I know: for east Asian Languages(which without space for word segment in natural), as an non-dictionary based solution, bigram based word segment maybe the best way.
Regards Che, Dong ----- Original Message ----- From: "Erik Hatcher" <[EMAIL PROTECTED]> To: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Saturday, January 31, 2004 1:14 AM Subject: Re: Japanese Analyzer > On Jan 29, 2004, at 1:45 PM, Otis Gospodnetic wrote: > > --- "Weir, Michael" <[EMAIL PROTECTED]> wrote: > >> Is the CJKAnalyzer the best to use for Japanese? If not, which is? > >> If so, > >> from where can I download it? > > There is also a ChineseTokenizer/Analyzer in the sandbox as well. It > may have value for Japanese as well? > > Erik > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > >
