On Thu, Jan 24, 2013 at 9:25 AM, Jerome Lanneluc <jerome_lanne...@fr.ibm.com> wrote: > Note the 2 tokens in the second sample when I would expect to have only one > token with the (55401 57046) characters. > > I could not figure out if I'm doing something wrong, or if this is a bug in > the Chinese analyzer. >
Which analyzer specifically? there is more than one... --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org