Thanks Robert. Is there another analyzer I should use?

Jerome



From:   Robert Muir <[email protected]>
To:     [email protected], 
Date:   01/24/2013 06:20 PM
Subject:        Re: Chinese analyzer



On Thu, Jan 24, 2013 at 10:53 AM, Jerome Lanneluc
<[email protected]> wrote:
> It looks like my attachment was lost. It referred to
> org.apache.lucene.analysis.cn.smart.SmartChineseAnalyzer.
>

I think this analyzer will not properly tokenize text outside of the
BMP: it pretty much only works for simplified text (e.g. chars from
GB2312 range)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]




Sauf indication contraire ci-dessus:/ Unless stated otherwise above:
Compagnie IBM France
Siège Social : 17 avenue de l'Europe, 92275 Bois-Colombes Cedex
RCS Nanterre 552 118 465
Forme Sociale : S.A.S.
Capital Social : 653.242.306,20 ?
SIREN/SIRET : 552 118 465 03644 - Code NAF 6202A 

Reply via email to