Thanks Robert. Is there another analyzer I should use? Jerome
From: Robert Muir <[email protected]> To: [email protected], Date: 01/24/2013 06:20 PM Subject: Re: Chinese analyzer On Thu, Jan 24, 2013 at 10:53 AM, Jerome Lanneluc <[email protected]> wrote: > It looks like my attachment was lost. It referred to > org.apache.lucene.analysis.cn.smart.SmartChineseAnalyzer. > I think this analyzer will not properly tokenize text outside of the BMP: it pretty much only works for simplified text (e.g. chars from GB2312 range) --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected] Sauf indication contraire ci-dessus:/ Unless stated otherwise above: Compagnie IBM France Siège Social : 17 avenue de l'Europe, 92275 Bois-Colombes Cedex RCS Nanterre 552 118 465 Forme Sociale : S.A.S. Capital Social : 653.242.306,20 ? SIREN/SIRET : 552 118 465 03644 - Code NAF 6202A
