On Friday 21 July 2006 16:16, Ngo, Anh (ISS Southfield) wrote:
> The Lucene 2.0.0 StandardAnalyzer treats the "_" (underscore) as a
> token separator. Is there a way to make StandardAnalyzer not split on
> "_" or on other given characters?

You need to add "_" to the #LETTER definition in StandardTokenizer.jj, then rebuild StandardTokenizer.java using the appropriate Ant task.

Regards
 Daniel

-- 
http://www.danielnaber.de
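
To illustrate what adding a character to the letter class changes, here is a minimal standalone sketch in plain Java. It does not use Lucene or the JavaCC grammar at all; the class name ExtraLetterDemo and the method tokenize are hypothetical, and the rule "letters and digits are token characters" is a simplifying assumption that is much cruder than what StandardTokenizer.jj actually defines.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical demo class, not part of Lucene.
final class ExtraLetterDemo {

    // Splits text into runs of token characters. A character counts as a
    // token character if it is a letter or digit, or if it appears in
    // extraLetters (the analogue of adding it to a LETTER-like definition).
    static List<String> tokenize(String text, String extraLetters) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            boolean tokenChar =
                Character.isLetterOrDigit(c) || extraLetters.indexOf(c) >= 0;
            if (tokenChar) {
                current.append(c);
            } else if (current.length() > 0) {
                // A non-token character ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Underscore not in the letter class: it acts as a separator.
        System.out.println(tokenize("user_name id_42", ""));
        // Underscore added to the letter class: tokens stay intact.
        System.out.println(tokenize("user_name id_42", "_"));
    }
}
```

With an empty extraLetters string the underscore splits the input; once "_" is passed in, "user_name" and "id_42" each come out as a single token, which is roughly the effect the grammar change is meant to have.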