hi herbert, >> WhitespaceAnalyzer looks brutal. Is it possible that I keep >> StandardAnalyzer and at the same time to tell the parser to keep a >> list of chars during indexing?
Perhaps it would be sufficient to use the WhitespaceAnalyzer and keep StandardAnalyzer for the other fields by using a PerFieldAnalyzerWrapper? > > Add something like: > > | < #MYCHARACTERS: > ("&" | ":" | "%" | ";") > > > > to the StandardTokenizer.jj and rebuild it. > > Might cause some lexical indeterministic errors, so look out for those. ... and you have to remember to do this again on each lucene-update. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]