*This message was transferred with a trial version of CommuniGate(tm) Pro*

I just had that problem as well. I got around it by writing my own Analyzer and Tokenizer. I called the Tokenizer WhitespaceTokenizer; it delimits tokens on whitespace instead of using the default Character.isLetter(ch) test. I used the test Character.isWhitespace(ch) in place of isLetter. The Analyzer is just like the default one, except that it uses my WhitespaceTokenizer.
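The whitespace-delimiting idea can be sketched in plain Java without any Lucene classes. This is only an illustration of the Character.isWhitespace(ch) test, not the actual Tokenizer subclass described above; the class name WhitespaceSplitSketch and the method tokenize are made up for the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: this is NOT a Lucene Tokenizer subclass.
class WhitespaceSplitSketch {

    // Splits text into tokens wherever Character.isWhitespace(ch) is true,
    // so characters such as '+' stay inside the token.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char ch = text.charAt(i);
            if (Character.isWhitespace(ch)) {
                // Whitespace ends the current token, if there is one.
                if (current.length() > 0) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
            } else {
                current.append(ch);
            }
        }
        // Flush the last token when the text does not end in whitespace.
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }
}
```

Calling WhitespaceSplitSketch.tokenize("I like c++ a lot") keeps "c++" as a single token, because only whitespace ends a token. Punctuation next to a word stays attached for the same reason.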
This does, however, present other problems, such as punctuation. You don't really want the last word of each sentence tokenized with the period attached, but that's just something to add to the Tokenizer code. I'm still in the testing phase, but when I have a complete solution I'll post the code.

Hope that helps!

Jonathan

-----Original Message-----
From: Ravi Damarla [mailto:[EMAIL PROTECTED]]
Sent: Saturday, October 20, 2001 5:19 AM
To: [EMAIL PROTECTED]
Subject: Special characters

Hello all:

I'm trying to add searching for strings containing special characters, for example, 'c++'. I'm currently using StandardAnalyzer and the '+' character is being ignored. Is there any way of doing this?

Thanks,
Ravi.

--
| Ravi Damarla | rdmrl (at) optonline.net |
| End of the Night Creations | http://www.endofthenight.com/ |
