StandardTokenizerFactory and WhitespaceTokenizerFactory

2015-07-30 Thread Tarala, Magesh
I am indexing text that contains part numbers in various formats, with hyphens/dashes and a few other special characters. Here's the problem: if I use StandardTokenizerFactory, the hyphens, etc. are stripped, so I cannot search by the part number 222-333-. I can only search for 222

RE: StandardTokenizerFactory and WhitespaceTokenizerFactory

2015-07-30 Thread Tarala, Magesh
Would using PatternReplaceCharFilterFactory to replace commas, periods, etc. with a space or an empty string work?

-Original Message-
From: Tarala, Magesh
Sent: Thursday, July 30, 2015 10:08 AM
To: solr-user@lucene.apache.org
Subject: StandardTokenizerFactory and WhitespaceTokenizerFactory

I am

RE: StandardTokenizerFactory and WhitespaceTokenizerFactory

2015-07-30 Thread Tarala, Magesh
I'm adding PatternReplaceCharFilterFactory to strip the unwanted characters. It looks like this works.

-Original Message-
From: Tarala, Magesh
Sent: Thursday, July 30, 2015 10:37 AM
To: solr-user@lucene.apache.org
Subject: RE: StandardTokenizerFactory and WhitespaceTokenizerFactory

Using
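
For readers landing on this thread, here is a rough sketch of the kind of analyzer chain described above: a PatternReplaceCharFilterFactory that strips unwanted punctuation, followed by WhitespaceTokenizerFactory so that hyphenated part numbers stay intact as single tokens. The field type name, the character class in the pattern, and the lowercase filter are assumptions for illustration only; the thread does not show the actual configuration that was used.

```xml
<!-- Hypothetical field type; the name "text_partnum" is made up for this sketch -->
<fieldType name="text_partnum" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Char filters run before the tokenizer. Here commas and periods are
         replaced with a space; adjust the character class to your data. -->
    <charFilter class="solr.PatternReplaceCharFilterFactory"
                pattern="[,.]" replacement=" "/>
    <!-- Splits on whitespace only, so hyphens inside a part number are kept -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Optional (assumed): case-insensitive matching -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

Note that a pattern like this would also split a part number that itself contains a period, so the character class probably needs tuning against real data. The Analysis screen in the Solr admin UI is a convenient way to check what tokens a given input produces.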