Re: Handling space variations in queries - matching 'thunderbolt' for query 'thunder bolt'

Chris Hostetter Tue, 09 Aug 2011 09:12:06 -0700

: during indexing).  However, due to the pre-analysis whitespace tokenization
: done by lucene query parser, the reverse is not handled well - document with
: string 'thunderbolt' being matched to query 'thunder bolt'.


it's not so much "pre-analysis whitespace tokenization" as it is "query 
parser meta-characters" ... whitespace has meaning to the query parser in 
the same way that "+" "-" and "\"" do.

if you want a query parser that doesn't treat whitespace special, you can 
use the "FieldQParser" ... it supports no metacharacters and just runs hte 
input through the analyzer for a specified field.


-Hoss

Re: Handling space variations in queries - matching 'thunderbolt' for query 'thunder bolt'

Reply via email to