: during indexing).  However, due to the pre-analysis whitespace tokenization
: done by lucene query parser, the reverse is not handled well - document with
: string 'thunderbolt' being matched to query 'thunder bolt'.

it's not so much "pre-analysis whitespace tokenization" as it is "query 
parser meta-characters" ... whitespace has meaning to the query parser in 
the same way that "+" "-" and "\"" do.

if you want a query parser that doesn't treat whitespace special, you can 
use the "FieldQParser" ... it supports no metacharacters and just runs hte 
input through the analyzer for a specified field.


-Hoss

Reply via email to