: during indexing). However, due to the pre-analysis whitespace tokenization
: done by lucene query parser, the reverse is not handled well - document with
: string 'thunderbolt' being matched to query 'thunder bolt'.
it's not so much "pre-analysis whitespace tokenization" as it is "query parser meta-characters" ... whitespace has meaning to the query parser in the same way that "+", "-" and "\"" do. if you want a query parser that doesn't treat whitespace specially, you can use the "FieldQParser" ... it supports no metacharacters and just runs the input through the analyzer for a specified field.

-Hoss
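
For illustration only, here is a minimal sketch of how a client might hand the whole input to the field query parser using Solr's local-params syntax (`{!field f=...}`). The field name `title` and the helper `field_query_params` are hypothetical, and whether 'thunder bolt' ends up matching 'thunderbolt' still depends on the analyzer configured for that field.

```python
from urllib.parse import urlencode


def field_query_params(field, text):
    """Build request parameters that send `text` to Solr's field query parser.

    The {!field f=<name>} prefix selects the field query parser, so the
    rest of the q value (whitespace included) goes to that field's analyzer
    as one string. `field` is a hypothetical field name here.
    """
    return {"q": "{!field f=%s}%s" % (field, text)}


# Hypothetical usage: the field name "title" is an assumption.
params = field_query_params("title", "thunder bolt")

# URL-encode the parameters for appending to a select request.
query_string = urlencode(params)
```

The resulting `query_string` would be appended to the select URL of whatever Solr core is being queried.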