Thank you. Lucene documentation is vague on this subject.

On the LIA-book -earch powered by Lucene it seems the "-" operator works as a 
prohibitor regardless of the number of spaces after the "-".   Still can't tell 
if this is a bug or by design.

A Nutch parser, however, seems to have changed that.  I checked two 
Nutch/Lucene implementations the web - Indeed.com and at Oregon State. Their 
parsers provide a more "standard" public web search behavior.  I.e. the "-" is 
treated as a minus only if no space between it and the search term.  Otherwise 
the "-"becomes just another search term in the query.

-Felix

Yonik Seeley <[EMAIL PROTECTED]> wrote: On 1/23/07, Felix Litman  wrote:
> Does a special character lika a "-" prohibitor operator require no-space 
> after it in order to work as a prohibitor?
>
>   Typically on the web, e.g. Google and others, the "-" operator works as a 
> boolean prohibitor only when not followed by a space. Otherwise it is treated 
> as just a dash query term.
>
>   But in our Lucene implementation the the "-" seem to be acting as a 
> prohibitor even if there is space after it.   For example. in a query:  Sales 
> +    service    , the 'service' term is excluded by Lucene.  (Same for the 
> "+" operator.)
>
>   Is this space-treatment a bug or a standard Lucene query parser behavior?

I think this is expected, and changing it would be more confusing.  Consider:
If "-" is expected to find a dash, surely "-x" should find a dash
followed by an x?

If you want literals, put quotes around your terms...
"Sales +    service"

-Yonik

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Reply via email to