> > What if the query is '+strong force' and 'strong' is tokenized into
> > 'strong' and the alias 'tough' ?  Will the query parser convert it to
> > '+(strong OR tough) force' ?
>
> If strong is tokenized to /strong tough/, then /+strong force/ will
> be tokenized to /+"strong tough" force/.
>
> While this may look weird to you, that's because this is a really
> bogus example.  Tokenizers in Western languages generally don't expand
> a single token into more than one; this is more common in Asian
> languages, where a complicated ideograph is broken down into simpler
> ones.  So a more comparable example in English would be to tokenize
> the word "HaagenDazs" into "ice cream".  So then a search for
> /+HaagenDazs chocolate/ becomes (+"ice cream" chocolate), which is
> a pretty reasonable-looking behavior.
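
To check my understanding of the rewriting described above, here is a
minimal sketch in plain Python. It is not Lucene code: the function names
and the expansions table are hypothetical, and it only handles bare terms
with an optional '+' prefix.

```python
def expand_clause(clause, expansions):
    """Rewrite one query clause; a multi-token expansion becomes a phrase."""
    required = clause.startswith("+")
    term = clause[1:] if required else clause
    # Hypothetical tokenizer output: term -> list of tokens it expands to.
    tokens = expansions.get(term, [term])
    if len(tokens) == 1:
        body = tokens[0]
    else:
        body = '"' + " ".join(tokens) + '"'
    return ("+" if required else "") + body


def rewrite_query(query, expansions):
    """Apply expand_clause to every whitespace-separated clause."""
    return " ".join(expand_clause(c, expansions) for c in query.split())


print(rewrite_query("+HaagenDazs chocolate", {"HaagenDazs": ["ice", "cream"]}))
print(rewrite_query("+strong force", {"strong": ["strong", "tough"]}))
```

The first call yields the "ice cream" phrase query from the example, and
the second yields the odd-looking /+"strong tough" force/.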

I see, so this is intended for cases where a token is replaced with a
sequence of tokens that, taken together, represents a single meaning
similar to that of the original token.
Another case, which does not seem to be supported, is when a token is
replaced with a sequence of tokens, each representing an *alternative*
meaning. Here is an example:

 'dog' -> 'dog',  'pet'
 'cat' -> 'cat',  'pet'
 'pet' -> 'pet'

When you search for 'pet' you also want to match documents with 'dog' and
'cat', but when you search for 'dog' you don't want to match 'cat' or 'pet'.
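
One way to picture the behavior I am after is a toy inverted index in
plain Python. This is only a sketch of the desired result, not a claim
about how Lucene does or should implement it; the names ALIASES,
build_index and search are made up for illustration, and the sketch
assumes the aliases are applied when documents are indexed while query
terms are left untouched.

```python
from collections import defaultdict

# Hypothetical one-way alias table taken from the example above.
ALIASES = {"dog": ["dog", "pet"], "cat": ["cat", "pet"], "pet": ["pet"]}


def build_index(docs, aliases):
    """Index every token under itself and under each of its aliases."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            for term in aliases.get(token, [token]):
                index[term].add(doc_id)
    return index


def search(index, term):
    """Look up a single term as typed; no expansion at query time."""
    return sorted(index.get(term.lower(), set()))


docs = {1: "my dog barks", 2: "the cat sleeps", 3: "a pet shop"}
index = build_index(docs, ALIASES)

print(search(index, "pet"))  # documents 1, 2 and 3
print(search(index, "dog"))  # document 1 only
print(search(index, "cat"))  # document 2 only
```

Because the alternatives are added only on the document side, 'pet'
reaches the 'dog' and 'cat' documents but not the other way around.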

> > The FAQ is aimed at programmers that embed Lucene in their
> > application and not at their end users.
>
> Who will still have to create documentation for _their_ (naive) users.
> Why not call attention to the issues that will undoubtedly bite their
> users?  Why not give them something they can basically cut and paste
> from?

Great idea. Write a manual for end users and we
will include it in the FAQ or have it in a separate document.

Tal


_______________________________________________
Lucene-dev mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/lucene-dev
