>> Thank you for your great and detailed explanation. You were very explicit>>
>> >> I have one little question yet. You said "Do you have any idea about
>> terms selectivity?" What means terms selectivity?
>Search behavior and performance hardly depends on a percents of
>documents which are matched by terms.
>
>If it's one document (unique ids) or several documents, then we have one
>situation.
>If it's 70% of documents, then situation is completely different.
OK Alexander. I understand
this. How can I manage this situation? Because I will index all words from
text fields (this is the default behavior of the tokenizer, isn't it?). So,
there will be words like 'and', 'a', 'an', 'than' and many others which will
apear in many documents. I know that MYSQL fulltext index has a full list with
these common words, and they exclude this words from the index.
Tell me how can I select common terms in an efficient way. Where should I add
this? Is there a class which I can extend?
I wait your answer.
Thank you,
Sebi
__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com
__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com