Fw: [fw-general] Zend_Search_Lucene questions

Sebi Thu, 21 Dec 2006 07:07:04 -0800

>> Thank you for your great and detailed explanation. You were very explicit.
>> I have one more small question. You said "Do you have any idea about
>> terms selectivity?" What does "terms selectivity" mean?
>Search behavior and performance depend heavily on the percentage of
>documents which are matched by the terms.
>
>If it's one document (unique ids) or several documents, then we have one
>situation.
>If it's 70% of documents, then the situation is completely different.





OK Alexander, I understand this. How can I manage this situation? I will index
all words from text fields (this is the default behavior of the tokenizer,
isn't it?), so there will be words like 'and', 'a', 'an', 'than' and many
others which will appear in many documents. I know that the MySQL full-text
index has a list of these common words, and it excludes these words from the
index.
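
To make concrete what I mean by common terms, here is a rough sketch. It is
plain Python, not Zend_Search_Lucene code: the function name common_terms, the
simple regex tokenizer and the 0.7 threshold (taken from the 70% figure above)
are all my own assumptions for illustration. It counts in how many documents
each term occurs and collects the terms that occur in too large a share of them:

```python
import re
from collections import Counter


def common_terms(documents, max_fraction=0.7):
    """Return the terms that occur in at least max_fraction of the documents.

    Hypothetical helper for illustration only; not part of any library.
    """
    total = len(documents)
    if total == 0:
        return set()

    doc_freq = Counter()
    for text in documents:
        # Use a set so each term is counted once per document
        # (document frequency, not raw term frequency).
        terms = set(re.findall(r"[a-z0-9]+", text.lower()))
        doc_freq.update(terms)

    return {term for term, n in doc_freq.items() if n / total >= max_fraction}


docs = [
    "a cat and a dog",
    "an apple and a pear",
    "the index and the query",
]
stop_words = common_terms(docs)
```

Terms collected this way could then be skipped at indexing time, in the same
spirit as the MySQL list of common words.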

Please tell me how I can select common terms in an efficient way. Where should
I add this? Is there a class which I can extend?
I look forward to your answer.

Thank you,

            Sebi











