: i'm using Solr in an e-commerce site, and i wanted to use protected words
: also to reduce recall for certain queries.

I'm not sure i understand what you mean.  Why would protected words (in 
regard to the stemmer) reduce recall? ... i guess it depends on the 
words you are protecting, right ... but why would you want to reduce 
recall?  isn't the goal usually to increase recall while keeping 
precision high?

(disclaimer: i'm not very smart when it comes to theoretical IR, i'm more 
of a hands-on "practicalist" .. i try stuff, i draw on past experience to 
decide if it's "better" and then i deploy it, and if my user satisfaction 
numbers go down i roll back.)

: It could be that a parallel approach using dismax boosting for fields such
: as "product name" and "category" will, besides increasing precision, also
: reduce false hit recall?

Hmmm... i think it's safe to say that intelligent choice of qf, pf, 
bf, and bq values (based on inherent knowledge of the corpus) can increase 
precision; but unless you use prohibitive fq clauses, i don't know that 
you will actually be reducing your false hit rate ... you're just making 
their scores very small relative to the top scoring docs.  a strict "mm" is 
your best bet for reducing the number of "false hits" (because things that 
don't match "enough" of the input terms will be weeded out)
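
To make that concrete, here is a rough sketch (in Python, just building 
the request parameters) of what a strict dismax query might look like.  
The field names (product_name, category, description), the boost values 
and the helper names are made up for illustration, and it assumes a Solr 
version where the parser is selected with defType=dismax; adjust to your 
own schema and setup.

```python
from urllib.parse import urlencode


def dismax_params(user_query, strict=True, exclude_fq=None):
    """Build dismax request parameters for a product search.

    Field names and boosts are hypothetical; tune them for your corpus.
    """
    params = {
        "defType": "dismax",
        "q": user_query,
        # qf/pf boosts only reorder results; they do not remove any hits.
        "qf": "product_name^3 category^2 description",
        "pf": "product_name^5",
        # A strict mm drops docs that don't match "enough" of the terms.
        # "100%" requires every term; "1" requires only one of them.
        "mm": "100%" if strict else "1",
    }
    if exclude_fq:
        # A prohibitive filter, e.g. "-category:accessories", removes
        # matching docs from the result set entirely.
        params["fq"] = exclude_fq
    return params


def to_query_string(params):
    """URL-encode the parameters for a GET request to the select handler."""
    return urlencode(params)


if __name__ == "__main__":
    print(to_query_string(dismax_params("red leather sofa")))
```

Only the mm and fq parameters change which docs come back; qf and pf 
just change where they land in the ranking.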




-Hoss
