: i'm using Solr in a e-commerce site, and i wanted to use protected words : also to reduce recall for certain queries.
I'm not sure i understand what you mwan. Why would protected words (in regards to the stemmer) reduce recall ? ... i guess it depends on the words you are protecting right ... but why would you wnat to reduce recall? isn't the goal usually to increases recall while keeping precision high? (disclaimer: i'm not very smart when it comes to theoretical IR, i'm more of a hands on "practicallist" .. i try stuff, i draw on past experience to decide if it's "better" and then i deploy it and if my user satisfaction numbers go down i roll back.) : It could be that a parallel approach using dismax boosting for fields such : as "product name" and "category" will, beside increasing precision, also : reducing false hit recall? Hmmm... i think it's safe to see that intellegent choice of qf, pf, bf, and bq values (based on inherent knowledge of hte corpus) can increase precision; but unless you use prohibitive fq clauses, i don't know that you will actaully be reducing your false hit rate ... you're just making their scores very small relative the top scoring docs. a strict "mm" is your best bet for reducing the number of "false hits" (because things that don't match "enough" of the input terms will be weeded out) -Hoss