I guess most of you have already handled and many of you might still be
handling keyword stuffing. Here is my scenario. We have a huge index
containing about 6m docs. (Not sure if that is huge :-) And every document
contains title, description, tags, content (textual data). People have been
doing keyword stuffing on the documents, so when searched for a "query
term", the first results are always the ones who are optimized.

So, instead of people getting relevant results, they get spam content
(highly optimized, keyword stuffed content) as first few results. I have
tried a couple of things like providing different boosts to different
fields, but almost everything seems to fail.

I'd like to know how did you guys fixed this thing?

*Pranav Prakash*

"temet nosce"

Twitter <http://twitter.com/pranavprakash> | Blog <http://blog.myblive.com> |
Google <http://www.google.com/profiles/pranny>

Reply via email to