On 10/14/02 10:00 AM, "Benoit Hediard" <[EMAIL PROTECTED]> wrote:
..
> 
> I recommand you not to empty the noise words files, you'll loose some great
> capabilities of full-text search.
> For example, if someone search for a letter/number or a common word ('the',
> 'or'...), those words should be ignored, otherwise you'll get plenty of
> "noise" in your results.
> Even if it is minimal, the noise words file should never be empty.
> 

Make sure you know your subject domain.  I was indexing popular
song titles and using a stop word list.  I tried to search for
the Huey Lewis song "If This Is It"  -- and of course I couldn't
find it --  all four are stop words.

Of course, in most domains, stop words are good.  

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
Archives: http://www.houseoffusion.com/cf_lists/index.cfm?forumid=4
Subscription: http://www.houseoffusion.com/index.cfm?sidebar=lists&body=lists/cf_talk
FAQ: http://www.thenetprofits.co.uk/coldfusion/faq
This list and all House of Fusion resources hosted by CFHosting.com. The place for 
dependable ColdFusion Hosting.

Reply via email to