> But keep in mind that the way the current Endings algorithm works is
> slightly different than most "stemming" approaches. All words are indexed
> as-is and then at search time, the fuzzy algorithm can add additional
> "fuzzy query words" to the user query (at usually lower weight).

        Ah.. I was thinking of feeding the indexer pre-stemmed text as
well as the original text.  User queries would be stemmed by default
before querying, unless quotes ("") or the + operator are used.
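
To make the idea concrete, here is a rough sketch of what I mean, in Python rather than anything resembling the actual ht://Dig code.  Everything in it is made up for illustration: toy_stem is a crude suffix stripper standing in for a real stemmer, and build_index/search are hypothetical helpers.  For simplicity a quoted phrase is treated as a set of exact words, not as a true phrase match.

```python
import re
from collections import defaultdict

SUFFIXES = ("ing", "ed", "es", "s")  # toy rules only, not a real stemmer


def toy_stem(word):
    """Strip one common suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Index every word twice: once as-is and once stemmed."""
    exact = defaultdict(set)
    stemmed = defaultdict(set)
    for doc_id, text in docs.items():
        for word in tokenize(text):
            exact[word].add(doc_id)
            stemmed[toy_stem(word)].add(doc_id)
    return exact, stemmed


def search(query, exact, stemmed):
    """AND all terms together; "quoted" or +prefixed terms skip stemming."""
    results = None
    for quoted, plus, plain in re.findall(r'"([^"]*)"|\+(\S+)|(\S+)', query):
        if quoted or plus:
            # Exact lookup against the unstemmed index.
            hit_sets = [exact.get(w, set()) for w in tokenize(quoted or plus)]
        else:
            # Default: stem the query word and use the stemmed index.
            hit_sets = [stemmed.get(toy_stem(w), set()) for w in tokenize(plain)]
        for hits in hit_sets:
            results = set(hits) if results is None else results & hits
    return results or set()


docs = {
    1: "indexing web pages",
    2: "the indexer indexed everything",
    3: "an index of pages",
}
exact, stemmed = build_index(docs)
```

With these three documents, a plain query for indexing matches all of them through the shared stem, while +indexing or "indexing" only matches document 1.  The cost is a second set of postings in the index, in exchange for not having to expand the query at search time.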

        This method is pretty common in the IR research
community.  It is worth mentioning, though, that there is some controversy
over whether stemming increases or decreases accuracy in general.
It probably depends heavily on the dataset and the desired results...

-- 
Neal Richter 
Knowledgebase Developer
RightNow Technologies, Inc.
Customer Service for Every Web Site



_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev
