On Mon, Jul 11, 2016 at 06:59:25PM +0530, Abhinav Upadhyay wrote:
> But the downside is that technical keywords (e.g. kms, lfs, ffs), are
> also stemmed down and stored (e.g. km, lf, ff) in the index. So if you
> search for kms, you will see results for both kms and km.

Interesting problem.

I expect the set of documents that contain a word ("directories") and
the set of documents containing its true stem ("directory") to overlap
widely.  I also expect the set of documents that contain a word ("kms")
and an incorrect stem ("km") to scarcely overlap.  Do the manual pages
meet these expectations?  If so, then maybe you can decide whether or not
to keep a stem by looking at the document-set overlap?
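
A rough sketch of that overlap test, in Python for brevity.  Everything
here is an assumption for illustration: the names overlap() and
keep_stem(), the use of Jaccard similarity as the overlap measure, the
0.3 threshold, and the toy index, which is made up and does not reflect
the real manual pages or the apropos index format.

```python
def overlap(docs_with_word, docs_with_stem):
    """Jaccard similarity of two document sets: |A & B| / |A | B|."""
    a, b = set(docs_with_word), set(docs_with_stem)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def keep_stem(index, word, stem, threshold=0.3):
    """Accept the stem only if its documents overlap the word's enough.

    `index` maps a term to the set of documents containing it.
    The threshold is an arbitrary placeholder that would need tuning.
    """
    return overlap(index.get(word, ()), index.get(stem, ())) >= threshold


# Made-up toy index: term -> documents containing that term.
index = {
    "directories": {"ls.1", "mkdir.1", "find.1", "rm.1"},
    "directory": {"ls.1", "mkdir.1", "find.1", "cd.1"},
    "kms": {"drm.4", "radeon.4"},
    "km": {"units.1"},
}

print(keep_stem(index, "directories", "directory"))  # overlap 3/5
print(keep_stem(index, "kms", "km"))                 # overlap 0/3
```

Jaccard penalizes a stem that is much more common than the word; an
overlap coefficient (intersection over the smaller set) might suit the
real data better.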

Dave

-- 
David Young         //\ Trestle Technology Consulting
(217) 721-9981      Urbana, IL   http://trestle.tech/
