According to Alexander I. Lebedev: > Gilles Detillieux <[EMAIL PROTECTED]> wrote: > Subject: Re: [htdig] a bug? (reposted) - PATCH for htfuzzy 3.1.5 > > >> If anyone wants, I could send the lists of duplicate word forms > >> for analyzing. > > > >I'd be curious to see it. I did notice that the word2root.db file was > >slightly bigger after I patched htfuzzy/EndingsDB.cc, implying that some > >words in english.0 would have more than one root. I'm surprised it's > >over 2000! What is the extended English word list? > > Gilles, > > The following are two word lists, the first one generated from > english.0.original file, the second one -- from english.0 file (I called > it the extended list). BTW, what's the difference between these lists? About 720 KB. :-) The english.0 file is the one that gets installed and used. I wasn't even aware of the english.0.original file until you mentioned it, but it's been there since the early days of 3.0.8b2. I suspect it's a long since forgotten holdover from the past, that should probably have been removed. Maybe Andrew could shed some light on this, if it's not too long ago for him to remember. -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

