According to Olivier Korn:
> At 16:00 02/10/2002 -0500, Randall Fish wrote :
> >Our users are searching for 'breast' and 'breasts' and no search results are
> >being returned. A search for 'mammogram' returns documents containing
> >'breast' and 'breasts'. The db.wordlist file contains the references to both
> >words. The words do not exist in the bad words file.
> 
> It looks very similar to the problem I had with ht://Dig version 3.1.5,
> where my locale settings were influencing the way the sort program
> worked.
> 
> The solution was to add LC_COLLATE=C before the call (as is done in
> version 3.1.6 of the rundig script). I know you are using this version of
> ht://Dig, but maybe Cygwin is somehow different in the way it manages
> locales... Couldn't that be it?
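
For reference, the effect of that LC_COLLATE=C setting can be reproduced
outside of rundig. This is only a rough sketch, not how rundig or htmerge
call sort: the helper name sort_in_c_locale is made up for illustration,
and it assumes a "sort" program is on the PATH (as it normally is under
Cygwin).

```python
import os
import subprocess


def sort_in_c_locale(lines):
    """Sort byte-string lines with the system sort under LC_COLLATE=C.

    Hypothetical helper for illustration only.
    """
    env = dict(os.environ)
    env.pop("LC_ALL", None)   # LC_ALL, if set, would override LC_COLLATE
    env["LC_COLLATE"] = "C"   # same setting as the LC_COLLATE=C prefix
    result = subprocess.run(
        ["sort"],
        input=b"".join(lines),
        env=env,
        stdout=subprocess.PIPE,
        check=True,
    )
    return result.stdout.splitlines(keepends=True)
```

In the C locale, lines are compared byte by byte, so uppercase ASCII
letters sort ahead of lowercase ones. Other locales may fold case or
ignore punctuation, which is how the order can end up differing.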

This was my gut reaction yesterday too, and I was about to e-mail
asking whether db.wordlist appeared to be sorted correctly under Cygwin.
But then I remembered that I had fixed htmerge in 3.1.6 to build the word
database correctly even if db.wordlist entries aren't in the right order.
It's just less efficient at it, and produces a bigger word database
than when the words are sorted correctly, but the db shouldn't be losing
entries.
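
If you still want to rule out the sort order, something like the
following would do as a quick check. It is a minimal sketch, assuming
that "sorted correctly" means whole lines in raw byte order (what sort
gives under LC_COLLATE=C with no key options); the function name
first_unsorted_line is just for illustration.

```python
def first_unsorted_line(lines):
    """Return the 1-based number of the first line that sorts before its
    predecessor in raw byte order, or None if the input is in order.

    Assumption: whole lines are compared as bytes, with the line ending
    stripped, as a C-locale sort without key options would do.
    """
    previous = None
    for number, line in enumerate(lines, start=1):
        current = line.rstrip(b"\r\n")
        if previous is not None and current < previous:
            return number
        previous = current
    return None
```

To use it, open db.wordlist in binary mode and pass the file object in,
e.g. first_unsorted_line(open("db.wordlist", "rb")). A result of None
means the file is in byte order; a number points at the first line that
is out of place.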

Yet lost entries in db.words.db seem to be the most reasonable
explanation for this problem.  About all I can think to suggest right
now is to try compiling and running db/db_dump/db_dump.c to look at the
entries in db.words.db and see if anything is missing.  If entries are
missing, then htmerge isn't building this db correctly under Cygwin.
If the db is complete, then htsearch doesn't seem to be finding all the
entries under Cygwin, or is rejecting them for some unknown reason.

Oh, have you tried reindexing from scratch to see if that makes the
problem go away?  It may be that the database has become corrupted
somehow.  (Although htmerge rebuilds db.words.db from scratch each time
you run it, it may be that htsearch is dropping results because of a
corrupted db.docdb.)  Does the number of matches reported by htsearch
agree with how many results are actually shown?

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)

