On Mon, 29 Nov 1999, Gilles Detillieux wrote:

> Just a hunch, but you wouldn't happen to have a ä in valid_punctuation,
> would you?  In any case, could you run htdig -vvv twice, searching
> first for ANLÄNDE, and then for anlände?  How do the initial debugging
> messages differ?  What's happening to the ä - is it getting stripped
> out or changed to another character?  Is the upper case Ä getting changed
> to a ä, or to another character?  Are you using the exact same config
> file for htdig, htmerge and htsearch?

I use the default for "valid_punctuation". I even tried adding the
character with "extra_word_characters: ä".

Here's the debugging info for the second (237th! :) try.
 
su10-2 <74> htsearch -vvv
Enter value for words: anlände
tempWords: 'anlände:0 '
Boolean: 'anlände:0 '
initial: ''
Add: anlände
searchWords: 'anlände:0 '
LogicalWords: anlände
Pattern: 
Enter value for format:

su10-2 <75> htsearch -vvv
Enter value for words: ANLÄNDE
tempWords: 'anlände:0 '
Boolean: 'anlände:0 '
initial: ''
Fuzzy on: anlände
   (null) anlände
   (null) word=anlände prefix_suffix=* prefix_suffix_length=1
minimum_prefix_length=1

   endings anlända anländandet anländandets anländande anländ- anländer
anlänt anländs anländes anlänts anländes
   synonyms
searchWords: '(:0 anlände:0 |:0 anlända:0 |:0 anländandet:0 |:0
anländandets:0 |:0 anländande:0 |:0 anländ-:0 |:0 anländer:0 |:0 anlänt:0
|:0 anländs:0 |:0 anländes:0 |:0 anlänts:0 |:0 anländes:0 ):0 '
LogicalWords: (anlände or anlända or anländandet or anländandets or
anländande or anländ- or anländer or anlänt or anländs or anländes or
anlänts or anländes)
Pattern: anlände
Enter value for format: 

Looks OK to me... what do you say?

> Not that I know of, but you could put a originalWords.uppercase(); right
> after the originalWords.chop(" \t\r\n"); in htsearch/htsearch.cc.  If the
> htsearch -vvv above doesn't get to the root of the problem, it might be
> interesting to see if this hack has any effect.

I'll try this too, if the above looks OK.

I got a mail from another Swedish subscriber to this list. According to
him, everything worked well using the sv_SE locale (which I don't have) and
indexing with an English dictionary (which shouldn't change anything).

I'll try to get hold of that locale and try it...
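
One way to check whether the locale matters here is to see how the C
library folds the byte for upper case Ä (0xC4 in ISO-8859-1) under each
locale. This is only a sketch: it assumes the words are lowercased with
the C library's tolower() and that the pages are ISO-8859-1, neither of
which I have confirmed in the htsearch source. The helper names are made
up for illustration, and the locale name passed in (for example
"sv_SE.ISO8859-1") is system dependent and may not be installed.

```cpp
#include <cctype>
#include <clocale>

// Lowercase one byte using whatever LC_CTYPE locale is currently active.
// Taking an unsigned char avoids undefined behaviour for bytes >= 0x80.
static int fold_byte(unsigned char c) {
    return std::tolower(c);
}

// Hypothetical helper: returns the lowercased byte under the named locale,
// or -1 if that locale cannot be selected (e.g. it is not installed).
// LC_CTYPE is reset to "C" before returning.
static int fold_in_locale(const char *locale_name, unsigned char c) {
    if (std::setlocale(LC_CTYPE, locale_name) == nullptr)
        return -1;
    int folded = fold_byte(c);
    std::setlocale(LC_CTYPE, "C");
    return folded;
}
```

In the plain "C" locale only A-Z count as upper case, so
fold_in_locale("C", 0xC4) gives back 0xC4 unchanged. With a working
Swedish ISO-8859-1 locale one would expect 0xE4 (ä) instead, and a
return value of -1 means the locale name was not accepted at all.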

/Philippe

