According to David GUIODO:
> I can not get the lang environment variable on windows NT.
> So every word with an accent is cut by htdig.
>
> A solution is :
> in the htdig.conf, add the following line :
>
> extra_word_characters: éèàçùïîâäêëôöûü
>
> Magic !
> David.
> [EMAIL PROTECTED]
This is a partial solution. It still won't recognize upper-case
accented letters. If you add these to extra_word_characters, they will
be indexed, but htdig won't know how to convert upper-case accented
letters to lower-case, so you won't have case-insensitive matching for
accented letters, as you do for unaccented ones.
I've often thought there should be an extra_word_casemap attribute
added to htdig, to support adding upper- and lower-case mappings of
extra characters, for systems with broken or missing locale support,
but alas, I don't have the time to implement this.
Still, I find it surprising that the Cygwin toolkit for WinNT doesn't
have working locale support! It seems like a pretty serious oversight,
as locales are increasingly important for UNIX compatibility. Are you
certain that it doesn't work? Have you seen anything in the Cygwin
documentation about locales? (This wouldn't add LANG support to all of
WinNT, just to UNIX applications ported to NT using the Cygwin toolkit.)
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html