At 6:57 PM -0500 10/24/98, [EMAIL PROTECTED] wrote:
>My htdig refuses to index pages that contain literal extended characters,
>i.e characers higher than 128 that are not escaped as &nnn; . I suspect
>that is because because htdig uses the data type char which is signed by
>default in Linux, instead of using unsigned char or even better wchar_t.
Yes, this is a problem. Eventually ht://Dig will support Unicode and this
will help internationalization in general. If you are actually seeing it
ignore a whole page simply because there are extended characters, that's a
bug. However, if you don't see those characters at all, my suggestion is to
change them to escapes. After all, I doubt most browsers deal well with
them either. Depending on platform, you may see different characters
entirely!
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.