On Sun, 2 Apr 2000, Dmitry Sivachenko wrote:
> main() {
> char c;
> c = 'Æ';
> setlocale(LC_CTYPE, "ru_RU.KOI8-R");
> setlocale(LC_COLLATE, "ru_RU.KOI8-R");
> printf("%c\n", toupper(c));
> };
>
> This program will _not_ convert c to upper case.
> If I change 'char'-->'unsigned char', everythink work fine
> (on my FreeBSD box).
I was asking about an example input for texindex, not an example program
that doesn't work with non-ASCII characters. We are considering whether
to change texindex so that it handles non-ASCII documents correctly.
> May be command-line option is that good solution.
> What do people think?
I do think that it's a good solution. But the problem is not solved by
it completely. If you think that texindex should call setlocale based on
that option, I'm not sure it's the right thing to do: setlocale affects
other functions beyond case-converting ones.
So we might need custom language-dependent tables for whatever texindex
has to do with non-ASCII text, instead of using locales.
It all depends on what does texindex do wrong currently. I suggest to
discuss these particular problems first.