> But strcoll *has* explicit knowledge of characters.
> If you set LC_ALL to fi_FI.UTF-8 then strcoll will know about the Finnish collation
> rules and also know that strings are UTF-8 encoded.
> This is mandated by the standards, and glibc 2.2 implements them.

OK. But strtok(), for instance, does not handle UTF-8 data properly. Is this
also in the standards? Reading the two URLs below, I could not see where it
is explained that strcoll() does and strtok() does not...

> See http://mail.nl.linux.org/linux-utf8/2001-12/msg00042.html and
> http://www.opengroup.org/onlinepubs/007908799/xsh/strcoll.html

Paul

--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
