Rich Felker wrote:
>
> For example, the unix "cut" program works automatically with UTF-8
> text as long as the delimiter is a single byte, and if you want
> multibyte delimiters, all you need to do is make it accept a multibyte
> delimiter character and then do a substring search instead of a byte
> search. There is no need to ever treat the input string as characters,
> and in fact doing so just makes it slow and bloated.

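To make the distinction concrete, here is a rough Python sketch of the two kinds of selection cut offers. The function names cut_fields, cut_bytes and cut_chars are invented for illustration and are not taken from any real cut implementation; "character" here is assumed to mean a Unicode code point, not a grapheme cluster. Field selection can stay on raw bytes, because a valid UTF-8 delimiter can only match at character boundaries. Character-range selection (cut -c) cannot.

```python
from typing import Iterable


def cut_fields(line: bytes, delim: bytes, fields: Iterable[int]) -> bytes:
    """Select 1-based fields, roughly like cut -d/-f, without decoding."""
    # Plain substring search on bytes. UTF-8 is self-synchronizing, so a
    # delimiter that is itself valid UTF-8 never matches inside a character.
    parts = line.split(delim)
    if len(parts) == 1:
        return line  # no delimiter on this line: pass it through unchanged
    return delim.join(parts[i - 1] for i in fields if 1 <= i <= len(parts))


def cut_bytes(line: bytes, first: int, last: int) -> bytes:
    """Select a 1-based inclusive byte range, roughly like cut -b."""
    return line[first - 1:last]


def cut_chars(line: bytes, first: int, last: int) -> bytes:
    """Select a 1-based inclusive code point range, roughly like cut -c."""
    # Assumption: the input is valid UTF-8. Finding code point boundaries
    # is unavoidable here, so the bytes must be interpreted as characters.
    text = line.decode("utf-8")
    return text[first - 1:last].encode("utf-8")
```

With the line "naïve→café→thé" and the multibyte delimiter "→", cut_fields returns the second field intact without ever decoding. On "naïve", however, cut_bytes(line, 2, 3) returns the byte for "a" plus the first half of "ï", which is not valid UTF-8, while cut_chars(line, 2, 3) returns "aï".
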
cut -c2-3 ...

Daniel
-- 
Daniel Barclay
[EMAIL PROTECTED]

--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
