On Wed, 2004-01-14 at 07:32, srintuar wrote: > should deal with any valid utf-8 sequence up to six bytes > long.
six byte UTF-8 sequences are not valid. According to both IETF and Unicode Consortium at least. They are only valid in ISO/IEC 10646. roozbeh -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/
