Hi, > UTF-8 is clearly defined by RFC 2279 which maintains the clear > 1-to-6-bytes encoding scheme of RFC 2044 with no confusion - and will > hopefully remain so.
FYI: RFC 2279 is obsoleted by RFC 3629 which defines UTF-8 as a 1-to-4-bytes encoding scheme. Sad but true... -- Egmont -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/
