This has encode and decode reversed from my understanding. I regard the string (wide-char) as the canonical form and the bytes as the encoded form. This view is reflected in the widely used terminology "charset encodings" which refers to the likes of euc-kr and shift_jis.
Yeah, I suspect we'll get it right once put in a draft :-) -- Anne van Kesteren http://annevankesteren.nl/
