On Sun, Sep 28, 2008 at 10:43 PM, James Y Knight <[EMAIL PROTECTED]> wrote:
> [1] UTF-8b has a similar property to 8859-1, in that all byte strings can be
> successfully round-tripped. It's not currently implemented in python core,
> but it's a pretty trivial encoding, and is available under the BSD license,
> see below.

UTF-8b doesn't work as intended. It produces an invalid Unicode object
(one containing garbage surrogates) that cannot be used with external APIs
or libraries that require Unicode. If you don't need Unicode, then your code
should state so explicitly, and 8859-1 is ideal there.

-- 
Adam Olsen, aka Rhamphoryncus
_______________________________________________
Python-3000 mailing list
Python-3000@python.org
http://mail.python.org/mailman/listinfo/python-3000
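
To make the two positions concrete, here is a minimal sketch. It assumes Python 3.1 or later and uses the `surrogateescape` error handler (added by PEP 383 after this thread) as a stand-in for UTF-8b, since it follows the same idea of mapping undecodable bytes to lone surrogates. The sample bytes and the helper name `strict_utf8_fails` are made up for illustration.

```python
# Sketch only: surrogateescape (Python 3.1+) stands in for UTF-8b here.
all_bytes = bytes(range(256))

# ISO 8859-1 maps every byte 0x00-0xFF to the code point with the same
# value, so any byte string decodes and re-encodes unchanged.
latin1_text = all_bytes.decode("latin-1")
latin1_ok = latin1_text.encode("latin-1") == all_bytes

# With surrogateescape, an undecodable byte 0xNN becomes the lone
# surrogate U+DCNN, which also round-trips back to the original bytes.
raw = b"caf\xe9"  # hypothetical input: Latin-1 bytes, not valid UTF-8
escaped = raw.decode("utf-8", errors="surrogateescape")
escape_ok = escaped.encode("utf-8", errors="surrogateescape") == raw


def strict_utf8_fails(text):
    """Return True if a strict UTF-8 encoder rejects the string."""
    try:
        text.encode("utf-8")
    except UnicodeEncodeError:
        return True
    return False


print(latin1_ok, escape_ok)
print(strict_utf8_fails(latin1_text), strict_utf8_fails(escaped))
```

Both approaches round-trip the bytes. The difference is in the intermediate string: the Latin-1 result contains only ordinary code points, while the escaped result carries a lone surrogate that a strict encoder refuses, which is the kind of object an external API expecting valid Unicode may reject.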