Re: [Python-3000] [Python-Dev] Filename as byte string in python 2.6 or 3.0?

James Y Knight Mon, 29 Sep 2008 09:16:45 -0700

On Sep 29, 2008, at 3:32 AM, Adam Olsen wrote:

On Sun, Sep 28, 2008 at 10:43 PM, James Y Knight <[EMAIL PROTECTED]> wrote:
[1] UTF-8b has a similar property to 8859-1, in that all bytestrings can be successfully round-tripped. It's not currently implemented in python core, but it's a pretty trivial encoding, and is available under the BSD license,
see below.
UTF-8b doesn't work as intended.  It produces an invalid unicode
object (garbage surrogates) that cannot be used with external APIs or
libraries that require unicode.
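
The concern about surrogates can be illustrated with a small sketch. It assumes Python 3's built-in `surrogateescape` error handler as a stand-in for UTF-8b (it is not the implementation discussed in this thread), and the helper name `is_strictly_encodable` is hypothetical:

```python
# Sketch only: "surrogateescape" stands in for UTF-8b here.
# It maps each undecodable byte 0xNN to the lone surrogate U+DCNN.

def is_strictly_encodable(text: str) -> bool:
    """Return True if text can be encoded as ordinary (strict) UTF-8."""
    try:
        text.encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# A filename containing a Latin-1 byte (0xE9) that is not valid UTF-8.
escaped_name = b"caf\xe9.txt".decode("utf-8", errors="surrogateescape")

# The decoded string carries a lone surrogate, so a strict encoder rejects it.
print(is_strictly_encodable(escaped_name))
# A properly encoded name has no such problem.
print(is_strictly_encodable("caf\u00e9.txt"))
```

Any consumer that insists on strictly valid Unicode would reject the first string in the same way, which is the case the objection describes.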

I'd be interested to hear more detail on what you expect the practical ramifications of this to be. It doesn't sound likely to be a problem to me.

If you don't need unicode then your
code should state so explicitly, and 8859-1 is ideal there.

But, I *do* want unicode. ALL my filenames are encoded in utf8. Except... that one over there. That's the whole point of UTF-8b: correctly encoded names get decoded correctly and readably, and the other cases get decoded into something unique that cannot possibly conflict.
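
A minimal sketch of that behaviour, again assuming Python 3's built-in `surrogateescape` error handler as an approximation of UTF-8b; the helper names `decode_name` and `encode_name` are hypothetical:

```python
# Sketch only: "surrogateescape" approximates the UTF-8b scheme.
# Valid UTF-8 decodes normally; each invalid byte 0xNN becomes U+DCNN.

def decode_name(raw: bytes) -> str:
    """Decode a filename, escaping bytes that are not valid UTF-8."""
    return raw.decode("utf-8", errors="surrogateescape")

def encode_name(name: str) -> bytes:
    """Reverse decode_name, restoring the original bytes exactly."""
    return name.encode("utf-8", errors="surrogateescape")

good_raw = "caf\u00e9.txt".encode("utf-8")  # correctly encoded UTF-8
bad_raw = b"caf\xe9.txt"                    # "that one over there" (Latin-1)

good_name = decode_name(good_raw)  # readable text
bad_name = decode_name(bad_raw)    # text plus one escaped byte

# Both names round-trip to their original bytes.
assert encode_name(good_name) == good_raw
assert encode_name(bad_name) == bad_raw

# The two decoded names differ, so they cannot be confused with each other.
assert good_name != bad_name
```

The escaped name cannot collide with a correctly encoded one because a strict UTF-8 decode never produces lone surrogates.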


James