Le lundi 29 septembre 2008 à 17:50 -0600, Adam Olsen a écrit :
> It's correct in the sense that it can roundtrip all filenames.  UTF-8b
> is lossy, so certain filenames are not roundtripped properly.

Why do you say UTF-8b is lossy? From what I've read it claims to be
lossless (i.e. the range of characters used for escaping of invalid
bytes are themselves escaped if they are encountered in the source
sequence).

> As a user, I expect all file names to be printable.  That requires
> unicode, and any program that creates filenames with arbitrary
> bytestrings is just broken.

But if you use iso-8859-1 for decoding, all non-ASCII filenames will be
printed wrongly, not only those with invalid bytestrings. I fail to see
what it brings.



_______________________________________________
Python-3000 mailing list
Python-3000@python.org
http://mail.python.org/mailman/listinfo/python-3000
Unsubscribe: 
http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com

Reply via email to