Glenn Linderman wrote:

My understanding of the Posix file names is that any byte values are valid except "/" and null. Is this a correct understanding?

The UTF-8b proposal seems to translate from a non-UTF-8 byte stream to a Unicode character stream. Call the original byte stream FOO. The transformation then produces FOOTR, a set of Unicode code points. Now FOOTR has a representation in UTF-8, which is a byte stream, call that byte stream FOOTRUTF8. How, by looking at FOOTR, do you know whether it represents the file name FOO or FOOTRUTF8 ? And remember that the user might provide a Unicode character stream identical to FOOTR: should it be translated to FOO or FOOTRUTF8 when creating a new file according to the user-supplied name?

If FOOTR is using PUA chars, then I believe that users should not be providing such a stream as it would have no defined meaning coming from them.

_______________________________________________
Python-3000 mailing list
Python-3000@python.org
http://mail.python.org/mailman/listinfo/python-3000
Unsubscribe: 
http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com

Reply via email to