On 5/1/08, "Martin v. Löwis" <[EMAIL PROTECTED]> wrote: > - escaping looks like this: > * \r, \n, \t, \\ > * \xXX for characters from Latin-1 > * \uXXXX for characters from the BMP > * \U00XXXXXX for anything else
> What I didn't have in my original proposal was escaping of Zs > except for space, which then would also escape NBSP, EN QUAD, > EM QUAD, THIN SPACE, HAIR SPACE, OGHAM SPACE MARK, etc. Escaping > them is fine also. Also, I didn't consider surrogate pairs in > UCS-2 builds originally; they should (of course) get represented > as-is. I realize that this is the traditional escape form, but I wonder if it might be better to just use the character names instead of the hex character codes. The names can be written in ASCII, they are unambiguous, and they are easier to understand than a random hex value. -jJ _______________________________________________ Python-3000 mailing list Python-3000@python.org http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com