On 5/1/08, "Martin v. Löwis" <[EMAIL PROTECTED]> wrote:

>  - escaping looks like this:
>   * \r, \n, \t, \\
>   * \xXX for characters from Latin-1
>   * \uXXXX for characters from the BMP
>   * \U00XXXXXX for anything else

>  What I didn't have in my original proposal was escaping of Zs
>  except for space, which then would also escape NBSP, EN QUAD,
>  EM QUAD, THIN SPACE, HAIR SPACE, OGHAM SPACE MARK, etc. Escaping
>  them is fine also. Also, I didn't consider surrogate pairs in
>  UCS-2 builds originally; they should (of course) get represented
>  as-is.
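
For reference, a minimal sketch of the quoted scheme in Python. The function
names are hypothetical, and the rule for which characters get escaped is an
assumption: anything in the Unicode C* or Z* general categories, except the
ASCII space. It ignores the UCS-2 surrogate-pair question entirely.

```python
import unicodedata

# Short forms listed in the quoted proposal.
_SHORT = {"\r": "\\r", "\n": "\\n", "\t": "\\t", "\\": "\\\\"}


def needs_escape(ch):
    # Assumption: "non-printable" means general category C* or Z*,
    # with ASCII space left alone.
    if ch == " ":
        return False
    return unicodedata.category(ch)[0] in ("C", "Z")


def hex_escape(text):
    out = []
    for ch in text:
        cp = ord(ch)
        if ch in _SHORT:
            out.append(_SHORT[ch])
        elif not needs_escape(ch):
            out.append(ch)
        elif cp <= 0xFF:
            out.append("\\x%02x" % cp)   # Latin-1 range
        elif cp <= 0xFFFF:
            out.append("\\u%04x" % cp)   # rest of the BMP
        else:
            out.append("\\U%08x" % cp)   # everything else
    return "".join(out)
```

With these assumptions, NBSP comes out as \xa0 and EM SPACE as \u2003, while
printable non-ASCII letters pass through unchanged.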

I realize that this is the traditional escape form, but I wonder if it
might be better to just use the Unicode character names instead of the
hex character codes.  The names can be written in ASCII, they are
unambiguous, and they are easier to understand than an opaque hex
value.
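
A rough sketch of what name-based escaping might look like, using the \N{...}
form that Python string literals already accept. The function name is
hypothetical, and the choice of which characters to escape (general category
C* or Z*, except ASCII space) is an assumption for illustration only.
Characters without a name in the Unicode database, such as the C0 controls,
fall back to a hex escape here.

```python
import unicodedata


def name_escape(text):
    out = []
    for ch in text:
        if ch == "\\":
            # Double the backslash so the output stays unambiguous.
            out.append("\\\\")
            continue
        # Assumption: only C* and Z* characters (except space) are escaped.
        if ch == " " or unicodedata.category(ch)[0] not in ("C", "Z"):
            out.append(ch)
            continue
        name = unicodedata.name(ch, None)
        if name is None:
            # No name available (e.g. control characters): use hex instead.
            out.append("\\U%08x" % ord(ch))
        else:
            out.append("\\N{%s}" % name)
    return "".join(out)
```

Under these assumptions, name_escape("a\u00a0b") gives
a\N{NO-BREAK SPACE}b, which is longer than a\xa0b but says what the
character is.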

-jJ
_______________________________________________
Python-3000 mailing list
Python-3000@python.org
http://mail.python.org/mailman/listinfo/python-3000