On 6/14/07, Stephen J. Turnbull <[EMAIL PROTECTED]> wrote:
>  > There are also plenty of things that a native speaker may view as a
>  > single character, but which unicode treats as (at most) a Named
>  > Sequence.

> Eg, the New Line Function (Unicode's name for "universal newline"),
> which can be any of the usual suspects (CR, LF, CRLF) depending on
> context.

I hadn't even thought of such abstract chracters; I was thinking of
(Normative Appendix) UAX 34: Unicode Named Character Sequences at
http://unicode.org/reports/tr34/

These are more like õ, or the NJ digraph, except that a
single-character equivalent has not been coded (and probably never
will be coded -- see
http://www.unicode.org/faq/ligature_digraph.html#3).

The current list of named sequences is available at
http://www.unicode.org/Public/UNIDATA/NamedSequences.txt

-jJ
_______________________________________________
Python-3000 mailing list
[email protected]
http://mail.python.org/mailman/listinfo/python-3000
Unsubscribe: 
http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com

Reply via email to