On 6/14/07, Stephen J. Turnbull <[EMAIL PROTECTED]> wrote: > > There are also plenty of things that a native speaker may view as a > > single character, but which unicode treats as (at most) a Named > > Sequence.
> Eg, the New Line Function (Unicode's name for "universal newline"), > which can be any of the usual suspects (CR, LF, CRLF) depending on > context. I hadn't even thought of such abstract chracters; I was thinking of (Normative Appendix) UAX 34: Unicode Named Character Sequences at http://unicode.org/reports/tr34/ These are more like õ, or the NJ digraph, except that a single-character equivalent has not been coded (and probably never will be coded -- see http://www.unicode.org/faq/ligature_digraph.html#3). The current list of named sequences is available at http://www.unicode.org/Public/UNIDATA/NamedSequences.txt -jJ _______________________________________________ Python-3000 mailing list [email protected] http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com
