Re: Python Unicode handling wins again -- mostly

Dave Angel Fri, 29 Nov 2013 19:07:35 -0800

On Fri, 29 Nov 2013 21:28:47 -0500, Roy Smith <r...@panix.com> wrote:

In article <mailman.3417.1385777557.18130.python-l...@python.org>,
 Chris Angelico <ros...@gmail.com> wrote:
> On Sat, Nov 30, 2013 at 1:08 PM, Roy Smith <r...@panix.com> wrote:

> > I would certainly expect, x.lower() == x.upper().lower(), to be

True for

> > all values of x over the set of valid unicode codepoints.

Having

> > u"\uFB04".upper() ==> "FFL" breaks that. I would also expect

len(x) ==

> > len(x.upper()) to be True.

> That's a nice theory, but the Unicode consortium disagrees with

you on

> both points.

And they were already false long before Unicode. I don’t knowspecifics but there are many cases where there are no uppercaseequivalents for a particular lowercase character. And others wherethe uppercase equivalent takes multiple characters.


--
DaveA

--
https://mail.python.org/mailman/listinfo/python-list

Re: Python Unicode handling wins again -- mostly

Reply via email to