On 6/6/07, Greg Ewing <[EMAIL PROTECTED]> wrote:
> Are you suggesting that this should be done on the fly
> when comparing strings? Or that all strings should be
> stored in canonicalised form?

Preferably the second; store them canonicalized.

> I can see some big cans of worms being opened up by
> either approach. Surprising results could include
> things like s1 == s2 but len(s1) <> len(s2), or
> len(s1 + s2) <> len(s1) + len(s2).

Yes, these are surprising, but that is the nature of Unicode. People will get used to it, with the same pains they face now over "1" + "1" = "11", or output that doesn't line up because one row had a single-digit number.

-jJ
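
For concreteness, here is a rough sketch of the two surprises quoted above. It assumes NFC as the canonical form (the thread does not name one) and uses the standard unicodedata module; nfc and canon_equal are hypothetical helper names, not a proposed API.

```python
import unicodedata


def nfc(s):
    """Return s in NFC (assumed here to be the canonical form)."""
    return unicodedata.normalize("NFC", s)


def canon_equal(a, b):
    """Hypothetical on-the-fly comparison: equal if the NFC forms match."""
    return nfc(a) == nfc(b)


# Surprise 1: comparing on the fly, s1 == s2 but len(s1) != len(s2).
composed = "\u00e9"      # e-acute as a single code point
decomposed = "e\u0301"   # 'e' followed by COMBINING ACUTE ACCENT
equal_but_different_length = (
    canon_equal(composed, decomposed) and len(composed) != len(decomposed)
)

# Surprise 2: storing canonicalized, len(s1 + s2) != len(s1) + len(s2).
# Each piece is already in NFC, but their concatenation composes
# into a single code point when it is normalized again.
s1, s2 = nfc("e"), nfc("\u0301")
joined = nfc(s1 + s2)
concat_shrinks = len(joined) != len(s1) + len(s2)

print(equal_but_different_length, concat_shrinks, len(joined))
```

Both effects come from the same fact: normalization is not closed under concatenation, so a string type that always stores the canonical form has to renormalize at the join point.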
