https://bugzilla.wikimedia.org/show_bug.cgi?id=33507

--- Comment #15 from [email protected] 2012-01-13 17:45:25 UTC ---
(In reply to comment #12)
> original first, then current live value:
> 
> >>> print "\n".join(repr(s.decode('utf-8')) for s in 
> >>> ("\xe0\xa7\xb1\xe0\xa6\xbf\xe0\xa6\x95\xe0\xa6\xbf\xe0\xa6\xaa\xe0\xa6\xbf\xe0\xa6\xa1\xe0\xa6\xbf\xe0\xa7\x9f\xe0\xa6\xbe\x20","\xe0\xa7\xb1\xe0\xa6\xbf\xe0\xa6\x95\xe0\xa6\xbf\xe0\xa6\xaa\xe0\xa6\xbf\xe0\xa6\xa1\xe0\xa6\xbf\xe0\xa6\xaf\xe0\xa6\xbc\xe0\xa6\xbe"))
> u'\u09f1\u09bf\u0995\u09bf\u09aa\u09bf\u09a1\u09bf\u09df\u09be '
> u'\u09f1\u09bf\u0995\u09bf\u09aa\u09bf\u09a1\u09bf\u09af\u09bc\u09be'
> 
> ৱিকিপিডিয়া 
> ৱিকিপিডিয়া
> 
> Looks like it's 1 char longer? 1 was replaced with 2 new ones. I'm just seeing
> boxes (not letters, must need a font) so I definitely could use some help from
> a native.
> 

from unicode \u09df is canonically equivalent to \u09af\u09bc. they are exactly
same, except for normalization.

http://www.fileformat.info/info/unicode/char/9DF/index.htm

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to