[issue12846] unicodedata.normalize turkish letter problem

2011-08-28 Thread Ezio Melotti
Changes by Ezio Melotti : -- nosy: +ezio.melotti stage: -> committed/rejected ___ Python tracker ___ ___ Python-bugs-list mailing lis

[issue12846] unicodedata.normalize turkish letter problem

2011-08-28 Thread Terry J. Reedy
Terry J. Reedy added the comment: You are doing two different things to the original string: normalizing and encoding to ascii with errors ignored. Each should be tested separately. On 3.2: import unicodedata s1 = "üfürükçü ağaç ve ıslıkçı çeşme" s2 = unicodedata.normalize('NFKD', s1) print(s2

[issue12846] unicodedata.normalize turkish letter problem

2011-08-26 Thread Cem YILDIZ
Cem YILDIZ added the comment: unicodedata.normalize cannot convert turkish letter "ı" into "i": import unicodedata s = u"üfürükçü ağaç ve ıslıkçı çeşme" print unicodedata.normalize('NFKD', s).encode('ascii','ignore') >> ufurukcu agac ve slkc cesme but the result should be >> ufurukcu agac v

[issue12846] unicodedata.normalize turkish letter problem

2011-08-26 Thread Cem YILDIZ
Changes by Cem YILDIZ : -- type: -> behavior ___ Python tracker ___ ___ Python-bugs-list mailing list Unsubscribe: http://mail.pytho

[issue12846] unicodedata.normalize turkish letter problem

2011-08-26 Thread Cem YILDIZ
New submission from Cem YILDIZ : unicodedata.normalize cannot convert turkish letter "ı" into "i": import unicodedata s = u"üfürükçü ağaç ve ıslıkçı çeşme" print(shoehorn_unicode_into_ascii(s)) print unicodedata.normalize('NFKD', s).encode('ascii','ignore') >> ufurukcu agac ve slkc cesme but