So I've added unicode support to my dbf package, but I also have some rather large programs that aren't ready to make the switch yet. As a workaround I added a (rather lame) option to convert the unicode data that was decoded from the dbf table back into its original encoded form.
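For what it's worth, the option amounts to something along these lines (just a sketch; reencode_record and the record-as-a-dict layout are made up for illustration, not the actual dbf API):

    def reencode_record(record, codepage='cp437'):
        # record is assumed to be a dict of field name -> decoded value;
        # turn any unicode values back into byte strings in the table's
        # original codepage and leave everything else alone
        return dict((name, value.encode(codepage)
                           if isinstance(value, unicode) else value)
                    for name, value in record.items())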

Here's the fun part: in figuring out what the option should be for use with my system, I tried some tests...

Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> print u'\xed'
í
>>> print u'\xed'.encode('cp437')
í
>>> print u'\xed'.encode('cp850')
í
>>> print u'\xed'.encode('cp1252')
φ
>>> import locale
>>> locale.getdefaultlocale()
('en_US', 'cp1252')

My confusion lies in my apparent codepage (cp1252) and the discrepancy with the character u'\xed', which is definitely an i with an acute accent; yet when I encode it with cp1252 and print the result, I get φ instead (an o with a line through it).
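For what it's worth, looking at the raw bytes instead of printing them might make the discrepancy easier to see; a quick diagnostic along these lines shows what each codec actually produces, plus whatever encoding the console itself reports:

    import sys

    ch = u'\xed'
    for codec in ('cp437', 'cp850', 'cp1252'):
        # cp437 and cp850 both produce '\xa1'; cp1252 produces '\xed'
        print codec, repr(ch.encode(codec))
    # the encoding sys.stdout says it is using, which need not match
    # what locale.getdefaultlocale() reports
    print 'stdout encoding:', sys.stdout.encoding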

Can anybody clue me in to what's going on here?

~Ethan~
