I forgot to add that I'm using elementtree to process the xml files and don't (usually) have any problems with that. Plus, the workaround that works is to encode each elementtree output ie.:
thisxmlline = thisxmlline.encode('utf8') But, this seems odd to me as isn't it already being processed as utf-8? Dinesh From: Dinesh B Vadhia Sent: Thursday, June 04, 2009 6:47 AM To: tutor@python.org Subject: unicode, utf-8 problem again Hi! I'm processing a large number of xml files that are all declared as utf-8 encoded in the header ie. <?xml version="1.0" encoding="UTF-8"?> My Python environment has been set for 'utf-8' through site.py. Additionally, the top of each program/module has the declaration: # -*- coding: utf-8 -*- But, I still get this error: Traceback (most recent call last): ... UnicodeEncodeError: 'ascii' codec can't encode character u'\u201c' in position 76: ordinal not in range(128) What am I missing? Dinesh
_______________________________________________ Tutor maillist - Tutor@python.org http://mail.python.org/mailman/listinfo/tutor