On Fri, May 26, 2006 at 09:22:41PM +0100, Bryan Lawrence wrote:
>
> Does elementtree and/or expat need to know the encoding to get this right?
> (which may be a problem coz this could be from anyone's document in any
> encoding ...)
>
I think you will have to tell elementtree what encoding your XML is in.
Otherwise how would it know? I am sure there is a better way, but I have
seen people try to guess encodings like this:

    # untested and from my bad memory :-)
    encodings = ['utf-8', 'utf-16', 'iso-8859-1']
    for encoding in encodings:
        try:
            unicode(s, encoding)
        except UnicodeError:
            pass
        else:
            # first encoding that decodes s without an error
            break

The encodings list would be a list of common encodings that you may
expect. Again, there must be a better way to do this... I would suggest
that you try to set a standard for encodings.

David Stanek
-- 
http://www.traceback.org
GPG keyID #6272EDAF on http://pgp.mit.edu
Key fingerprint = 8BAA 7E11 8856 E148 6833 655A 92E2 3E00 6272 EDAF
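
For reference, the guessing loop above can be written as a small function. This is only a sketch under stated assumptions: it uses Python 3 syntax (bytes.decode() in place of the unicode() call), and guess_encoding and its candidates parameter are hypothetical names, not part of elementtree or expat. The order of the candidates matters. iso-8859-1 accepts any byte sequence, so it has to come last, and utf-16 will also accept a lot of input that is not really UTF-16, so a successful decode is a guess and not proof.

```python
def guess_encoding(data, candidates=("utf-8", "utf-16", "iso-8859-1")):
    """Return (encoding, text) for the first candidate that decodes data.

    Hypothetical helper: a successful decode only means the bytes are
    valid in that encoding, not that the guess is actually right.
    """
    for encoding in candidates:
        try:
            text = data.decode(encoding)
        except UnicodeError:
            # Not valid in this encoding, try the next candidate.
            continue
        return encoding, text
    raise ValueError("none of the candidate encodings could decode the data")


# Example: bytes that happen to be valid UTF-8.
raw = "<doc>caf\u00e9</doc>".encode("utf-8")
encoding, text = guess_encoding(raw)
```

If the document carries an XML declaration with an encoding attribute, the parser can normally pick the encoding up from there, so guessing like this should only be needed when that information is missing or wrong.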
_______________________________________________
XML-SIG maillist  -  XML-SIG@python.org
http://mail.python.org/mailman/listinfo/xml-sig