Hello,
very often I have the following problem: I write a program that processes many
files which it assumes to be encoded in utf-8. Then, some day, I there is a
non-utf-8 character in one of several hundred or thousand (new) files. The
program exits with an error message like this:
UnicodeDecodeError: 'utf8' codec can't decode byte 0xe4 in position 60: invalid
continuation byte
I usually solve the problem by moving files around and by recoding them.
What I really want to do is use something like
try:
# open file, read line, or do something else, I don't care
except UnicodeDecodeError:
sys.exit("Found a bad char in file " + file + " line " + str(line_number)
Yet, no matter where I put this try-except, it doesn't work.
How should I use try-except with UnicodeDecodeError?
Jaroslav
--
http://mail.python.org/mailman/listinfo/python-list