tags 348377 + pending
thanks

On Mon, 2006-01-16 at 18:27 +0100, Francesco Potortì wrote:
> The following command line:
>  webcheck --wait=1 --force --output=/home/www/reports/potorti
>          -i potorti.it -y width= -y /.cache/ http://www.potorti.it/
> 
> fails with this error:
[...]
> UnicodeDecodeError: 'ascii' codec can't decode byte 0xec in position 23: 
> ordinal not in range(128)

This has been tracked down to a link that has non-ascii characters in
it. I have fixed webcheck to firstly not fail on non-ascii characters
any more when generating a report and secondly to escape non-ascii
characters when parsing html.

In any case it is better to avoid urls with non-ascii characters in it
because a link to potortì.html in an utf-8 encoded document results in
another url than the same link in an iso-8859-1 encoded document.

This fix will be included in the next release of webcheck. Thanks for
your bug report and thanks for testing webcheck.

-- 
-- arthur - [EMAIL PROTECTED] - http://people.debian.org/~adejong --

Attachment: signature.asc
Description: This is a digitally signed message part

Reply via email to