https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7520

--- Comment #22 from Henrik Krohns <[email protected]> ---

Captured HTML::Parser warnings to info() messages.

Added better logging, now you can actually see decoding errors

dbg: message: attempt to decode as UTF-8 failed, declared utf-8 (UTF-8 "\x92"
does not map to Unicode)
dbg: message: decoded as last-resort charset Windows-1252, declared utf-8
dbg: message: HTML::Parser utf8_mode off (default, assumed Unicode characters)
info: message: HTML::Parser warning: Parsing of undecoded UTF-8 will give
garbage when decoding entities

Sending        spamassassin-3.4/lib/Mail/SpamAssassin/HTML.pm
Sending        spamassassin-3.4/lib/Mail/SpamAssassin/Message/Node.pm
Sending        trunk/lib/Mail/SpamAssassin/HTML.pm
Sending        trunk/lib/Mail/SpamAssassin/Message/Node.pm
Transmitting file data ....done
Committing transaction...
Committed revision 1864377.

So if it's illegal UTF-8, is there anything else we can do?

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to