https://bz.apache.org/SpamAssassin/show_bug.cgi?id=8218

Sidney Markowitz <sid...@sidney.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |sid...@sidney.com

--- Comment #1 from Sidney Markowitz <sid...@sidney.com> ---
Created attachment 5942
  --> https://bz.apache.org/SpamAssassin/attachment.cgi?id=5942&action=edit
Test email

Can you attach a full test email that demonstrates the problem? I made your
snippet into an email with quoted-printable and ran

    ./spamassassin -t -D uri,message < uriwraptest.eml

Did I miss something? I am testing using current trunk. Here is the debug
output showing the correct URL being parsed:

Mar 15 20:23:08.229 [99779] dbg: message: _decode_header date: Thu, 2 May 2002 00:02:49 +1200
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header subject: foo
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header to: <b...@example.org>
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header from: <b...@example.com>
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header message-id: <intm-6516584-3669405-2002.08.01-16.21.51...@example.com>
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header mime-version: 1.0
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header content-type: text/html; charset=US-ASCII
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header content-transfer-encoding: quoted-printable
Mar 15 20:23:08.230 [99779] dbg: message: main message type: text/html
Mar 15 20:23:08.235 [99779] dbg: message: ---- MIME PARSER START ----
Mar 15 20:23:08.235 [99779] dbg: message: parsing normal part
Mar 15 20:23:08.235 [99779] dbg: message: storing a body to memory
Mar 15 20:23:08.235 [99779] dbg: message: ---- MIME PARSER END ----
Mar 15 20:23:08.235 [99779] dbg: message: decoding quoted-printable
Mar 15 20:23:08.235 [99779] dbg: message: contains only US-ASCII characters, declared US-ASCII, not decoding
Mar 15 20:23:08.235 [99779] dbg: message: HTML::Parser utf8_mode off (default, assumed Unicode characters)
Mar 15 20:23:08.236 [99779] dbg: message: spaces (octets) in HTML: 3 out of 21, chars!?
Mar 15 20:23:08.242 [99779] dbg: uri: canonicalizing html uri: http://hashbltest.surbl.org/example_uri
Mar 15 20:23:08.242 [99779] dbg: uri: cleaned uri: http://hashbltest.surbl.org/example_uri
Mar 15 20:23:08.242 [99779] dbg: uri: added host: hashbltest.surbl.org domain: surbl.org

-- 
You are receiving this mail because:
You are the assignee for the bug.
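
For anyone trying to reproduce this outside SpamAssassin, here is a minimal Python sketch of what the "decoding quoted-printable" step in the debug output is expected to do to a wrapped URL. The encoded body is a hypothetical stand-in, not the contents of attachment 5942 or of the original snippet: it assumes the URL was split by a quoted-printable soft line break ("=" at end of line). It uses only the standard-library quopri module and says nothing about how SpamAssassin itself implements the decoding.

```python
import quopri

# Hypothetical quoted-printable HTML body (an assumption, not the real
# attachment): "=3D" encodes the "=" in href=, and the trailing "=" before
# CRLF is a soft line break that wraps the URL across two lines.
encoded = (
    b'<a href=3D"http://hashbltest.surbl.org/exam=\r\n'
    b'ple_uri">link</a>\r\n'
)

# Decoding removes the soft line break, so the URL is whole again
# before any HTML or URI parsing takes place.
decoded = quopri.decodestring(encoded).decode("ascii")
print(decoded)
```

If the decoded text shows the URL in one piece, the wrapping alone should not prevent a parser from seeing http://hashbltest.surbl.org/example_uri, which matches the "cleaned uri" line in the debug output above.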