https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6788
Kevin A. McGrail <[email protected]> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|RESOLVED |REOPENED Resolution|WORKSFORME |--- --- Comment #6 from Kevin A. McGrail <[email protected]> --- (In reply to comment #5) > this is not a duplicate of bug 6751. The dot is always 2E hex. Sorry about that. I viewed that as an alternate character being used and lumped it together. Reopening though I tried your small fix in HTML.pm Index: lib/Mail/SpamAssassin/HTML.pm =================================================================== --- lib/Mail/SpamAssassin/HTML.pm (revision 1338322) +++ lib/Mail/SpamAssassin/HTML.pm (working copy) @@ -240,6 +240,10 @@ # the HTML::Parser API won't do it for us $text =~ s/<(\w+)\s*\/>/<$1>/gi; + # Bug 6788 https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6788 + # we want a space after a closing tag so that URLs aren't lumped together + $text =~ s/>/> /g; + # Ignore stupid warning that can't be suppressed: 'Parsing of # undecoded UTF-8 will give garbage when decoding entities at ..' (bug 4046) { This breaks html_obfu.t t/html_obfu.t 9 5 55.56% 1-5 Thoughts? -- You are receiving this mail because: You are the assignee for the bug.
