http://bugzilla.spamassassin.org/show_bug.cgi?id=3163

           Summary: Obfuscation FP when obfuscating tag starts line or
                    punctuation follows tag.
           Product: Spamassassin
           Version: SVN Trunk (Latest Devel Version)
          Platform: Other
        OS/Version: other
            Status: NEW
          Severity: normal
          Priority: P5
         Component: Rules (Eval Tests)
        AssignedTo: [EMAIL PROTECTED]
        ReportedBy: [EMAIL PROTECTED]


The code in SpamAssassin::HTML::html_text falsely (IMO) suspects
obfuscation when the text before the obfuscating tag ends in a
newline (the text after the tag starts with a non-whitespace
character, which is fine).  The problem is fixed by using a different
regexp assertion, \z instead of $: $ also matches just before a
trailing newline, while \z matches only at the very end of the string.

For example, the code considers the anchor below obfuscating because
the newline after "see" is not counted as whitespace:

Please see
<a href="http://www.icseng.info/">http://www.icseng.info/</a> for more
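
A minimal sketch of the difference between the two assertions, using
Python's re module as a stand-in for the Perl regexp.  The pattern is an
assumption for illustration, not the actual html_text code; Python's $
behaves like Perl's here, and Python's \Z corresponds to Perl's \z.

```python
import re

# Text that precedes the <a> tag in the example above.
before = "Please see\n"

# `$` also matches just before a trailing newline, so the "e" of "see"
# is treated as the last character of the text.
ends_nonspace_dollar = re.search(r"\S$", before) is not None

# `\Z` (Perl's `\z`) matches only at the very end of the string, where
# the last character is the newline, so the check fails as intended.
ends_nonspace_strict = re.search(r"\S\Z", before) is not None

print(ends_nonspace_dollar, ends_nonspace_strict)  # True False
```

With the stricter assertion the trailing newline counts as whitespace
before the tag, so the anchor is no longer flagged.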

The code also currently suspects obfuscation if the text after
the tag starts with any non-whitespace character, even if it's
just punctuation.  For example, in

<a href="mailto:[EMAIL PROTECTED]"><u>[EMAIL PROTECTED]</a></u>;

the </a> and </u> are suspected of obfuscating "[EMAIL PROTECTED]" and ";".

This second case is not exactly a bug but something that ought to
be addressed.

The patch below fixes both problems.  To be considered obfuscating, the
text after the tag must start with a run of non-whitespace characters
that contains at least one alphabetic character (in any position).
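
The rule for the text after the tag could be sketched roughly as
follows.  This is Python rather than the Perl of the actual patch, the
function name after_text_looks_obfuscated is hypothetical, and
str.isalpha (which accepts any Unicode letter) is an assumption about
what "alphabetic" should mean.

```python
import re

def after_text_looks_obfuscated(text_after: str) -> bool:
    """Hypothetical helper: True if the text starts with a run of
    non-whitespace characters containing at least one letter."""
    run = re.match(r"\S+", text_after)  # leading non-whitespace run, if any
    if run is None:                     # empty, or starts with whitespace
        return False
    return any(ch.isalpha() for ch in run.group())

print(after_text_looks_obfuscated(";"))           # False: punctuation only
print(after_text_looks_obfuscated(" for more"))   # False: starts with whitespace
print(after_text_looks_obfuscated("ical offer"))  # True: letters follow the tag
```

Under this rule the ";" in the mailto example is ignored, while a tag
that splits a word is still caught.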


