http://bugzilla.spamassassin.org/show_bug.cgi?id=3163
Summary: Obfuscation FP when obfuscating tag starts line or
punctuation follows tag.
Product: Spamassassin
Version: SVN Trunk (Latest Devel Version)
Platform: Other
OS/Version: other
Status: NEW
Severity: normal
Priority: P5
Component: Rules (Eval Tests)
AssignedTo: [EMAIL PROTECTED]
ReportedBy: [EMAIL PROTECTED]
The code in SpamAssassin::HTML::html_text falsely (IMO) suspects
obfuscation when the text before the tag ends in a newline and the
text after the tag starts with a non-whitespace character, which is a
perfectly normal layout. The check anchors on $, which also matches
just before a trailing newline, so the newline is not seen as
whitespace. Using the \z assertion instead of $ fixes the problem.
For example, the code considers the following anchor obfuscating
because the newline after "see" is not counted as whitespace:
Please see
<a href="http://www.icseng.info/">http://www.icseng.info/</a> for more
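The difference between the two assertions can be reproduced outside
SpamAssassin. The following is only a minimal sketch in Python, not the
Perl code from html_text: Python's $ behaves like Perl's $ here, and
Python's \Z corresponds to Perl's \z. The variable names and the \S
pattern are assumptions made for illustration.

```python
import re

# Text that precedes the <a> tag in the example above; it ends in a newline.
before_tag = "Please see\n"

# `$` matches at the end of the string AND just before a trailing newline,
# so the "e" of "see" appears to touch the tag directly.
with_dollar = re.search(r"\S$", before_tag)

# `\Z` (Python's counterpart of Perl's `\z`) matches only at the very end
# of the string, so the trailing newline is seen and nothing matches.
with_strict_end = re.search(r"\S\Z", before_tag)

print(with_dollar is not None)      # True: false positive
print(with_strict_end is not None)  # False: newline counts as whitespace
```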
The code also currently suspects obfuscation if the text after
the tag starts with any non-whitespace character, even if it's
just punctuation. For example, in
<a href="mailto:[EMAIL PROTECTED]"><u>[EMAIL PROTECTED]</a></u>;
the </a> and </u> tags are suspected of obfuscating "[EMAIL PROTECTED]" and ";".
This second case is not exactly a bug, but it ought to be addressed.
The patch below fixes both problems. To be considered obfuscating, the
text after the tag must start with a run of non-whitespace characters
that contains at least one alphabetic character (in any position).
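The rule just described can be sketched as follows. This is not the
submitted patch, only a Python illustration of the stated criterion; the
function name looks_obfuscating and its argument are hypothetical, and
"alphabetic" is approximated with str.isalpha().

```python
import re

def looks_obfuscating(text_after_tag: str) -> bool:
    """Hypothetical check: the text right after the tag must begin with a
    run of non-whitespace characters that contains at least one letter."""
    # re.match anchors at the start, so leading whitespace means no run.
    run = re.match(r"\S+", text_after_tag)
    return bool(run) and any(ch.isalpha() for ch in run.group())

# Punctuation directly after the tag is no longer suspicious.
print(looks_obfuscating("; and more"))  # False
# Whitespace after the tag was never suspicious.
print(looks_obfuscating(" for more"))   # False
# Letters glued to the tag still count, in any position of the run.
print(looks_obfuscating("(tm)ing"))     # True
```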