Re: Hidden text.

Alan via users Sat, 16 Aug 2025 07:41:50 -0700


On 2025-08-16 05:12, giova...@paclan.it wrote:

On 8/14/25 12:06 PM, Mark London wrote:
Hi - These days, how often do spammers at additional random hiddentext? For example, I found a ton of extra text at the bottom of anemail, that was hidden by the following HTML:
<div style="font-family:Cambria, 'Hoefler Text', 'Liberation Serif',Times, 'Times New Roman', serif; width:1px; line-height:0;overflow:hidden">
Is this a common enough spammer technique, for me to write a rule totest for it? Thanks! - Mark
I tried to write something similar in the past(https://github.com/apache/spamassassin/blob/trunk/rulesrc/sandbox/gbechis/20_html.cf)but I disabled them
since they hit lot of commercial newsletters as well.
We can write a rule and I can add it to my sandbox for testing, couldyou share (privately if you prefer) a spam email using this technique ?
 Thanks
  Giovanni


I have a rule that looks for much text with no height:

full        __AB_LINEZERO /line-height?:\s*[0][^0-9.])[^<]{1000,}</i

And another that looks for a lot of small font text:

full        __AB_SMALLFONT /font(-size)?:\s*[0-8](\.[0-9]*)?px[^<]{1000,}</i

Usually this is an attempt to get around Bayesian classification, so Ialso have these:


full        __AB_BAYES_BUSTING_DIV /<div.{10000,}?<\/div>/
full        __AB_BAYES_BUSTING_P /<p.{10000,}?<\/p>/

I use these as meta rules, so for example if the message also comes froma domain commonly used for spam (one of the KAM rules), I'll bump thescore by a bit (maybe 0.2 to 0.5) for each.

In my experience no one indicator is reliable (except in the mostinsidious cases which get scores like 20.0). Spam usually triggersseveral other rules, which pushes the overall score over 5, which iswhere I have the flag threshold set, and sometimes over 10 where theyget sent to write-only memory.


--
For SpamAssassin Users List

Re: Hidden text.

Reply via email to