Re: Rule writing: new text obfuscation mechanism

Joe Quinn Fri, 06 Jun 2014 05:30:30 -0700

The way we handle it in http://www.pccc.com/downloads/SpamAssassin/contrib/KAM.cf is to use a regex like /this.advertisement/ unanchored by \b.

When matching against phrases like yours, we find the word boundary does not add any specificity to the rule because the odds of matching against a different word or phrase are nil, and we catch almost every obfuscation of word boundaries.
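A minimal sketch of that difference, written in Python rather than as a SpamAssassin rule, on the assumption that Python's re module treats \b and the underscore the same way Perl does for this case. The sample strings are made up for illustration and are not taken from real spam.

```python
import re

# Unanchored pattern: "." matches any single character between the words
# (space, underscore, hyphen, ...), and there is no \b to defeat.
unanchored = re.compile(r"this.advertisement", re.IGNORECASE)

# Anchored pattern with a literal space, for comparison.
anchored = re.compile(r"\bthis advertisement\b", re.IGNORECASE)

# Hypothetical sample lines, one plain and two obfuscated.
samples = [
    "remove yourself from this advertisement now",
    "remove_yourself_from_this_advertisement_now",
    "remove-yourself-from-this-advertisement-now",
]

for text in samples:
    print(f"{text!r}: unanchored={bool(unanchored.search(text))} "
          f"anchored={bool(anchored.search(text))}")
```

The unanchored pattern hits all three samples, while the anchored one only hits the plain-space version.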

Good catch though, we do have some rules in KAM.cf that can be avoided by this, and off the top of my head I can think of several stock SA rules that are vulnerable too.


On 6/5/2014 9:44 PM, John Hardin wrote:

All:
I've run across a new text obfuscation method in active use by spammers. It appears to be an attempt to bypass RE-based text matching of words. Rules you write will need modification to not be spoofed by this.
Unfortunately the RE engine considers the underscore as being a "word" character, so a rule like /\bthis advertisement\b/ can be defeated by replacing the spaces in the sentence with underscores. This is still readable to a human but foils the word-boundary check.
Recommendation: instead of a bare \b, use (?:\b|_) and instead of embedded spaces use [-_\s].
Examples:

Manage_advertising_preferences_here

To_remove_yourself_from_this_admail,_please_do_so_here

Be_removed_from_this_important_offer
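
The recommendation can be checked against the examples above with a short sketch. It is written in Python, on the assumption that its re module handles \b and the underscore like Perl's engine does here. The helper name harden is hypothetical and only serves the illustration; it is not part of SpamAssassin.

```python
import re

def harden(phrase):
    """Build a pattern for `phrase` using (?:\\b|_) in place of a bare \\b
    and [-_\\s] in place of each embedded space (hypothetical helper)."""
    words = [re.escape(word) for word in phrase.split()]
    return re.compile(r"(?:\b|_)" + r"[-_\s]".join(words) + r"(?:\b|_)",
                      re.IGNORECASE)

# Naive rule: bare \b anchors and literal spaces.
naive = re.compile(r"\bthis important offer\b", re.IGNORECASE)
hardened = harden("this important offer")

plain = "Be removed from this important offer"
obfuscated = "Be_removed_from_this_important_offer"

print(bool(naive.search(plain)), bool(naive.search(obfuscated)))
print(bool(hardened.search(plain)), bool(hardened.search(obfuscated)))

# The same construction applied to another of the examples.
admail = harden("this admail")
print(bool(admail.search("To_remove_yourself_from_this_admail,_please_do_so_here")))
```

The naive rule only matches the plain-space text, while the hardened pattern matches both the plain and the underscore-joined versions.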
