Hello all,

I noticed recently that our users are getting spam with the subject similar to the following:

SehxpyNaturalRedheaddFayeReaganHasHerFirstLesbianExperienceWithBrunet

SpamAssassin seems to be having a hard time determining whether it is spam or not because it appears as one long word.

In all cases, the subject contains no spaces (to prevent detection I would think) and is longer than 62 characters (not sure why they do this, but it is true in every sample I've seen so far).

I would like to create a rule to pick up on this, but having a bit of difficult with the regex for the rule. This is what I've come up with so far

header CR_SUBJECT_SPAMMY    Subject =~ /.{62}/
describe CR_SUBJECT_SPAMMY Subject looks spammy (contains a lot of characters, and no spaces)
score CR_SUBJECT_SPAMMY     2.5

I just need to modify the regex to check that the Subject contains no spaces.

I've done some research, and the longest non-coined word in a major dictionary is 30 characters long, meaning that if it was used twice in a subject, the total length would still only be 60 characters, There may be some FPs if the sender used formatting like commas and such, but the possibility of them using 2 of the word, then formatting without spacing, would probably be extremely remote.

Any assistance or advice would be greatly appreciated.

Regards,

Lawrence Williams
LCWSoft

Reply via email to