Cybertime Hostmaster wrote:

>>Is anyone blocking on the occurance of one or more ^ in a Subject:
> 
> header?
> 
>>If so, what are your experiences?  Most of the spam that is getting
> 
> through
> 
>>here contains at least one and I don't typically see valid mail with
>>one--although I'm sure they are common on the Exponential Math mailing
> 
> list ;-)
> 
> Lately I have seen a lot of "junk characters" in spam subject lines.  The
> use of the [EMAIL PROTECTED]&*()`'<>,./?;:'"~ is what I mean by that.
> 
> It seems to be an attempt by spammers to bypass filters based off of pure
> subject matching.
> 
> My question, in reply to yours, is do you think it would be good to aim
> for ^, or do you think it would be better to aim for the whole lot?

Of the one's I've seen, the ^ stands out as being the one most likely to be 
used in a valid email.  Of the list you note above, probably the backtick is 
another likely candidate--the others are too common in valid email.

> 
> Obviously there needs to be a limit where 4 or 5 of these characters would
> not trigger the REJECT.  Then, the mail that goes over the limit would be
> slammed.
> 
> This would, unfortunately, require the use of match all between the blocks
> tested.
> 
> So I think the regexp involved would be something along the lines of this:
> 
> /^Subject:.*[:^alnum:].*[:^alnum:].*[:^alnum:].*[:^alnum:].*[:^alnum:].*[:
> ^alnum:]/
> 
> Anyone see anything majorly wrong with that?

No, but only because I don't understand it ;-)

 > If not, it would be worth it
> to test with a "WARN subject_junk ${1}" to see what it hits, and adjust as
> needed.

Good idea.  I think I'll start w/something simple to test:
/^Subject:(.*(\^|~))/ WARN subject_junk ${1}

-- 
Chris Scott
Host Orlando, Inc
http://www.hostorlando.com/


Reply via email to