Alex wrote:
Hi,

I'm having a problem with emails that are from a freemail domain with
simply a shorturl in them, like this:

<br><a 
href=http://bit.ly/aqI4o1>http://bit.ly/aqI4o1/Benjamin</a><br><br><br>lovee
ya<br>

rawbody    LOC_BITLY
/href\=http:\/\/bit\.ly\/.+\w{1,8}>http:\/\/bit\.ly\/.+\w{1,15}\/.+\w{1,15}<\/a><br><br>/

Is this the most effective and best way to accomplish this? I believe
it works (reliably?) but am concerned about what seemed to be
excessive memory usage and false positives, obviously. Do you have any
suggestions to improve this?

It also seems that no matter how many times I train these they don't
score higher than BAYES_50, at least the FNs.

Thanks,
Alex


I'm seeing these mostly from hotmail accounts so I use a URI rule (rather than your rawbody example) and meta it with FROM_HOTMAIL. For example,

uri             LOCAL_URI_BITLY         m{https?://bit\.ly/\w{6}}
describe        LOCAL_URI_BITLY         contains bit.ly link

meta            LOCAL_HOTMAIL_SPAM_URI  (__FROM_HOTMAIL_COM && LOCAL_URI_BITLY)
describe        LOCAL_HOTMAIL_SPAM_URI  From hotmail.com and bit.ly

I've been training these hotmail with links spam for months, and they all score BAYES_99 for me.

Reply via email to