Does anyone have a regular expression to match the URI in Googlepages and
livefilestore spams that have been coming through the last little while.
I've been trying to come up with a local rule to bump up their scores but
apparently my regular expression skills suck because I haven't been able to
mash any of my existing rules to match this stuff :-)

http:++randomcrap.googlepages.com
http:++randomcrap.googlepages.com/index.html
http:++randomcrap.bay.livefilestore.com/randomcrap/randomcrap.html

Those seem to be the formats I've been seeing here lately.  (I've replaced
the initial // with ++ above to hopefully avoid confusing anybody's rules...
and "randomcrap" refers to the string of random alphanumeric crap in the
URI.

This is off the top of my head and is UNTESTED, so there may be errors. But they should be close.

uri GOOGLEPAGES /http://[~/]*\.googlepages\.com/i
uri LIVEFILESTORE /http://[~/]*\.bay\.livefilestore\.com/i

Those should hit on the patterns you gave. Add score lines and they will probably help. Of course, they may hit on valid stuff too, assuming googlepages or livefilestore are every valid in the mail you get. So I wouldn't be inclined to score them over 2 or 3, at least to start with.

       Loren


Reply via email to