Does anyone have a regular expression to match the URI in Googlepages and
livefilestore spams that have been coming through the last little while.
I've been trying to come up with a local rule to bump up their scores but
apparently my regular expression skills suck because I haven't been able
to
mash any of my existing rules to match this stuff :-)
http:++randomcrap.googlepages.com
http:++randomcrap.googlepages.com/index.html
http:++randomcrap.bay.livefilestore.com/randomcrap/randomcrap.html
Those seem to be the formats I've been seeing here lately. (I've replaced
the initial // with ++ above to hopefully avoid confusing anybody's
rules...
and "randomcrap" refers to the string of random alphanumeric crap in the
URI.
This is off the top of my head and is UNTESTED, so there may be errors. But
they should be close.
uri GOOGLEPAGES /http://[~/]*\.googlepages\.com/i
uri LIVEFILESTORE /http://[~/]*\.bay\.livefilestore\.com/i
Those should hit on the patterns you gave. Add score lines and they will
probably help. Of course, they may hit on valid stuff too, assuming
googlepages or livefilestore are every valid in the mail you get. So I
wouldn't be inclined to score them over 2 or 3, at least to start with.
Loren