On Saturday, September 18, 2004, 9:07:55 PM, Matt wrote:

M> John,

M> If you read this more carefully, I was not suggesting that
M> action betaken that would affect everyone's system in such a way
M> that it wouldrequire modifications.� The 60 result code was
M> recently changed fromGray rules to IP rules, and that change may or
M> may not suggest amodification to the standard way that Sniffer
M> operates (consideringthat the environment will only return one
M> result code).� Sniffer may ormay not follow the numerical ordering
M> of the result codes at present,but then again, it might.�
M> Regardless, it wouldn't be a bad idea toreview the precedence as a
M> part of ongoing due diligence.� I alsorecommended one potential

I agree it's not a bad idea to review these things from time to time,
and in fact we do quite frequently - though not publicly.

I also agree that making any sweeping changes would probably be a
mistake at this time.

Well guys, here is how it goes.

When more than one rule matches, the one with the lowest symbol #
wins. If there is more than one match within that symbol then the one
that is earliest in the message wins.

This is why we code white rules to symbol 0, or symbol 1 in some
cases; and also why we generally reserve the lower numbered symbols
for any specific user requests.

As much as possible we've ordered the rule groups so that the least
specific rules are found in the higher numbers and the more specific
rules are in the lower numbers.

We even have some rules (work in progress) that are "above band" in
the 65-255 range which have special meanings and functions. These will
become more important later as these features are further developed.

There are a lot of schemes out there that can be used, and in fact we
can use an entirely different scheme for each user if we wish - though
that might be a lot of work (so we might have to charge extra for the
consulting time to develop and maintain such a thing).

The scheme that we have is a little bit out of date*, but it still
seems to work for most folks, so we'll probably keep it around for a
while. We've had a number of alternate schemes suggested, some that
might even be practical to implement - but none that wouldn't cause
quite a bit of upheaval if we suddenly decided to rework everything
for our current users.

In fact, there are only a hand full of people who ever even mention it.

Since your list shows 60, 63, 62, and 61 all at the bottom of your
list I'm guessing that the current voting scheme is probably in line
with your priorities at this point. That is, more specific rules (by
symbol #) seem to line up roughly with your estimate of accuracy.

Hope this helps,
_M

* Little out of date: Spammers almost always reuse URI and numbered
links on multiple campaigns these days. This wasn't the case so much
when we began. One result of this shift is that it is now common to
find Snake-Oil spam matching a porn rule & vice versa. In fact, the
actual kind of spam probably matches the rule group less than 31.6% of
the time (and of course 94% of statistics are made up on the spot -
which means, 1/3 is a guess on my part from looking at spam all day).

We've kept the scheme, however, because there are many rules that we
create which are not based on URI and these tend to remain accurate to
the type of spam. Also, since we generate and review our rules largely
through a manual process - it helps to know what kind of spam we were
looking at when we created the rule. That is, we are less likely to
err while looking at a porn/adult spam than we are when looking at a
travel spam - so differences in our accuracy are likely to develop
along the groups we've selected - even if the type of spam captured by
the rule migrates over time.




This E-Mail came from the Message Sniffer mailing list. For information and 
(un)subscription instructions go to 
http://www.sortmonster.com/MessageSniffer/Help/Help.html

Reply via email to