Hello ddv,

Thursday, July 8, 2004, 11:36:25 AM, you wrote:

dhcz> Thanks for the reponse Loren.  Glad to know I wasn't just
dhcz> missing something obvious :)

You weren't, and the quick work-around, until SARE gets our act
together, is to rescore those rules in your local.cf -- either set one
of the rules to zero, or halve both of them. I'd suggest the former.

dhcz> Can I ask what a further 2 things mean:
dhcz> - What is a 4-part ruleset?

I'm responding, since I'm the one who created the first 4-part
rulesets.  I assume you're talking about 70_sare_genlsubj0.cf through
...3.cf (also header and html).

File 0 (70_sare_genlsubj0.cf, 70_sare_header0.cf, and
70_sare_html0.cf) contains those rules which hit spam and only spam
(no ham), and which hit significant spam (well, more than 10 among
several corpora). Those are rules which we feel confident every system
will benefit from (well, except that you may want to turn off some
rules in genlsubj0.cf if your business is specific enough that
everyone else's spam might be your ham).

File 1 contains rules which either hit fewer than 10 spam, or which
also hit ham. Primary qualification is that our S/O (total spam count
/ total email count) must be 0.900 or higher. Probably also good for
most sites, but not as conservative as file 0.

File 2 contains rules which should hit spam and only spam, but recent
mass-checks had no matches. Many of these are obfuscation rules.
Aggressive systems with more than sufficient resources may benefit
from these rules eventually (some will hit spam some day), but those
systems with tight resources should definitely stay away from these.

File 3 contains rules which hit lots of ham. They are scored low, and
might help push otherwise questionable spam over the SA threshold.
Very conservative systems, and those that are tight on resources,
should probably avoid these files.

dhcz> - What is the official SARE ruleset (ie. how do I tell if
dhcz> something is official or not)?

All actively maintained SARE rule set files are named 70_sare_xxxxx.cf
or something very similar. Some older files haven't yet been renamed
to this convention, but the more active files are.

You can also go to http://www.rulesemporium.com/rules.htm -- rules
listed on that page are official SARE rules files. They are named in
the box that has "redir" on top, and "highrisk" at bottom.

Those rules which are on other pages, such as the "incomplete rules",
"other rules", and "archived rules" are not official SARE rules.

Hope this helps.

Bob Menschel


Reply via email to