OK, based on what little discussion there's been so far, here's a
draft proposal for people to think about.

Summary: A group of volunteers will maintain a collected/distributed
whitelist, using SpamAssassin's whitelist_from_rcvd capabilities,
similar to (but in the opposite direction as) William Stearns'
collected/distributed blacklist at
http://www.stearns.org/sa-blacklist/sa-blacklist.current.cf

Goal: There are public newsletters, services, etc., which a) do not
spam, and b) can easily be mistaken as spam by SpamAssassin for a
variety of reasons (overly aggressive custom rules, wrongly taught
Bayes system, paid advertising listing SURBL URIs, etc). Though
anti-spam devotees see these as opportunities for cleaning up our
system, for the purposes of email reliability we want these emails to
go through unhindered. 

Assumption: This activity will focus only on public newsletters,
services, etc., which normally do not contain any private information.
Therefore there will not be any privacy or confidentiality concerns
for the great majority of emails from these sources.

Distribution:  The rules file which results from this activity will be
maintained within the SARE system, as file 70_sare_whitelist.cf -- it
can be downloaded manually or via RDJ.

Rules: Since these rules are gathered by the community at large,
rather than use the whitelist_from_rcvd rule which normally scores
-100, we will use the def_whitelist_from_rcvd rule, which scores only
-15. Any site that wishes can copy any of these rules to a
def_whitelist_from rule to gain the full -100 points.

[Question to the devs: would you agree this is a valid use for the
def_whitelist_from_rcvd rule?]

Adoption to SA Core: We expect that some/most of these rules will be
adopted into the SpamAssassin core distribution when new releases are
issued. At such times, rules adopted into SA will be migrated to a
stable file named 70_sare_whitelist_pre_vvvvvv.cf (eg:
70_sare_whitelist_pre_030100 for SA 3.1.0). This file will be
announced on the SA-Users list, and can be retrieved once into any/all
systems. That file will not change. These rules will be deleted from
the primary 70_sare_whitelist.cf file concurrent with the official
release of the new SA release. Systems that migrate to a new SA
release would then be expected to delete all "pre" files, though there
wouldn't be any harm other than a minimal performance hit if they do
not.

Contributions: Anyone within the SA community can submit a single
whitelist rule for inclusion into the active rules file, and/or submit
a qualifying email (with full headers), from which the whitelist team
will create a rule. Submissions with a full qualifying email are
preferred. Submissions should be sent to [EMAIL PROTECTED]

[Submissions can be sent to that address immediately -- something will
be done with them, whether following this proposal or not.]

Process: At least at first, we expect many/most of these submissions
to be obvious ham sources (such as NYTimes.com would be). Obvious ham
sources will be agreed upon unanimously by the whitelist team, and
added to the whitelist. 

Any non-obvious ham source, any that has a majority agreement within
the whitelist team but not unanimous agreement, will be examined in
detail by one of the team. The results of that examination will be
reviewed by the whitelist team, and any that now receives unanimous
agreement will be added to the rules file.

Any submission which does not obtain unanimous agreement will not be
added to the rules file.

Any submission which already matches a def_whitelist_from_rcvd rule
within the SA distribution will be identified and ignored (after
response back to the submitter). (We are not going to try to develop
pre-3.0.0 files.)

Team members: Anyone within the SpamAssassin community who has
submitted a CLA to Apache, and who is accepted unanimously by the
whitelist team, can join the team.

[The CLA requirement is to ensure that rules can be adopted into the
SA without any licensing concerns.]

- - - - - - -

Can anyone think of any guidelines to be added or changed?

Any volunteers willing to help out?

Bob Menschel



Reply via email to