Re: Is BAYES filtering working? Having doubts.

Bill Cole Wed, 30 Dec 2015 16:13:19 -0800

On 30 Dec 2015, at 8:37, RW wrote:

On Tue, 29 Dec 2015 20:41:31 -0500
Bill Cole wrote:

On 29 Dec 2015, at 20:02, Ian Zimmerman wrote:

esired result.


Clearly you can do the su magic if needed.


Um, no.

Neither su nor sudo magically changes the permissions or ownership of
files.



No, but sudo allows sa-learn to run as the user that owns the database
and a group that can read the mailstore.

Yes, and that can be useful if you use virtual users instead of real ones for the mailstore OR use inherited ACLs to give a special group read access to it. If you use real users, you likely cannot and almost certainly should not make the whole mailstore owned by one group with read access.

And since it seems I muddled my point(s) again in the referenced message, I'll try restating one last time:

1. The sa-learn process for a system-wide BayesDB should not run as root (because root should only handle suspect message data in the simplest of ways, not attempting to parse it in any way.)

2. If you pass filenames as arguments to sa-learn, they must be files that can be read by the user sa-learn runs as (in this case: "amavis".)

3. Maildir message files are normally only readable by either a real user they are delivered to (if the mail system uses real OS users) or a single low-privilege user whose only role is mailstore handling (if it uses "virtual users" known only to mail and SASL facilities.) In either case, it is often unsafe and/or inconvenient to use group ownership and permissions to enable the system-wide BayesDB owner to read Maildir message files directly. Inherited ACLs are a safer and more robust approach, but not all filesystems support them.


Combining those:

A site with a system-wide BayesDB managed by the 'amavis' user and a mailstore using Maildirs in which files are NOT generally readable by that user can do one of the following to orchestrate re-training of mis-classified messages or other post-delivery training with human-judged messages:

A. Copy files from the Maildirs to a secured temporary directory, make those copies readable by the amavis user, run sa-learn as amavis with the directory as an argument, and clean up afterwards.

B. Pipe each message independently into a sa-learn process running as amavis

C. Run a spamd as amavis and feed messages to it via spamc
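For what it's worth, option (A) could be sketched roughly as follows. This is only an illustration under assumptions: the function names are made up for the example, the sudo invocation is one possible way to become amavis, and the chown step needs root and an existing "amavis" account. Nothing here parses message data; it only copies bytes.

```python
import os
import shutil
import tempfile
from pathlib import Path


def stage_for_learning(message_paths, staging_parent=None):
    """Copy messages into a fresh private directory (mode 0700) and return it."""
    staging = Path(tempfile.mkdtemp(prefix="sa-learn-", dir=staging_parent))
    for i, src in enumerate(message_paths):
        # Use neutral names so Maildir flags in the original names don't matter.
        dst = staging / f"{i:06d}.eml"
        shutil.copyfile(src, dst)
        os.chmod(dst, 0o400)  # read-only for the owner of the copy
    return staging


def hand_over(staging, user="amavis"):
    """Give the staged copies to the learning user (requires root)."""
    staging = Path(staging)
    shutil.chown(staging, user=user)
    for entry in staging.iterdir():
        shutil.chown(entry, user=user)


def learn_command(staging, kind="spam", user="amavis"):
    """Build (not run) the command that learns the whole directory as `user`."""
    if kind not in ("spam", "ham"):
        raise ValueError("kind must be 'spam' or 'ham'")
    return ["sudo", "-u", user, "sa-learn", f"--{kind}", str(staging)]
```

A driver would call stage_for_learning(), then hand_over(), run the list from learn_command() with subprocess.run(), and finally shutil.rmtree() the staging directory.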

Since these all involve running a learning process as amavis and passing it data from files amavis can't read, most people will choose to drive the overall process as root, becoming other users as needed. Arguably (C) is potentially the safest because you can have the spamd started by the amavis user listening only on a socket and have the spamc processes run by whatever users own the various message files. It's also the most complex (i.e. error-prone) and resource-heavy (one process launch per message, plus one for spamd), and many people won't understand that root shouldn't run the spamc processes, so it is maybe not the best thing to recommend, especially on systems using real users for mail and/or with high launch costs. The other two aren't concretely unsafe to do as root (as root wouldn't be parsing message data) but would violate a dogmatic law against root handling message data, if you have that in your catechism.
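To make option (B) concrete, here is a minimal sketch of a driver that opens each message itself and hands the open file to a learner process as its standard input. It assumes sa-learn reads a single message from stdin when given no file arguments and that sudo is how the driver becomes amavis; the function names are invented for the example. The driver never parses the message, it only passes the file descriptor along.

```python
import subprocess


def learner_command(kind="spam", user="amavis"):
    """Build the per-message learner command, run as `user` via sudo."""
    if kind not in ("spam", "ham"):
        raise ValueError("kind must be 'spam' or 'ham'")
    return ["sudo", "-u", user, "sa-learn", f"--{kind}"]


def pipe_message(path, command):
    """Open `path` as the calling user and feed it to `command` on stdin."""
    with open(path, "rb") as fh:
        # The child inherits the already-open file, so it never needs
        # filesystem permission to read the Maildir itself.
        return subprocess.run(command, stdin=fh, capture_output=True, check=False)


def pipe_all(paths, command):
    """Run one learner process per message; return the paths that failed."""
    return [p for p in paths if pipe_message(p, command).returncode != 0]
```

Usage would be along the lines of pipe_all(list_of_message_files, learner_command("spam")), at the cost of one process launch per message.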

The more I think about it, the more I like the ACL approach over any of those 3: just give the BayesDB owner inherited read permissions for all Maildirs and have it run sa-learn with the target files (or -f filelist) as args. If the Maildirs aren't on an ACL-capable filesystem, maybe that was a bad choice which merits correcting.
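A rough sketch of the ACL approach, assuming Linux-style POSIX ACL tools (setfacl) and a hypothetical mailstore path supplied by the caller. The helper names are made up for illustration, and the commands are only built here, not executed. Whether inherited entries are actually effective on newly delivered files can depend on the mode the delivery agent creates them with (the ACL mask), so check with getfacl on a freshly delivered message.

```python
def acl_commands(maildir_root, user="amavis"):
    """Commands granting `user` read access now and by inheritance."""
    entry = f"u:{user}:rX"  # X: search permission on directories only
    return [
        # Access ACL for everything that already exists.
        ["setfacl", "-R", "-m", entry, str(maildir_root)],
        # Default ACL on directories, inherited by newly created files.
        ["setfacl", "-R", "-d", "-m", entry, str(maildir_root)],
    ]


def write_filelist(paths, filelist):
    """Write one message path per line for use with `sa-learn -f`."""
    with open(filelist, "w", encoding="utf-8") as fh:
        for p in paths:
            fh.write(f"{p}\n")


def learn_from_filelist(filelist, kind="spam"):
    """Command to be run as the BayesDB owner itself; no root involved."""
    if kind not in ("spam", "ham"):
        raise ValueError("kind must be 'spam' or 'ham'")
    return ["sa-learn", f"--{kind}", "-f", str(filelist)]
```

The setfacl commands are a one-time setup step; after that the amavis user can run the sa-learn command on its own, for example from its crontab.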
