Re: Is BAYES filtering working? Having doubts.

Bill Cole Tue, 29 Dec 2015 17:41:56 -0800

On 29 Dec 2015, at 20:02, Ian Zimmerman wrote:

On 2015-12-29 19:44 -0500, Bill Cole wrote:
On 29 Dec 2015, at 18:54, Ian Zimmerman wrote:
In fact sa-learn accepts multiple named arguments on the commandline,so the alternative I use is to go through the spambox N files at atime
in a shell loop.  (I have N=100 but obviously this depends.)
Which successfully ignores the original issue of this threadcompletely: that theuser sa-learn must run as cannot read the files being learnt. If youpass unreadablefilenames as arguments, sa-learn just whines and fails. Shockingly,that is not the
desired result.
Clearly you can do the su magic if needed.


Um, no.

Neither su nor sudo magically changes the permissions or ownership offiles. If you pass filenames as arguments they must be readable by theuser actually running sa-learn, which is the *unprivileged* userhandling the system-wide BayesDB ("amavis" in the case originating thisthread, but "spamd" and "defang" are other common ones...) In mostreasonably well-secured systems using Maildir message stores, theMaildirs are all owned by individual users or by one user that handlesdelivery to "virtual users" understood by the MTA and IMAP or POP serverby not by the OS. That is generally NOT the same user running spamd orcontent filters for a system-wide BayesDB. As a result, relearning hasto be done as root, shuttling data from files owned by one user into aprocess running as another.

The point is that the
overhead which you fear is reduced N times.

And since the sa-learn processes can't read the files it is given asarguments, they run with blinding speed, skipping all that costlyparsing and learning stuff...

Re: Is BAYES filtering working? Having doubts.

Reply via email to