On Wed, Feb 04, 2004 at 02:21:56PM -0500, Michael Faurot wrote: > In article <[EMAIL PROTECTED]> you wrote: > > > I'm running an MD/SA gateway for a customer where mail is scanned, > > tagged, and forwarded directly to their servers (nothing is stored > > locally), but I need to train SpamAssassin and beef up its bayes db. > > How do people typically gather ham and spam to train the box under these > > conditions? Is it possible to do it without too much intervention on > > the customer's part? > > Yes, just use the bayes_auto_learn option in SA. If there's a good amount > of traffic going through the box, it should build up a corpus fairly > quickly. You may also want to tweak bayes_auto_learn_threshold_nonspam > and bayes_auto_learn_threshold_spam if you don't like the defaults. > I wound up leaving bayes_auto_learn_threshold_nonspam at its default > but adjusted bayes_auto_learn_threshold_spam to 8.0.
You may also want to look at the following bug in SA's bugzilla. We've got patches maintained to 2.61 right now, expect to have patches to 2.63, etc. Working with the devs to get the system merged into the dist tree. http://bugzilla.spamassassin.org/show_bug.cgi?id=2167 If you have any questions, I'll be happy to answer them off-list. -- Kelsey Cummings - [EMAIL PROTECTED] sonic.net, inc. System Administrator 2260 Apollo Way 707.522.1000 (Voice) Santa Rosa, CA 95407 707.547.2199 (Fax) http://www.sonic.net/ Fingerprint = D5F9 667F 5D32 7347 0B79 8DB7 2B42 86B6 4E2C 3896 _______________________________________________ Visit http://www.mimedefang.org and http://www.canit.ca MIMEDefang mailing list [EMAIL PROTECTED] http://lists.roaringpenguin.com/mailman/listinfo/mimedefang

