Re: Clear Database Question

2009-12-04 Thread Matus UHLAR - fantomas
On 03.12.09 20:58, Jason Carson wrote:
 Is it necessary to clear the database...
 
 sa-learn --clear
 
 ...before I run the following to train SpamAssassin's bayesian classifier...
 
 sa-learn --spam /home/jason/.maildir/.Spam/cur/

no, and don't do that unless you believe your database is really broken.

Also remember to train enough of ham - bayes DB will help you DIFFER spam
and ham, therefore it needs to know how the ham looks like.

-- 
Matus UHLAR - fantomas, uh...@fantomas.sk ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu.
- Have you got anything without Spam in it?
- Well, there's Spam egg sausage and Spam, that's not got much Spam in it.


Re: Clear Database Question

2009-12-04 Thread Matt Kettler
Jason Carson wrote:
 Hello everyone,

 Is it necessary to clear the database...

 sa-learn --clear

 ...before I run the following to train SpamAssassin's bayesian classifier...

 sa-learn --spam /home/jason/.maildir/.Spam/cur/

   
No. That would be ill advised.

Running --clear deletes your entire bayes database, which can take a
long time to recover from. I would only advise using it if you've
decided all your previous training is worthless, or your database
becomes corrupted.

Also be sure to consider that once you clear the database SA will
deactivate bayes until 200 spam and 200 nonspam messages get trained.

SpamAssassin will automatically make room when it needs to by pushing
out the least popular tokens through the expire process (which you can
manually trigger via the sa-learn --force-expire command, but it
normally checks during message processing twice a day)







Clear Database Question

2009-12-03 Thread Jason Carson
Hello everyone,

Is it necessary to clear the database...

sa-learn --clear

...before I run the following to train SpamAssassin's bayesian classifier...

sa-learn --spam /home/jason/.maildir/.Spam/cur/