On Sat, 30 Oct 2004, Tuc at Beach House wrote:

> > > himinbjorg% sa-learn --dump magic
> > > 0.000          0          3          0  non-token data: bayes db version
> > > 0.000          0        175          0  non-token data: nspam
> > > 0.000          0      73501          0  non-token data: nham
> > > 0.000          0    1027341          0  non-token data: ntokens
> > 
> > I think 175 spam messages is not nearly enough for Bayes to be
> > adequately trained.  Also, the ratio of ham to spam (~0.3%) looks a
> > bit odd.  If it reflects roughly equal time periods over which you
> > received the messages, it suggests you might be missing (or perhaps
> > misclassifying) some of your spam.
> > 
> But shouldn't it have carried my database over from my previous install? 
> I'd been using it for atleast 6 months on different versions before this
> upgrade.  Did it 'forget' it all?  Do I need to totally retrain it?  I'm
> using a stock install from FreeBSD ports, no local/global overrides.

Looks like somebody didn't read the UPGRADE doc...

 Due to the database format change, you will want to do something like
  this when upgrading:

  - stop running spamassassin/spamd (ie: you don't want it to be running
    during the upgrade)
  - run "sa-learn --rebuild", this will sync your journal.  if you skip
    this step, any data from the journal will be lost when the DB is
    upgraded.
  - upgrade SA to 3.0.0
  - run "sa-learn --sync", which will cause the db format to be upgraded.
    if you want to see what is going on, you can add the "-D" option.
  - test the new database by running some sample mails through
    SpamAssassin, and/or at least running "sa-learn --dump" to make sure
    the data looks valid.
  - start running spamassassin/spamd again
 
. . . . . . . . . . . . . . .
Randomly generated quote:
"My great concern is not whether you have failed, but whether you
are content with your failure." - Abraham Lincoln

Reply via email to