Dhaval> I have been keeping an eye on the db file and notice that after
    Dhaval> a training, the timestamp is updated but the filesize is the
    Dhaval> same. Is this normal?

Yes.  The database file isn't a simple text file.  There are lots of "holes"
in the file to tuck new tokens, and for existing tokens all that happens
most of the time is that the count for the token increases.

    Dhaval> Does anybody have any advice on what to look for?

Try running sb_dbexpimp.py before and after training, then compare the
output:

    sb_dbexpimp.py -e -f bayes1.csv
    sb_mboxtrain.py ...
    sb_dbexpimp.py -e -f bayes2.csv
    diff -u bayes1.csv bayes2.csv | more

Skip
_______________________________________________
[email protected]
http://mail.python.org/mailman/listinfo/spambayes
Check the FAQ before asking: http://spambayes.sf.net/faq.html

Reply via email to