On 05/08/2013 08:15 PM, Bob Proulx wrote:
Andrew Talbot wrote:
Well, I certainly hope someone offers to help!
If only to say "there is no default database."
There is no default database. :-)
As we've spoken about off-list, my boss is being very particular about the
deployment of Bayes, and it sounds like one of his caveats is that we don't
start from a blank database.
But you do start from a blank database. That is why it needs to learn
from a significant number of spam and non-spam, 200 each, before it
will produce meaningful results.
For the record, I agree with your logic completely ...... And I hate to say
stupid things like this, but it doesn't even matter to me if the tokens in
the default database are useless at this point, or if there are only 20 of
them. I just need to get this deployed so it can start learning.
If you deploy it then it will immediately start learning. Learning is
not limited by having an empty database. Only classification is
limited until learning has occurred.
and you have extra switches to learn only, etc
# master switch
use_bayes 1
# see docs
bayes_auto_learn 1
use_learner 1
# score switch
use_bayes_rules 1