In message: <[EMAIL PROTECTED]>
"Michael Harris" <[EMAIL PROTECTED]> writes:
>I am trying to write a program for my 3rd year project at university which
>will attempt to find the best configuration of spambayes for a user
>given his spam/ham corpus.
>
>I was just wondering if anyone could tell me, firstly, how I can train
>spambayes from the command line on mbox files. And secondly, I need a way of
>querying spambayes for the 'spamminess' probability it would assign to an
>individual email, given the training it has received.
For command-line operations of spambayes, look at sb_filter; it
provides the ability to train, untrain, and evaluate messages.
However, for large-scale evaluation of training strategies, I suggest
reading through TESTING.txt and testtools/{timcv.py,table.py,*.txt}.
Doing it all through the command line will be much, much slower than
using one of the test scripts, which embed a classifier and iterate
over all the messages for training and testing in a single process
(so the startup cost is paid once rather than for each operation).
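
To make the "embed a classifier and iterate" idea concrete, here is a
minimal sketch of the pattern. It is not the spambayes code: ToyClassifier
is a hypothetical stand-in (a plain smoothed naive Bayes scorer), and the
tokenize, train_mbox and score_mbox helpers are names invented for this
example. The learn/spamprob method names are modelled on what I believe the
spambayes classifier exposes; check classifier.py before relying on that.
Only the standard-library mailbox module is used to read the mbox files.

```python
import mailbox
import math
import re
from collections import Counter


class ToyClassifier:
    """Hypothetical stand-in classifier; NOT the spambayes algorithm."""

    def __init__(self):
        self.spam_counts = Counter()  # token -> number of spam messages containing it
        self.ham_counts = Counter()   # token -> number of ham messages containing it
        self.nspam = 0
        self.nham = 0

    def learn(self, tokens, is_spam):
        # Count each token at most once per message.
        counts = self.spam_counts if is_spam else self.ham_counts
        counts.update(set(tokens))
        if is_spam:
            self.nspam += 1
        else:
            self.nham += 1

    def spamprob(self, tokens):
        # Laplace-smoothed naive Bayes log-odds, squashed into [0, 1].
        log_odds = math.log((self.nspam + 1) / (self.nham + 1))
        for tok in set(tokens):
            p_spam = (self.spam_counts[tok] + 1) / (self.nspam + 2)
            p_ham = (self.ham_counts[tok] + 1) / (self.nham + 2)
            log_odds += math.log(p_spam / p_ham)
        # Clamp so math.exp cannot overflow on long messages.
        log_odds = max(-50.0, min(50.0, log_odds))
        return 1.0 / (1.0 + math.exp(-log_odds))


def tokenize(msg):
    """Crude tokenizer for illustration: subject plus decoded body, lowercased."""
    payload = msg.get_payload(decode=True) or b""  # None for multipart messages
    text = str(msg.get("Subject", "")) + " " + payload.decode("latin-1")
    return re.findall(r"[a-z0-9$!']+", text.lower())


def train_mbox(classifier, path, is_spam):
    """Train on every message in one mbox, inside the current process."""
    for msg in mailbox.mbox(path, create=False):
        classifier.learn(tokenize(msg), is_spam)


def score_mbox(classifier, path):
    """Return one spam probability per message in the mbox."""
    return [classifier.spamprob(tokenize(msg))
            for msg in mailbox.mbox(path, create=False)]
```

With something like this, one process can call train_mbox on the spam and
ham mboxes and then score_mbox on a held-out mbox, repeating for each
configuration under test. Swapping ToyClassifier and tokenize for the real
spambayes classifier and tokenizer is the part you would need to verify
against the spambayes source.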
- Alex