[Skip Montanaro]
> It might be useful to codify some of these ideas into a tool 
> the user can run to reduce training dataset sizes without 
> necessarily committing to the train-to-exhaustion concept.

I think it would be a good idea if we had a spambayes.training module that
contained various training code like this, code to do tte/nonedge/etc and so
forth.  contrib/tte.py (and maybe other new contrib/ or utilties/ scripts)
could just be the getopt stuff and then a few lines of code calling the
appropriate functions in spambayes.training, and the other scripts could
make use of the same code (I'd also like to have Outlook, sb_server and
sb_imapfilter to have a slightly higher abstraction for training to allow
for flexibility in what's done).

=Tony.Meyer

_______________________________________________
spambayes-dev mailing list
spambayes-dev@python.org
http://mail.python.org/mailman/listinfo/spambayes-dev

Reply via email to