Ah. Perhaps the thing to do is to build a starting database by training on a corpus of ham and spam messages that you know your clients would agree that you've classified correctly. You can then install that database on their machines when you install SpamBayes. See http://spambayes.sourceforge.net/faq.html#can-i-share-move-my-training-d ata-from-one-computer-to-another.
-----Original Message----- From: Luca Benassi [mailto:[EMAIL PROTECTED] Sent: Thursday, March 08, 2007 8:22 AM To: Jesse Pelton Cc: [email protected] Subject: Re: [Spambayes] interacting with dbm Jesse Pelton ha scritto: > As I understand it, the thing to do is to train on messages that contain > those words. That is, if you receive a message that contains those > words and SpamBayes classifies it as spam or possible spam, tell > SpamBayes that it's actually ham. Alternatively, you could probably > parse through the code to understand the database design and hack up > something to add some special tokens to the database. Heck, you could > change the code itself to give special treatment to your whitelist > tokens. The source is readily available. > > I don't recommend any of the above, though. If you somehow receive a > spam message with one of your whitelist words in it, the first approach > would require you to mislead SpamBayes about what you consider to be > spam, and the others seem very error-prone. > > Lots of people have tried to find ways to make SpamBayes smarter. As > near as I can tell, no one has found a way to make it consistently > perform better in a long time. Does it matter that it's dumb if it > gives good results? thx for your reply ... you're right ;) however, what I'd like to try is to pre-train spambayes, before (or, *while*) installing to some clients. > -----Original Message----- > From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] > On Behalf Of Luca Benassi > Sent: Thursday, March 08, 2007 6:42 AM > To: [email protected] > Subject: [Spambayes] interacting with dbm > > I've serched a bit in doc, faq and old messagges but ... no luck :] > > This is my question: can I populate the dbm with my list of *safe* > (good) words? I'd like to, sort of, whitelist a group of words. > > Thx in advice, > Luca > -- Dr. Luca Benassi Laboratori Guglielmo Marconi Via Porrettana 123, 40037 Pontecchio Marconi (BO) - ITALY Phone:+39-0516781934 Fax:+39-051846479 e-mail: [EMAIL PROTECTED] Systems & Networks Division _______________________________________________ [email protected] http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html
