Tony Meyer wrote: > Update of /cvsroot/spambayes/spambayes/Outlook2000 > In directory > sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv11996/Outlook2000 > > Modified Files: > filter.py manager.py msgstore.py > Log Message: > It makes life much simpler if the classification strings match the > non-Outlook ones. > > Thresholds are 0-100, cutoffs are 0.0-1.0 - need to convert between > them, or everything is spam! > > Index: filter.py > ==================================================================> RCS file: > /cvsroot/spambayes/spambayes/Outlook2000/filter.py,v > retrieving revision 1.43 > retrieving revision 1.44 > diff -C2 -d -r1.43 -r1.44 > *** filter.py 22 Dec 2004 00:30:26 -0000 1.43 > --- filter.py 22 Dec 2004 01:22:00 -0000 1.44 > *************** > *** 17,29 **** > disposition = "Yes" > attr_prefix = "spam" > ! msg.c = mgr.bayes_options["Headers", "header_spam_string"] > elif prob_perc >= config.unsure_threshold: > disposition = "Unsure" > attr_prefix = "unsure" > ! msg.c = mgr.bayes_options["Headers", "header_unsure_string"] > else: > disposition = "No" > attr_prefix = "ham" > ! msg.c = mgr.bayes_options["Headers", "header_ham_string"] > > ms = mgr.message_store > --- 17,30 ---- > disposition = "Yes" > attr_prefix = "spam" > ! msg.c = mgr.bayes_options["Headers", "header_spam_string"][0] > elif prob_perc >> config.unsure_threshold: disposition = > "Unsure" > attr_prefix = "unsure" > ! msg.c = mgr.bayes_options["Headers", "header_unsure_string"][0] > else: > disposition = "No" > attr_prefix = "ham" > ! msg.c = mgr.bayes_options["Headers", "header_ham_string"][0] > ! mgr.classifier_data.message_db.store_msg(msg) > > ms = mgr.message_store
I believe the non-Outlook versions store fixed values of 's', 'h', or 'u' in the msg.c field. See the RememberClassification function in spambayes.Message which contains the comment: """ # this must store state independent of options settings, as they # may change, which would really screw this database up """ Using only the first character of the configured header strings could be especially bad if the user configured the strings to something like "***SPAM***", "***GOOD***", and "***UNSURE***". -- Kenny Pitt _______________________________________________ spambayes-dev mailing list [EMAIL PROTECTED] http://mail.python.org/mailman/listinfo/spambayes-dev
