http://bugzilla.spamassassin.org/show_bug.cgi?id=4095
------- Additional Comments From [EMAIL PROTECTED] 2005-01-22 22:03 ------- Subject: Re: Using Bayesian Filters to score rules [EMAIL PROTECTED] wrote: >http://bugzilla.spamassassin.org/show_bug.cgi?id=4095 > > > > > >------- Additional Comments From [EMAIL PROTECTED] 2005-01-22 21:12 ------- >Subject: Re: New: Ising Bayesian Filters to score rules > > >1. How do you decide when the system is sufficiently trained to switch >over to scoring this way? > > What I envision is that rules are listed as white or black initially. No scores are assigned. Then the initial message are tested and they will return a list of rules triggered. If the message triggers say 3/4 black rules or more then it is learned as spam. If it triggers 3/4 white rules - then it is learned as ham. Once the learning process begins then the rules themselves develop scores just like bayesian tokens develop scores. As the system learns the scoring takes cate of itself. Messages end up with a number between 0 an 1 and then you just have to figure out where you want to call it spam and what to do with it. >2. What do you use for scoring until the switchover? > > See above. >3. This implies that you have rules for most all of the things that are >interesting spam (and ham) signs. This is a lot of work maintaining rules, >which currently isn't as mandatory with the current Bayes setup. > > We still use the same rules we have now - we just don't need to score them. If the rule is triggered then the name of the rule goes into the bayesian filter. >4. If you don't have a rule for something then you can't score it. >Currently Bayes effectively generates the equivalent of its own rule for >that new spam sign. > > > > > Yes - bayes generates its own scores off of tokens. And we still have bayesian filters that look at the message. But what I'm saying is that we also habe another bayesian filter that is fed only a list of rules that it triggered and it does the final scoring. And - some of those rules in the list include the results of other bayesian filters looking at the message. I hope I'm not losing everyone in this concept. It's really hard to get the big picture into words. ------- You are receiving this mail because: ------- You are the assignee for the bug, or are watching the assignee.
