Hello Matias,

Thursday, February 10, 2005, 10:29:08 AM, you wrote:

MLB> Hi,
MLB> I'm relatively new using SA and I have a couple of doubts about the
MLB> bayes db and the awl db.
MLB> I'm running a 3.0.2 site wide install, and I have saw that for each user
MLB> there is an .spamassassin directory, storing Bayesian and awl databases
MLB> a part from the user preferences file.
MLB> The bayes and awl db are working only for the user who owns them right?
MLB> What about the bayes data created with the sa-learn command?
MLB> I'm being training spamassassin Bayesian filter since the installation
MLB> of SA.

It sounds to me like you're training a central Bayes database, while the users are using their individual databases. Individual databases are better (non-spam to the business department may look like spam to the bio-chemistry teachers, and vice versa), though a central Bayes system would also work. Either way, you need to pick one method or the other: teaching a central database that the users aren't using will do no good.

You should be able to use the command

> sa-learn -u $username ...

to direct the teaching for each user to his/her database.

MLB> The AWL looks like it's activated by default, that could may
MLB> cause some problems with the scoring and mark spam as ham and
MLB> vise versa? and also could it set wrong scores into the AWL db?

Read up on AWL on the wiki. It doesn't store future scores in a database; it just records what past scores have been, so it can average things out going forward. Yes, its purpose is to bring scores down or up, and that adjustment can move a message across the spam threshold.

If a spam is seen from spammer A with a score of 3, and the next spam gets a score of 6, AWL will bring the score of that second spam down to 4.5. If the next spam is also at 6, AWL will bring the score of that third spam down to 5.25 (this one will be marked as spam). If the next spam (#4) scores at 4.75, AWL will bring that score up to 5.0. Etc.

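As a rough illustration of the arithmetic in the example above, here is a minimal Python sketch. It models only the averaging as described in this message (each new score is averaged with the sender's previous adjusted score); it is not SpamAssassin's actual AWL code, which keeps its own per-sender history and may produce different numbers. The function name `awl_adjust` is hypothetical.

```python
def awl_adjust(raw_scores):
    """Return the adjusted score for each message from a single sender.

    Simplified model taken from the worked example in this message:
    the first message keeps its raw score, and every later message is
    averaged with the previous adjusted score. This is an assumption
    for illustration, not SpamAssassin's real algorithm.
    """
    adjusted = []
    previous = None
    for raw in raw_scores:
        if previous is None:
            final = float(raw)            # no history yet for this sender
        else:
            final = (previous + raw) / 2  # pull the score toward the history
        adjusted.append(final)
        previous = final
    return adjusted


# Raw scores of four messages from "spammer A", as in the example.
for number, score in enumerate(awl_adjust([3, 6, 6, 4.75]), start=1):
    print(f"message {number}: adjusted score {score}")
```

Running it reproduces the sequence 3, 4.5, 5.25, 5.0 from the example, which shows how a sender's history can pull a later message below or above the threshold.
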
If you have significant false positive or false negative problems, I'd turn AWL off and delete the AWL databases until you resolve those. Once the rules and Bayes systems are working OK, then turn AWL back on.

Bob Menschel