Robert Menschel wrote:
Hello Matias,
Thursday, February 10, 2005, 10:29:08 AM, you wrote:
MLB> The bayes and awl db are working only for the user who owns them right?
MLB> What about the bayes data created with the sa-learn command?
MLB> I'm being training spamassassin Bayesian filter since the installation
MLB> of SA.
It sounds to me like you're training a central Bayes database, while
the users are using their individual databases.
Individual databases are better (non-spam to the business department
may look like spam to the bio-chemistry teachers and v.v.), but a
central bayes system would work also, but you need to use one method
or the other -- teaching a central database which isn't being used by
the users will do not good.
uhm...
That's not good. It means that I have being training the bayes db for
nothing :(
Let me see if I get this right. If I'm training the Bayesian filter I
can do two things:
1) Train the filter with a central bayes db for all the SA users.
2) Train the users bayes db one by one, with the same info...
Something like a "for" with and "awk" output from /etc/password and the
ham/spam data will do the trick??
There is not a third option wen I have the bayes db for the user, and
also a central bayes db for all the users??
Which one will accurate the best performance in the matters of spam
detection??
MLB> The AWL looks like it's activated by default, that could may
MLB> cause some problems with the scoring and mark spam as ham and
MLB> vise versa? and also could it set wrong scores into the AWL db?
Read up on AWL on the wiki. It doesn't store future scores in a
database, just records what past scores have been, so it can average
out going forward. Yes, its purpose is to bring scores down or up and
can cross the spam threshold.
Thanks a lot for this explanation Bob, I think that my AWL it's working
fine for now, but I will keep an eye on it :)
BR,
Matías.