Re: Bayes

Matt Kettler Wed, 20 Jul 2005 09:43:04 -0700

Andrea Bencini wrote:

I perform a dump of the Bayes database (sa-learn --dump data)


0.049          0          1 1114618731  7bc32206de
0.026          0          2 1114618851  b214d213fd
0.958          1          0 1114607858  ea2d4af4f7
0.026          0          2 1114618998  5bb6204bec
0.005          0         10 1114619097  3236284507
0.049          0          1 1114618779  7af61f2550

it is not in human readable format, but in hex (7bc32206de).

How can I do?

In sa 3.0.0 or higher the tokens are stored as SHA1 hashes, there is nopractical way to reverse them back to text strings.

The conversion to hashes speeds up SA's database maintenance, as the tokens arenow fixed-size.

Unfortunately you can't browse your bayes db tokens to see what's been learnedas what. This does however have the side-benefit of improving privacy onmulti-user systems.

One thing you CAN still do is run a message through spamassassin -D and look atthe bayes ratings of the tokens that message hit. Since SA has the message intext form, it will be able present the tokens in text format not hash format.

Usually if you're questioning the contents of the bayes DB, it's in relation toa specific message, so this should cover most situations.

Re: Bayes

Reply via email to