Re: bayes not active although enabled?

2009-07-26 Thread Benny Pedersen

On Sun, July 26, 2009 04:43, snowweb wrote:

 In /etc/mail/spamassassin/local.cf

 bayes_auto_learn 1

 But when I examine the message headers,

 X-Spam-Status: No, score=3.0 required=4.7
 tests=ALL_TRUSTED,AWL,HTML_MESSAGE,
 MIME_HTML_ONLY,TVD_RCVD_IP autolearn=no version=3.2.4

 Is there anywhere else that I need to switch this on?

A score of 3.0 lies between the default auto-learn thresholds for ham and spam, see

perldoc Mail::SpamAssassin::Conf

so bayes won't auto-learn it. If it were -0.2 it would be learned as ham, and at
12.1 it would be learned as spam.
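
The threshold check described above can be sketched roughly as follows. This is a simplified illustration, not SpamAssassin's own code. The function name autolearn_decision is made up for the example, and the two constants are the documented defaults of bayes_auto_learn_threshold_nonspam and bayes_auto_learn_threshold_spam. The real auto-learner applies further conditions (it uses a separate auto-learn score that ignores some rules), and the exact handling of a score that sits precisely on a threshold is an assumption here.

```python
# Simplified sketch of the auto-learn threshold check (not SpamAssassin's code).
NONSPAM_THRESHOLD = 0.1   # default bayes_auto_learn_threshold_nonspam
SPAM_THRESHOLD = 12.0     # default bayes_auto_learn_threshold_spam


def autolearn_decision(score, nonspam=NONSPAM_THRESHOLD, spam=SPAM_THRESHOLD):
    """Return 'ham', 'spam' or 'no' for a given auto-learn score."""
    if score < nonspam:
        return "ham"      # low enough to be learned as ham
    if score >= spam:     # boundary handling is an assumption of this sketch
        return "spam"     # high enough to be learned as spam
    return "no"           # between the thresholds: nothing is learned


# The three scores mentioned in this message.
for s in (-0.2, 3.0, 12.1):
    print(s, "->", autolearn_decision(s))
```

A message scoring 3.0 therefore ends up with autolearn=no regardless of whether bayes_auto_learn is switched on.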

-- 
xpoint



Re: bayes not active although enabled?

2009-07-26 Thread snowweb




Sorry Benny, my message wasn't very clear (although your information was
interesting). However, my main concern is that it is not using bayes to
analyse the messages, let alone to learn from them.

I've now trained bayes with well over 200 mails using sa-learn, but I'm
wondering whether the parameters you mention prevent it from learning even when
using sa-learn, since it still has not kicked in?

-- 
View this message in context: 
http://www.nabble.com/bayes-not-active-although-enabled--tp24663548p24666833.html
Sent from the SpamAssassin - Users mailing list archive at Nabble.com.



Re: bayes not active although enabled?

2009-07-26 Thread Benny Pedersen

On Sun, July 26, 2009 15:10, snowweb wrote:
 Sorry Benny, my message wasn't very clear (although your information was
 interesting). However, my main concern is that it is not using bayes to
 analyse the messages, let alone to learn from them.

sa-learn --dump magic

 I've now trained bayes with well over 200 mails using sa-learn, but I'm
 wondering whether the parameters you mention prevent it from learning even when
 using sa-learn, since it still has not kicked in?

There can be several rules with tflags noautolearn hitting the message being
tested. I just want to be sure you see the same problem when the score is
less than -0.1 or greater than 12.1.

-- 
xpoint



Re: bayes not active although enabled?

2009-07-26 Thread snowweb




I love you guys! You're spot on every time...

[r...@s1 Maildir]# sa-learn --dump magic
[9647] warn: FuzzyOcr: Cannot find executable for tesseract
0.000          0          3          0  non-token data: bayes db version
0.000          0        258          0  non-token data: nspam
0.000          0        160          0  non-token data: nham
0.000          0      26027          0  non-token data: ntokens
0.000          0 1245088823          0  non-token data: oldest atime
0.000          0 1248611169          0  non-token data: newest atime
0.000          0 1248614005          0  non-token data: last journal sync atime
0.000          0          0          0  non-token data: last expiry atime
0.000          0          0          0  non-token data: last expire atime delta
0.000          0          0          0  non-token data: last expire reduction count

I see from that, that I've not trained as many HAM as I thought! OK, I'm off
in search of some more HAM! Thanks guys.




Re: bayes not active although enabled?

2009-07-26 Thread Benny Pedersen

On Sun, July 26, 2009 15:29, snowweb wrote:

 0.000          0        258          0  non-token data: nspam
 0.000          0        160          0  non-token data: nham

Try to keep them more or less equal to have a good bayes db.

A difference of less than 1000 is fine.

If the difference is more than 1000, adjust the learning scores.

 I see from that, that I've not trained as many HAM as I thought! OK, I'm off
 in search of some more HAM! Thanks guys.

Super, you finally found the problem.

-- 
xpoint



Re: bayes not active although enabled?

2009-07-26 Thread snowweb




Sorry, it hasn't solved it :(

As you can see below, I now have more than 200 of both SPAM & HAM trained:

[r...@s1 Maildir]# sa-learn --dump magic
[1644] warn: FuzzyOcr: Cannot find executable for tesseract
0.000          0          3          0  non-token data: bayes db version
0.000          0        278          0  non-token data: nspam
0.000          0        221          0  non-token data: nham
0.000          0      34120          0  non-token data: ntokens
0.000          0 1245088823          0  non-token data: oldest atime
0.000          0 1248616170          0  non-token data: newest atime
0.000          0 1248620830          0  non-token data: last journal sync atime
0.000          0          0          0  non-token data: last expiry atime
0.000          0          0          0  non-token data: last expire atime delta
0.000          0          0          0  non-token data: last expire reduction count

but here are the spam headers of a message, which show that bayes is not
being used:

X-Spam-Flag: NO
X-Spam-Checker-Version: SpamAssassin 3.2.4 (2008-01-01) on s1.snowweb.info
X-Spam-Level: **
X-Spam-Status: No, score=3.0 required=4.7 tests=RELAYCOUNTRY_US autolearn=no
 version=3.2.4
X-Spam-Report:
 * 3.0 RELAYCOUNTRY_US Relayed through United States of America
 * 0.0 HTML_MESSAGE BODY: HTML included in message
X-Spam-Relay-Country: US US US US US US US US US

This is not a random event. Bayes is not being used for any message.
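
One way to check many messages for this symptom is to look for BAYES_* rule names in the tests= field of X-Spam-Status. The sketch below is only an illustration. The function name bayes_rules_hit is made up, it assumes the header layout shown in this thread (a tests= list followed by autolearn= and version=), and the second sample header is hypothetical.

```python
import re


def bayes_rules_hit(status_header):
    """Return the BAYES_* rule names listed in the tests= field."""
    # Capture everything after tests= up to the next field or end of header.
    # The list may be folded across several lines, hence re.DOTALL.
    match = re.search(
        r"tests=(.*?)(?:\s+autolearn=|\s+version=|$)", status_header, re.DOTALL
    )
    if not match:
        return []
    # Drop the folding whitespace, then split the comma-separated rule names.
    names = re.sub(r"\s+", "", match.group(1)).split(",")
    return [name for name in names if name.startswith("BAYES_")]


# Header taken from the first message of this thread: no bayes rule fired.
from_thread = (
    "X-Spam-Status: No, score=3.0 required=4.7\n"
    " tests=ALL_TRUSTED,AWL,HTML_MESSAGE,\n"
    " MIME_HTML_ONLY,TVD_RCVD_IP autolearn=no version=3.2.4"
)
# Hypothetical header showing what a hit would look like.
hypothetical = (
    "X-Spam-Status: No, score=0.8 required=4.7 tests=BAYES_50,\n"
    " HTML_MESSAGE autolearn=no version=3.2.4"
)
print(bayes_rules_hit(from_thread))
print(bayes_rules_hit(hypothetical))
```

When bayes is active, a BAYES_* rule normally shows up in tests= even on messages that score low, so an empty result across every message is a reasonable hint that the classifier is not being consulted at all.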



Re: bayes not active although enabled?

2009-07-26 Thread Jari Fredriksson
 This is not a random event. Bayes is not being used for
 any message. 

Which user do you run the spam check as? sa-learn shows root's magic. If you use,
for example, the user spam to check the mail, you should call sa-learn as

sa-learn -u spam --dump magic




Re: bayes not active although enabled?

2009-07-26 Thread Jari Fredriksson
 Which user do you run the spam check as? sa-learn shows
 root's magic. If you use, for example, the user spam to check
 the mail, you should call sa-learn as
 
 sa-learn -u spam --dump magic

One can also use

bayes_sql_override_username spam

in local.cf, if the bayes is in a SQL database. I have that, and there seems to 
be no difference if I use -u or not.




Re: bayes not active although enabled?

2009-07-26 Thread snowweb



Jari Fredriksson wrote:
 
 
 One can also use
 
 bayes_sql_override_username spam
 
 in local.cf, if the bayes is in a SQL database. I have that, and there
 seems to be no difference if I use -u or not.
 
 

I was wondering whether it was a user issue earlier, but I tried sa-learn
--username=user --dump magic and it seemed to give the same result as the
default user. I tried various different users (even made some up!) but it
didn't complain - still gave me the same dump.

While writing this, I just checked messages on another domain/user and found
that bayes is working there! phew! That's a relief... it seems that it is
only on my domain that we have an issue.

On Friday, due to a lack of understanding of how spamassassin works, I
renamed the following:

/home/snowbnet/.spamassassin/bayes.mutex
/home/snowbnet/.spamassassin/bayes_journal
/home/snowbnet/.spamassassin/bayes_seen
/home/snowbnet/.spamassassin/bayes_toks

I added .old to the end of each.

Afterwards, spamassassin recreated them and I thought all was good, but
because bayes stopped working, I changed them back and removed the newly
created versions. In hindsight, I guess it just needed training. Anyway,
I don't know if what I did is related to this problem.

Is the 'user' the email account user or the domain user?

Also, I've just noticed that the bayes_journal has not been modified since
Saturday about the time I renamed stuff.
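
With file-based storage and the default bayes_path, each system account that runs SpamAssassin keeps its own database under ~/.spamassassin/. A small script along these lines can show which files exist for a given home directory and when they were last written. It is a sketch under that default-path assumption, and the function name bayes_file_report is made up for the example.

```python
import os
import time

# File names assume the default bayes_path of ~/.spamassassin/bayes.
BAYES_FILES = ("bayes_toks", "bayes_seen", "bayes_journal")


def bayes_file_report(home_dir):
    """Return (path, mtime or None) for each expected per-user bayes file."""
    sa_dir = os.path.join(home_dir, ".spamassassin")
    report = []
    for name in BAYES_FILES:
        path = os.path.join(sa_dir, name)
        # None marks a file that does not exist for this user.
        mtime = os.path.getmtime(path) if os.path.exists(path) else None
        report.append((path, mtime))
    return report


if __name__ == "__main__":
    # Inspect the home directory of whoever runs the script.
    for path, mtime in bayes_file_report(os.path.expanduser("~")):
        stamp = time.ctime(mtime) if mtime is not None else "missing"
        print(f"{path}: {stamp}")
```

Running it against the home directory of the account that actually scans the mail, and comparing with root's, helps tell whether sa-learn and the scanner are looking at the same files.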



Re: bayes not active although enabled?

2009-07-26 Thread snowweb




To recap, update and summarize:

Bayes is not being used for one domain on the server. For the others it is.

Bayes is file based not mysql based.

When I do sa-learn --username=user --dump magic I get the same result for
whatever username I try, even if the user doesn't exist.

sa-learn --dump magic tells me that both nspam and nham are above 200, so it
isn't waiting for training.

I probably created the problem on Saturday when in ignorance, I decided that
I would erase the bayes db and retrain it. I renamed the bayes_ files in
/home/username/.spamassassin/ by adding .old. They were all automatically
recreated but bayes no longer worked. Not understanding that perhaps they
just needed training, I renamed them back and removed the new files.
However, bayes did not resume working.

I have since used sa-learn --clear and then retrained it with 200+ each of
spam and ham but it still has not started working.

I can't figure out whether on my system SA is using one bayes db for the
whole system or one for each user or domain? However, the other domains seem
to be working ok.

In /etc/mail/spamassassin/ I have use_bayes 1

Perhaps someone could spot something I've missed for me? Meanwhile, I'll go
and study the wiki some more. Thanks.


bayes not active although enabled?

2009-07-25 Thread snowweb

In /etc/mail/spamassassin/local.cf

bayes_auto_learn 1

But when I examine the message headers,

X-Spam-Status: No, score=3.0 required=4.7
tests=ALL_TRUSTED,AWL,HTML_MESSAGE,
MIME_HTML_ONLY,TVD_RCVD_IP autolearn=no version=3.2.4

Is there anywhere else that I need to switch this on?



Re: bayes not active although enabled?

2009-07-25 Thread snowweb

Sorry, got mixed up. In /etc/mail/spamassassin/local.cf

use_bayes 1

Is there anywhere else that I need to switch this on? It does not
appear to be doing bayesian testing at all for any messages.




Re: bayes not active although enabled?

2009-07-25 Thread Matt Kettler
snowweb wrote:
 Sorry, got mixed up. In /etc/mail/spamassassin/local.cf

 use_bayes 1

 Is there anywhere else that I need to switch this on since it does not
 appear to be doing bayesian testing at all for any messages.

   
check your sa-learn --dump magic

SA won't activate bayes until it has learned at least 200 spam, and 200
nonspam messages. (under the general premise that until you have a
decent amount of mail learned, the statistics are going to be a bit
erratic and not worthwhile using)
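
The check can be automated by parsing the output of sa-learn --dump magic. The sketch below is only an illustration. The names parse_magic and bayes_ready are made up, the limits are the documented defaults of bayes_min_spam_num and bayes_min_ham_num (200 each), and the sample text reuses the counts posted earlier in this thread.

```python
import re

# Documented defaults of bayes_min_spam_num and bayes_min_ham_num.
MIN_SPAM = 200
MIN_HAM = 200

# Matches lines such as "0.000  0  258  0  non-token data: nspam".
MAGIC_LINE = re.compile(r"^\s*\S+\s+\d+\s+(\d+)\s+\d+\s+non-token data: (.+?)\s*$")


def parse_magic(dump_text):
    """Map each 'non-token data' label to its integer value."""
    values = {}
    for line in dump_text.splitlines():
        match = MAGIC_LINE.match(line)
        if match:  # warnings and other lines are skipped
            values[match.group(2)] = int(match.group(1))
    return values


def bayes_ready(dump_text, min_spam=MIN_SPAM, min_ham=MIN_HAM):
    """True when enough spam and ham have been learned for bayes to be used."""
    magic = parse_magic(dump_text)
    return magic.get("nspam", 0) >= min_spam and magic.get("nham", 0) >= min_ham


sample = """\
[9647] warn: FuzzyOcr: Cannot find executable for tesseract
0.000          0          3          0  non-token data: bayes db version
0.000          0        258          0  non-token data: nspam
0.000          0        160          0  non-token data: nham
"""
print(parse_magic(sample))
print(bayes_ready(sample))  # False: only 160 ham learned
```

Keep in mind that the dump reflects the database of the user running sa-learn, which is not necessarily the one the scanner reads.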