RE: Bayes db size....

2007-02-17 Thread Dave Koontz
I am sure this has been asked numerous times before, but what is the logic in having auto expiry on the bayes DB, and not seen? Seems that once tokens have been removed from the DB there is little to no use for 'unlearning' any associated messages. Besides on a busy system, this seen file gets

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Justin Mason
Raul Dias writes: On Sat, 2007-02-17 at 02:07 +0100, Mark Martinec wrote: On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in my experience about 30 (maybe 50) SA processes can fully

SA not working?

2007-02-17 Thread David Obando
Dear all, I installed SA on a Debian Etch system together with Postfix and Amavis. Strangely SA doens't score mails at all, but I don't see why. See the output of a spam mail I checked manually, When I run a check on the same mail on a different machine, it is scored: [EMAIL PROTECTED] tmp]#

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Raul Dias
On Sat, 2007-02-17 at 11:21 +, Justin Mason wrote: Raul Dias writes: On Sat, 2007-02-17 at 02:07 +0100, Mark Martinec wrote: On Saturday February 17 2007 01:49, Matthew Wilson wrote: I was/am primarily concerned with RAM usage for high-concurrency situations. Ok. Still, in

RE: HarrisPoll

2007-02-17 Thread Michael Scheidell
-Original Message- From: LuKreme [mailto:[EMAIL PROTECTED] Sent: Friday, February 16, 2007 1:26 PM To: users@spamassassin.apache.org Subject: HarrisPoll Where does the WS-SURBL info come from? I ask because the Harris Poll email is getting tagged with it. As far as I

Re: Bayes db size....

2007-02-17 Thread Michael Parker
Dave Koontz wrote: I am sure this has been asked numerous times before, but what is the logic in having auto expiry on the bayes DB, and not seen? Seems that once tokens have been removed from the DB there is little to no use for 'unlearning' any associated messages. Besides on a busy

Re: SA not working?

2007-02-17 Thread Matt Kettler
David Obando wrote: Dear all, I installed SA on a Debian Etch system together with Postfix and Amavis. Strangely SA doens't score mails at all, but I don't see why. See the output of a spam mail I checked manually, When I run a check on the same mail on a different machine, it is scored:

Re: Bayes db size....

2007-02-17 Thread Dave Koontz
Is there a consensus on this need? I deal with the seen db issue by scheduled deletion of that file. That said, with SA becoming more and more prominent all the time, I suspect the Average Joe will miss this oddity until they wind up with a sluggish system, out of drive space or other related

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Matthew Wilson
Raul Dias writes: **snip If I remember correctly spamd was using something between 2 to 5% of memory reported by top (45 process max). If it was really shared, it would have not collapsed. My bet is that the model used on Linux is copy on write. So after a fork, when the child spamd

Re: Export and append Bayes DB

2007-02-17 Thread Sam Przyswa
Michael Parker a écrit : Sam Przyswa wrote: Hi, Is it possible to export a Bayes DB from a server and then append (not restore) it to others servers ? No, you generally can't combine two bayes databases that way. Best bet is to pick the most complete one and use it. For more

Re: SA not working?

2007-02-17 Thread David Obando
Matt Kettler schrieb am 17.02.2007 15:08: David Obando wrote: Dear all, I installed SA on a Debian Etch system together with Postfix and Amavis. Strangely SA doens't score mails at all, but I don't see why. See the output of a spam mail I checked manually, When I run a check on the

Re: HarrisPoll

2007-02-17 Thread LuKreme
On 17-Feb-2007, at 06:39, Michael Scheidell wrote: -Original Message- From: LuKreme [mailto:[EMAIL PROTECTED] Sent: Friday, February 16, 2007 1:26 PM To: users@spamassassin.apache.org Subject: HarrisPoll Where does the WS-SURBL info come from? I ask because the Harris Poll email is

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Chris St. Pierre
On Fri, 16 Feb 2007, Quinn Comendant wrote: How about an extensive statistics reporting tool, possible web-based, that can show how well a current spamassassin installation is performing and where it needs improvements. It could provide trends in different classes of spam and how each is

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Tim B.
Justin Mason wrote: Theo Van Dinter writes: I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Theo Van Dinter
On Sat, Feb 17, 2007 at 06:56:28PM -0500, Tim B. wrote: How about a How many times have I seen this message body plugin... So each time SA see's the same or similar enough message body, it increases the score. Doesn't SA have at least 3 of those already? Razor, DCC, and Pyzor. --

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Mark Martinec
On Saturday February 17 2007 03:01, Quinn Comendant wrote: How about an extensive statistics reporting tool, ..., that can show how well a current spamassassin installation is performing and where it needs improvements. Well, not exactly by your words, but in the same spirit, this time

Re: [2] How can I configure spamassassin to filter spam jpgs?

2007-02-17 Thread John Thompson
On 2007-02-15, NIbbLLe [EMAIL PROTECTED] wrote: The problem is that we are running spamassassin through plesk 7 and we are running it on a Windows machine. I went to the FuzzyOCR site, I see the only files that they have is .tar (for linux) . Do you maybe have any suggestions on how I can

Re: FuzzyOCR

2007-02-17 Thread John Thompson
On 2007-02-12, Sujit Choudhury [EMAIL PROTECTED] wrote: Is there an easy way to get everything needed for FuzzyOCR? Has somebody built a complete install, so that we don't have to go to various sites to built various bits of FuzzyOCR? On FreeBSD when you install from the ports collection

Re: Google Summer of Code 2007 ...

2007-02-17 Thread Graham Murray
Theo Van Dinter [EMAIL PROTECTED] writes: Doesn't SA have at least 3 of those already? Razor, DCC, and Pyzor. Not quite. Those show how many times *others* have seen it, not how many times *I* have seen it. Also, these have hysteresis so if you are unfortunately to be at the start of the spam

Re: Google Summer of Code 2007 ...

2007-02-17 Thread hamann . w
Not quite. Those show how many times *others* have seen it, not how many times *I* have seen it. Also, these have hysteresis so if you are unfortunately to be at the start of the spam run and receive multiple mails all with the same body then Razor, DCC and Pyzor might not help. Though if

Re: HarrisPoll

2007-02-17 Thread Jeff Chan
On Saturday 17 February 2007 10:44, LuKreme wrote: On 17-Feb-2007, at 06:39, Michael Scheidell wrote: -Original Message- From: LuKreme [mailto:[EMAIL PROTECTED] Sent: Friday, February 16, 2007 1:26 PM To: users@spamassassin.apache.org Subject: HarrisPoll Where does the

Re: HarrisPoll

2007-02-17 Thread Jeff Chan
I should have addded, we are removing the Harris Poll domain hpolsurveys.com from the blacklist.

Re: HarrisPoll

2007-02-17 Thread Jeff Chan
On Saturday 17 February 2007 23:29, Jeff Chan wrote: I should have addded, we are removing the Harris Poll domain hpolsurveys.com from the blacklist. Actually checking more closely, this domain is not on any SURBL blacklists. If you got this result recently, then you may be suffering from the