bayes learn best practice

2009-04-09 Thread Arthur Kerpician
Hi, I recently upgraded to 3.2.5 and re-trained bayes db from scratch. The auto-learn is on so now I have about 6000 mails trained as spam and 3000 as ham. I tried to manually keep both spam and ham at the same level in the bayes db but it seems that spamassassin is learning spam twice as

Re: The value chain of SpamAssassin

2009-04-09 Thread Justin Mason
hi -- I've answered a couple, and cc'd the message to the SA users list for more feedback from other developers and contributors -- it's a lot of questions! ;) --j. On Wed, Apr 8, 2009 at 18:56, DEMBELE Mariam mariam.demb...@unifr.ch wrote: Dear Mr Mason, I am doing my masters in

Re: use_auto_whitelist error in lint

2009-04-09 Thread realshock
Matt Kettler-3 wrote: Find out where else you've got use_auto_whitelist 0 in your config, and remove it. On the plus side, it does confirm you've correctly disabled the plugin. I searched all over the place, and following your directions, do you think this command will find where

Re: use_auto_whitelist error in lint

2009-04-09 Thread Jeff Mincy
From: realshock wael.alt...@gmail.com Date: Thu, 9 Apr 2009 06:56:05 -0700 (PDT) Matt Kettler-3 wrote: Find out where else you've got use_auto_whitelist 0 in your config, and remove it. On the plus side, it does confirm you've correctly disabled the plugin. I

Registry Barrier code

2009-04-09 Thread Marc Perkel
What source file is the registry barrier code in? Thanks in advance.

Re: use_auto_whitelist error in lint

2009-04-09 Thread realshock
Jeff Mincy-2 wrote: spamassassin -D --lint prints out the config files, eg: spamassassin -D --lint 21 | fgrep 'config: read file' The use_auto_whitelist is in one of those config files. -jeff Worked ... spamassassin --lint returning no errors Rules update are back to

Re: bayes learn best practice

2009-04-09 Thread John Hardin
On Thu, 9 Apr 2009, Arthur Kerpician wrote: I tried to manually keep both spam and ham at the same level in the bayes db but it seems that spamassassin is learning spam twice as fast as ham. Not surprising, as raw email traffic has a very skewed spam:ham ratio. Surely you've heard the stats

Re: Registry Barrier code

2009-04-09 Thread Mark Martinec
Marc, What source file is the registry barrier code in? Mail/SpamAssassin/Util/RegistrarBoundaries.pm but is slightly out of date, for example it does not include registered IDN tld names: XN--0ZWM56D XN--11B5BS3A9AJ6G XN--80AKHBYKNJ4F XN--9T4B11YI5A XN--DEBA0AD XN--G6W251D XN--HGBK6AJ7F53BBA

Re: Registry Barrier code

2009-04-09 Thread Karsten Bräckelmann
On Thu, 2009-04-09 at 07:25 -0700, Marc Perkel wrote: What source file is the registry barrier code in? M::SA::Util::RegistrarBoundaries -- char *t=\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4; main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;il;i++){ i%8? c=1:

Re: Registry Barrier code

2009-04-09 Thread Justin Mason
On Thu, Apr 9, 2009 at 16:14, Mark Martinec mark.martinec...@ijs.si wrote: Marc, What source file is the registry barrier code in? Mail/SpamAssassin/Util/RegistrarBoundaries.pm but is slightly out of date, for example it does not include registered IDN tld names: XN--0ZWM56D

Re: Slightly OT: identifying IP source locations

2009-04-09 Thread Kai Schaetzl
John Rudd wrote on Wed, 8 Apr 2009 12:44:29 -0700: 1) Does anyone know of a convenient command line tool (perl library being ideal) that lets you give it an IP address, and it tells you the country and/or continent (and that's it)? google for GeoIP. 2) similarly, does anyone know of a

Re: Registry Barrier code

2009-04-09 Thread Justin Mason
On Thu, Apr 9, 2009 at 16:31, Justin Mason j...@jmason.org wrote: On Thu, Apr 9, 2009 at 16:14, Mark Martinec mark.martinec...@ijs.si wrote: Marc, What source file is the registry barrier code in? Mail/SpamAssassin/Util/RegistrarBoundaries.pm but is slightly out of date, for example it

Re: Registry Barrier code

2009-04-09 Thread Marc Perkel
Thanks - that was what I was looking for. Mark Martinec wrote: Marc, What source file is the registry barrier code in? Mail/SpamAssassin/Util/RegistrarBoundaries.pm but is slightly out of date, for example it does not include registered IDN tld names: XN--0ZWM56D XN--11B5BS3A9AJ6G

Re: bayes learn best practice

2009-04-09 Thread Kai Schaetzl
Arthur Kerpician wrote on Thu, 09 Apr 2009 09:41:22 +0300: The docs mention that after 5000 spam and ham learned, spamassassin doesn't improve spam detection much. do they? What is meant is that once you reach some threshold the detection rate doesn't improve as good as before. You can't get

unsubscribe

2009-04-09 Thread Juergen Boehm
unsubscribe

Re: unsubscribe

2009-04-09 Thread Evan Platt
As every header states: list-unsubscribe: mailto:users-unsubscr...@spamassassin.apache.org At 09:41 AM 4/9/2009, Juergen Boehm wrote: unsubscribe

Re: bayes learn best practice

2009-04-09 Thread Arthur Kerpician
Kai Schaetzl wrote: Arthur Kerpician wrote on Thu, 09 Apr 2009 09:41:22 +0300: The docs mention that after 5000 spam and ham learned, spamassassin doesn't improve spam detection much. do they? What is meant is that once you reach some threshold the detection rate doesn't improve as

Re: bayes learn best practice

2009-04-09 Thread Michael Scheidell
Arthur Kerpician wrote: I was thinking to increase bayes_auto_learn_threshold_spam to a higher number, so less spam is auto-learned. Is this ok? I try to keep it a 10 to 1 ratio (dropping the _ham threshold and increasing the _spam threshold), basically, trying to mimic the global stats

Re: Spam Rats - does anyone know them?

2009-04-09 Thread Matus UHLAR - fantomas
What I am complaining about is that the IP is reported to be dynamic because it does not have hostname that follows kind of sick rules. On 09.04.09 01:28, Mark wrote: Their rules DO seem a mite odd: Also remember, according to Best Practises, having a reverse DNS that appears to be part

Re: Spam Rats - does anyone know them?

2009-04-09 Thread McDonald, Dan
On Wed, 2009-04-08 at 23:49 +0200, mouss wrote: Matus UHLAR - fantomas a écrit : Even if that record would be listed in SPF? SPF again? any spammer can buy a domain and add arbitrary IPs to the SPF record. you know about fast flux, right? You are thinking of SPF at the wrong layer. It

Re: Spam Rats - does anyone know them?

2009-04-09 Thread Neil Schwartzman
On 09/04/09 2:35 PM, Matus UHLAR - fantomas uh...@fantomas.sk wrote: That's the question. I do not object against listing of a spammer, but dynamic? naming convention? Will they block host if it spams, if it sends mail from gmail com and the hostname is qw-out-1920.google.com which looks like

Re: Spam Rats - does anyone know them?

2009-04-09 Thread Neil Schwartzman
BWA HAHAHAHA Someone here isn't just using SA. Got a bounce saying I said a bad word. For the record, it wasn't me. Microsoft Antigen for SMTP found a message matching a filter. The message is currently Purged. Message: Re_ Spam Rats _ does anyone know them_ Filter name: KEYWORD= profanity:

Re: Spam Rats - does anyone know them?

2009-04-09 Thread McDonald, Dan
On Thu, 2009-04-09 at 15:55 -0400, Neil Schwartzman wrote: On 09/04/09 2:35 PM, Matus UHLAR - fantomas uh...@fantomas.sk wrote: That's the question. I do not object against listing of a spammer, but dynamic? naming convention? Will they block host if it spams, if it sends mail from gmail

Re: Spam Rats - does anyone know them?

2009-04-09 Thread Neil Schwartzman
On 09/04/09 4:06 PM, McDonald, Dan dan.mcdon...@austinenergy.com wrote: I won't block on it alone, but if someone wants a whitelist entry, they have to have rDNS correct. And preferably an SPF or DKIM policy Well, an Sender ID-compliant SPF record has long been a requirement for our

Re: Spam Rats - does anyone know them?

2009-04-09 Thread mouss
McDonald, Dan a écrit : On Wed, 2009-04-08 at 23:49 +0200, mouss wrote: Matus UHLAR - fantomas a écrit : Even if that record would be listed in SPF? SPF again? any spammer can buy a domain and add arbitrary IPs to the SPF record. you know about fast flux, right? You are thinking of SPF at

Re: Spam Rats - does anyone know them?

2009-04-09 Thread mouss
Matus UHLAR - fantomas a écrit : What I am complaining about is that the IP is reported to be dynamic because it does not have hostname that follows kind of sick rules. On 09.04.09 01:28, Mark wrote: Their rules DO seem a mite odd: Also remember, according to Best Practises, having a

livejournal?

2009-04-09 Thread McDonald, Dan
I notice that uribl is listing livejournal.com as a util_rb_2tld host, but http://daryl.dostech.ca/sa-update/sare/90_2tld.cf doesn't have livejournal.com yet, and updates_spamassassin_org/25_uribl.cf has livejournal.com listed in uridnsbl_skip_domain. If I were to add util_rb_2tld livejournal.com

Re: livejournal?

2009-04-09 Thread Benny Pedersen
On Fri, April 10, 2009 00:10, McDonald, Dan wrote: [snip] If I were to add util_rb_2tld livejournal.com to local-foo.cf, would the presence of uridnsbl_skip_domain prevent it from being checked? And if so, how do I unskip that domain? no its just subdomain that might be blacklisted in url,

emailreg.org (was: zen.spamhaus.org)

2009-04-09 Thread SM
Hi Rob, At 12:52 07-04-2009, Rob McEwen wrote: I had no idea that emailreg.org was owned and operated by Barracuda. I http://www.barracudacentral.org/about/emailreg http://www.emailreg.org/index.cgi?p=about But, as the post you mentioned said, emailreg.org resolves to 64.235.146.64 and