Re: A New Approach: Find the Ham

2007-02-11 Thread John Rudd
Giampaolo Tomassoni wrote: From: Miles Fidelman [mailto:[EMAIL PROTECTED] Dan wrote: I've developed a new approach to scoring that I want to 1) share with everyone and 2) make into a working system thats as accurate as what I've already built, but easier to use. First, the theory: NEW

Re: A New Approach: Find the Ham

2007-02-11 Thread John Andersen
On Saturday 10 February 2007, Dan wrote: On Feb 10, 2007, at 14:38, Mathieu Bouchard wrote: How do you ever find FPs if you have so many TP to sort through   that it's not even worth sorting through FP+TP to find the FP ?   IMHO, that'd be why we assume that mails are ham rather than assume

Re: A New Approach: Find the Ham

2007-02-11 Thread Justin Mason
Long-time SpamAssassin users with a good memory might recall back in SpamAssassin 2.4x, we included quite a few ham-targeting rules, such as was this sent using User-Agent: Mozilla?, is this formatted like a reply to a previous message?, does it include headers from a mailing list? and is it

Re: Query regarding whitelist_to

2007-02-11 Thread sushma
could you help me in switch to per-user filtering. On Sun, 11 Feb 2007, Theo Van Dinter wrote: On Sun, Feb 11, 2007 at 11:36:56AM +, sushma wrote: Spam mail originated to list of user, if one user in whitelist_to then score will be neagtive so all other user also get that spam

Re: Query regarding whitelist_to

2007-02-11 Thread sushma
On Sun, 11 Feb 2007, sushma wrote: I can't shift to per-user filtering, could you please explain more about this statement don't filter based on the scan results. could you help me in switch to per-user filtering. On Sun, 11 Feb 2007, Theo Van Dinter wrote: On Sun, Feb 11, 2007 at

New drug spam...

2007-02-11 Thread Burak Ueda
No doubt that spammers watching this list. They update their tactics right after a solution is posted here I got this today im several mail address, and most of them got 4-5 score: Original Message From: - Sun Feb 11 22:15:22 2007 X-Account-Key: account29 X-UIDL:

Why are arguments ignored in the EvalTests.pm methods?

2007-02-11 Thread Robert Nicholson
Can anybody tell me why the argument is passed for raw tests and then subsequently ignored later? # generic test version sub check_for_mime { my ($self, undef, $test) = @_; $self-_check_attachments unless exists $self-{$test}; return $self-{$test}; } for instance the body array goes

Re: sa ignoring whitelist_from in user_prefs

2007-02-11 Thread Matt Kettler
Rich Winkel wrote: For a particular user, I'm finding no correlation between his whitelist_from's in user_prefs and the whitelist status as reported in incoming messages. I see messages with no USER_IN_WHITELIST when both the From and From: addresses match a whitelist_from line in the

saving matching values

2007-02-11 Thread Raul Dias
There are some cases, that it is desired to match part a value from a header, to another value somewhere else. Is there a way for SA to retain the value matched in a RE like $1/$2 matching parentheses, so that it might be used later (or at least in the next rule)? -Raul Dias

_check_attachments....

2007-02-11 Thread Robert Nicholson
Can anybody tell me if this looks at meta tag content types as in? meta http-equiv=3DContent-Type content=3Dtext/html; charset=3Deuc- kr likewise is this considered excessive quoted printable chars? Illegal chars gets this one but if they weren't used none of the charset/illegal rules

Re: A New Approach: Find the Ham

2007-02-11 Thread tom
On Feb 10, 2007, at 3:19 PM, Giampaolo Tomassoni wrote: From: Tom Allison [mailto:[EMAIL PROTECTED] Personally, I think HTML email should be outright discarded from the start. If you look at this arguement presented by the OP then it reinforces the idea that most ascii is ham and most html is

RE: A New Approach: Find the Ham

2007-02-11 Thread Giampaolo Tomassoni
From: tom [mailto:[EMAIL PROTECTED] On Feb 10, 2007, at 3:19 PM, Giampaolo Tomassoni wrote: From: Tom Allison [mailto:[EMAIL PROTECTED] Personally, I think HTML email should be outright discarded from the start. If you look at this arguement presented by the OP then it reinforces

Re: Query regarding whitelist_to

2007-02-11 Thread Theo Van Dinter
On Sun, Feb 11, 2007 at 06:32:06PM +, sushma wrote: I can't shift to per-user filtering, could you please explain more about this statement don't filter based on the scan results. It essentially means that if you want some users to receive mails that other users shouldn't, and you want

Re: A New Approach: Find the Ham

2007-02-11 Thread Theo Van Dinter
On Sat, Feb 10, 2007 at 08:22:41PM +, Nigel Frankcom wrote: What do Theo, Matt Co have to say? They've been doing this a lot longer than us. Unless I'm missing something, this approach is the standard block everything except for what we explicitly want to receive. Which is great, if you

Re: saving matching values

2007-02-11 Thread Matt Kettler
Raul Dias wrote: There are some cases, that it is desired to match part a value from a header, to another value somewhere else. Is there a way for SA to retain the value matched in a RE like $1/$2 matching parentheses, so that it might be used later (or at least in the next rule)? No,

DKIM / DomainKeys

2007-02-11 Thread Alexis Manning
I enabled the DK/DKIM plugins in my SA 3.1.7 setup and I see that the default scores for their tests are negligible, presumably because they're still a bit experimental. Is anyone using these and can suggest appropriate scores for these plugins, or are these really just too unripe for serious

RE: A New Approach: Find the Ham

2007-02-11 Thread Philip Seccombe
Apologies if this has been answered before or anything, but where/how are you generating those stats? I'm not using SA with SQL so I'm not sure if it will work for me, but those I like! Stats in question: http://www.blue-canoe.com/stats/index.php?D1=11 Kind Regards, Philip Seccombe Turnstone

Re: DKIM / DomainKeys

2007-02-11 Thread Michele Neylon :: Blacknight
Alexis Manning wrote: I enabled the DK/DKIM plugins in my SA 3.1.7 setup and I see that the default scores for their tests are negligible, presumably because they're still a bit experimental. Is anyone using these and can suggest appropriate scores for these plugins, or are these really just

Re: Why are arguments ignored in the EvalTests.pm methods?

2007-02-11 Thread Theo Van Dinter
On Sun, Feb 11, 2007 at 08:07:28AM -0600, Robert Nicholson wrote: Can anybody tell me why the argument is passed for raw tests and then subsequently ignored later? The argument is passed because it's a standard call for an eval rule, but the eval code doesn't need the information. Since

Re: saving matching values

2007-02-11 Thread Raul Dias
On Sun, 2007-02-11 at 15:49 -0500, Matt Kettler wrote: Raul Dias wrote: There are some cases, that it is desired to match part a value from a header, to another value somewhere else. Is there a way for SA to retain the value matched in a RE like $1/$2 matching parentheses, so that it

Re: DKIM / DomainKeys

2007-02-11 Thread Alexis Manning
[EMAIL PROTECTED] says... Alexis Manning wrote: [DK/DKIM plugins] Is anyone using these and can suggest appropriate scores for these plugins, or are these really just too unripe for serious use at the moment? Why don't you keep an eye on the activity for those scores and then decide?

Re: A New Approach: Find the Ham

2007-02-11 Thread .rp
On 10 Feb 2007 at 11:43, Dan wrote: I've developed a new approach to scoring that I want to 1) share with everyone and 2) make into a working system thats as accurate as what I've already built, but easier to use. First, the theory: [...] NEW SITUATION Ham is now the tiniest minority of

Re: DKIM / DomainKeys

2007-02-11 Thread Mark Martinec
Alexis, I enabled the DK/DKIM plugins in my SA 3.1.7 setup and I see that the default scores for their tests are negligible, presumably because they're still a bit experimental. Is anyone using these and can suggest appropriate scores for these plugins, or are these really just too unripe

Find the Ham: A Prototype Config

2007-02-11 Thread Dan
Yesterday I described an unorthodox approach to email filtering and generated both interest and confusion. Hopefully by describing it further, I can create understanding. Below is my design and at the bottom a question, but first, a summary of points: 1) I created confusion by starting

Re: spamassassin learning method

2007-02-11 Thread John D. Hardin
On Sun, 11 Feb 2007, Rizal Ferdiyan wrote: My smtp proxy server serve many mail server client. My client build many server with their own, so that server contain two mbox format, mailbox or maildir. But i don't have access for that mail server client. That for i want my client forward spam

How to block yahoogroups?

2007-02-11 Thread Firdaus Tjahyadi
Dear All I'm having trouble blok a few yahoogroups milist i want blok this milist [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] but i did'nt want to blok this milist [EMAIL PROTECTED] how to set that rule ? i'v tried setting in badmailfrom but did'nt work cause

RE: How to block yahoogroups?

2007-02-11 Thread Philip Seccombe
Can you blacklist @ returns.groups.yahoo.com and then whitelist [EMAIL PROTECTED] or something? I'm not sure how the yahoo groups work, but is the reply address specific to each group or does it get sent from the person to the group address like this list? Kind Regards, Philip Seccombe

FuzzyOCR mature enough?

2007-02-11 Thread Peter
I have seen a lot of buzz around FuzzyOCR lately but by looking at its web site it shows that the project started only last month. Is this tool really advisable on a serious system? PM

Re: How to block yahoogroups? (fwd)

2007-02-11 Thread Doni Indrawan
You could use e-mail property of yahoogroups mailling such as, List-Unsubscribe, List-Post or Mailing-List. Set score for each property -- Forwarded message -- Subject: RE: How to block yahoogroups? Date: Mon, 12 Feb 2007 16:15:53 +1300 Message-ID: [EMAIL PROTECTED] From:

Re: A New Approach: Find the Ham

2007-02-11 Thread Duncan Findlay
Hey Dan, I've read most of the e-mails on this topic and I think the underlying problem is that this method relies on knowing exactly which profiles (i.e. combinations of rules) valid ham can hit. I see a number of problems: - How do we actually generate the profiles that are to be considered

Re: DKIM / DomainKeys

2007-02-11 Thread Alexis Manning
[EMAIL PROTECTED] says... [...] some mailing list also corrupt signatures, and some people use gmail/yahoo sending address even when posting through some other ISP. Before this practice is rooted out, one should probably not score invalid signature from these two domains too harshly. Thanks

Re: FuzzyOCR mature enough?

2007-02-11 Thread René Berber
Peter wrote: I have seen a lot of buzz around FuzzyOCR lately but by looking at its web site it shows that the project started only last month. Wrong, if you mean the timeline shown that's because the _ticket_system_ is new, the project is at least a year old, the current _website_ is about 6