Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread Andy Jezierski
Amir Caspi ceph...@3phase.com wrote on 01/29/2014 11:08:18 AM: From: Amir Caspi ceph...@3phase.com To: Andy Jezierski ajezier...@stepan.com, Cc: users@spamassassin.apache.org users@spamassassin.apache.org Date: 01/29/2014 11:08 AM Subject: Re: Help with a regex to catch spam with gibberish

Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread Kevin A. McGrail
On 1/30/2014 11:23 AM, Andy Jezierski wrote: Amir Caspi ceph...@3phase.com wrote on 01/29/2014 11:08:18 AM: From: Amir Caspi ceph...@3phase.com To: Andy Jezierski ajezier...@stepan.com, Cc: users@spamassassin.apache.org users@spamassassin.apache.org Date: 01/29/2014 11:08 AM Subject: Re:

Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread Amir Caspi
On Jan 30, 2014, at 10:28 AM, Kevin A. McGrail kmcgr...@pccc.com wrote: If you want to share the complete rule, I can throw it into my sandbox and see what masscheck thinks as well. The complete rule would be something like this, assuming Andy implemented it as I wrote it: rawbody

Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread Andy Jezierski
Amir Caspi ceph...@3phase.com wrote on 01/30/2014 11:39:51 AM: From: Amir Caspi ceph...@3phase.com To: Kevin A. McGrail kmcgr...@pccc.com, Cc: Andy Jezierski ajezier...@stepan.com, users@spamassassin.apache.org users@spamassassin.apache.org Date: 01/30/2014 11:40 AM Subject: Re: Help with

Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread John Hardin
On Thu, 30 Jan 2014, Amir Caspi wrote: On Jan 30, 2014, at 10:28 AM, Kevin A. McGrail kmcgr...@pccc.com wrote: If you want to share the complete rule, I can throw it into my sandbox and see what masscheck thinks as well. The complete rule would be something like this, assuming Andy

Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread Amir Caspi
On Jan 30, 2014, at 11:25 AM, John Hardin jhar...@impsec.org wrote: I'd suggest writing it as a subrule first, to see how well it performs against the masscheck corpora. If it does well by itself (good hits, high S/O), then a meta can be added to expose it for scoring. If it hits a lot but

Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread Kevin A. McGrail
On 1/30/2014 1:25 PM, John Hardin wrote: On Thu, 30 Jan 2014, Amir Caspi wrote: On Jan 30, 2014, at 10:28 AM, Kevin A. McGrail kmcgr...@pccc.com wrote: If you want to share the complete rule, I can throw it into my sandbox and see what masscheck thinks as well. The complete rule would be

Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread John Hardin
On Thu, 30 Jan 2014, Kevin A. McGrail wrote: On 1/30/2014 1:25 PM, John Hardin wrote: I'd suggest writing it as a subrule first, to see how well it performs against the masscheck corpora. If it does well by itself (good hits, high S/O), then a meta can be added to expose it for scoring. If

Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread Kevin A. McGrail
On 1/30/2014 12:39 PM, Amir Caspi wrote: On Jan 30, 2014, at 10:28 AM, Kevin A. McGrail kmcgr...@pccc.com mailto:kmcgr...@pccc.com wrote: If you want to share the complete rule, I can throw it into my sandbox and see what masscheck thinks as well. The complete rule would be something like

Re: Help with a regex to catch spam with gibberish html tags

2014-01-30 Thread David B Funk
On Thu, 30 Jan 2014, Amir Caspi wrote: On Jan 30, 2014, at 10:28 AM, Kevin A. McGrail kmcgr...@pccc.com wrote: If you want to share the complete rule, I can throw it into my sandbox and see what masscheck thinks as well. The complete rule would be something like this, assuming Andy