Hello Sandy,

Tuesday, August 3, 2004, 6:43:13 AM, you wrote:

SS> Tom  -
SS> I've seen these too, and have put in the following rules to try and catch
SS> them.  They've helped, but a few are still slipping through.  I think if you
SS> raise the scores some they'd catch a lot more, but I need to be a little
SS> more sure I won't get FPs before I do this!

Sandy, results of my mass-check here:

Section 3 -- Frequencies Log
(First numeric frequencies, followed by percentage frequencies)

OVERALL%   SPAM%     HAM%     S/O          SCORE  NAME
  58315    33581    24734    0.576   0.00   0.00  (all messages)
    668      462      206    0.623   1.00   0.60  ODD_CHAR_COMMA_BA
    287      249       38    0.828   0.89   0.60  ODD_CHAR_CARET_BA
    557      261      296    0.394   0.78   0.60  ODD_CHAR_DOT_BA
     75       44       31    0.511   0.44   0.60  ODD_CHAR_TIC1_BA
    887      197      690    0.174   0.33   0.60  ODD_CHAR_UNDERSCORE_BA
   6363      240     6123    0.028   0.11   0.60  ODD_CHAR_TILDE_BA
   5988      194     5794    0.024   0.00   0.60  ODD_CHAR_DASH_BA
   1151      183      968    0.122   0.00   0.60  ODD_CHAR_TIC2_BA

OVERALL%   SPAM%     HAM%     S/O    RANK   SCORE  NAME
  58315    33581    24734    0.576   0.00    0.00  (all messages)
100.000  57.5855  42.4145    0.576   0.00    0.00  (all messages as %)
  1.146   1.3758   0.8329    0.623   1.00    0.60  ODD_CHAR_COMMA_BA
  0.492   0.7415   0.1536    0.828   0.89    0.60  ODD_CHAR_CARET_BA
  0.955   0.7772   1.1967    0.394   0.78    0.60  ODD_CHAR_DOT_BA
  0.129   0.1310   0.1253    0.511   0.44    0.60  ODD_CHAR_TIC1_BA
  1.521   0.5866   2.7897    0.174   0.33    0.60  ODD_CHAR_UNDERSCORE_BA
 10.911   0.7147  24.7554    0.028   0.11    0.60  ODD_CHAR_TILDE_BA
 10.268   0.5777  23.4252    0.024   0.00    0.60  ODD_CHAR_DASH_BA
  1.974   0.5450   3.9136    0.122   0.00    0.60  ODD_CHAR_TIC2_BA

All these rules hit significant ham.  Indeed, UNDERSCORE_BA, DOT_BA,
TILDE_BA, DASH_BA, and TIC2_BA hit more ham than spam in my corpus.

Bob Menschel


SS> body ODD_CHAR_DASH_BA /\-[,\~\'\`\_\.\^]/
SS> describe ODD_CHAR_DASH_BA Odd combo of special chars
SS> score ODD_CHAR_DASH_BA .6

SS> body ODD_CHAR_CARET_BA /\^[\-,\~\'\`\_\.\^]/
SS> describe ODD_CHAR_CARET_BA Odd combo of special chars
SS> score ODD_CHAR_CARET_BA .6

SS> body ODD_CHAR_DOT_BA /\.[\^\~\'\`\_]/
SS> describe ODD_CHAR_DOT_BA Odd combo of special chars
SS> score ODD_CHAR_DOT_BA .6

SS> body ODD_CHAR_UNDERSCORE_BA /\_[\.\^\-,\~\'\`]/
SS> describe ODD_CHAR_UNDERSCORE_BA Odd combo of special chars
SS> score ODD_CHAR_UNDERSCORE_BA .6

SS> body ODD_CHAR_TIC1_BA /\`[\_\.\^\-,\~\']/
SS> describe ODD_CHAR_TIC1_BA Odd combo of special chars
SS> score ODD_CHAR_TIC1_BA .6

SS> body ODD_CHAR_TIC2_BA /\'[\`\_\.\^\-,\~]/
SS> describe ODD_CHAR_TIC2_BA Odd combo of special chars
SS> score ODD_CHAR_TIC2_BA .6

SS> body ODD_CHAR_TILDE_BA /\~[\'\`\_\.\^\-,\~]/
SS> describe ODD_CHAR_TILDE_BA Odd combo of special chars
SS> score ODD_CHAR_TILDE_BA .6

SS> body ODD_CHAR_COMMA_BA /,[\~\'\`\_\.\^\-]/
SS> describe ODD_CHAR_COMMA_BA Odd combo of special chars
SS> score ODD_CHAR_COMMA_BA .6

SS> Sandy
SS> ----- Original Message ----- 
SS> From: "Thomas Kinghorn" <[EMAIL PROTECTED]>
SS> To: <[EMAIL PROTECTED]>
SS> Sent: Tuesday, August 03, 2004 5:38 AM
SS> Subject: more are more junk getting through


>> Hi List
>>
>> I have had a dramatic increase in junk mail,
>>
>> I am using SA 2.63 with SURBL.
>> MTA is exim 4.34 using sa-exim-4
>>
>> I have attached a sample.
>>
>> Subjects and originating IP's are always different
>> But the formatting of the body is similar.
>>
>> Any ideas would be appreciated.
>>
>>  <<important increased muscle mass without exercise.msg>>
>>
>> Regards,
>>
>> Tom Kinghorn
>>
>>





-- 
Best regards,
 Robert                            mailto:[EMAIL PROTECTED]


Reply via email to