> -----Original Message----- > From: Theo Van Dinter [mailto:[EMAIL PROTECTED] > Sent: Thursday, August 03, 2006 6:52 PM > To: users@spamassassin.apache.org > Subject: Re: ImageInfo plugin for SA > > On Thu, Aug 03, 2006 at 03:14:06PM -0500, Dallas L. Engelken wrote: > > All those scores in the cf are just "WAGs", since none have been > > masschecked. Theo, could you sandbox this? > > I made some major edits (1/3 smaller and also faster :) ), > but the core algorithm is the same. Overall, very good from > my results: > > MSECS SPAM% HAM% S/O RANK SCORE NAME > 0 29412 4952 0.856 0.00 0.00 (all messages) > 0.00000 85.5896 14.4104 0.856 0.00 0.00 (all messages as %) > 7.162 8.3673 0.0000 1.000 0.93 3.00 DC_GIF_UNO_LARGO > 3.681 4.3010 0.0000 1.000 0.79 3.00 DC_IMAGE_SPAM > 0.576 0.6732 0.0000 1.000 0.24 3.00 DC_PNG_UNO_LARGO > 0.000 0.0000 0.0000 0.500 0.17 4.00 DC_PNG_MULTI_LARGO > 0.000 0.0000 0.0000 0.500 0.17 4.00 DC_GIF_MULTI_LARGO >
Awesome... Thanks for that! But no *_MULTI_LARGO hits??? I have tons of these samples (today even) # grep -c MULTI_LARGO spamd.log 83 They all look similar to this... 2006-08-03 03:46:16.847129500 [20349] dbg: imageinfo: 8 gif attachments found 2006-08-03 03:46:16.852860500 [20349] dbg: imageinfo: check images of type gif 2006-08-03 03:46:16.852938500 [20349] dbg: imageinfo: image catholic.gif is 40 x 512 pixels (20480 pixels sq.) 2006-08-03 03:46:16.853007500 [20349] dbg: imageinfo: image flesh.gif is 254 x 4 pixels (1016 pixels sq.) 2006-08-03 03:46:16.853072500 [20349] dbg: imageinfo: image wetback.gif is 254 x 113 pixels (28702 pixels sq.) 2006-08-03 03:46:16.853138500 [20349] dbg: imageinfo: image humorous.gif is 94 x 626 pixels (58844 pixels sq.) 2006-08-03 03:46:16.853203500 [20349] dbg: imageinfo: image willingly.gif is 40 x 28 pixels (1120 pixels sq.) 2006-08-03 03:46:16.853268500 [20349] dbg: imageinfo: image mostly.gif is 40 x 81 pixels (3240 pixels sq.) 2006-08-03 03:46:16.853336500 [20349] dbg: imageinfo: image hailstone.gif is 254 x 509 pixels (129286 pixels sq.) 2006-08-03 03:46:16.853402500 [20349] dbg: imageinfo: image rat race.gif is 40 x 5 pixels (200 pixels sq.) 2006-08-03 03:46:16.896336500 [20349] info: spamd: identified spam (22.7/5.0) for $global:200 in 2.6 seconds, 50713 bytes. 2006-08-03 03:46:16.896520500 [20349] info: spamd: result: Y 22 - BAYES_50,CM_SLICED_STOCK,EXTRA_MPART_TYPE,GIF_AREA_200K,GIF_ATTACH_5P,GI F_MULTI_LARGO,HELO_DYNAMIC_IPADDR2,HELO_DYNAMIC_SPLIT_IP,HTML_40_50,HTML _IMAGE_ONLY_28,HTML_MESSAGE,RCVD_BY_IP,RCVD_NUMERIC_HELO,SARE_GIF_ATTACH ,SARE_GIF_STOX,URI_HTML_ONLY scantime=2.6,size=50713,user=$global,uid=200,required_score=5.0,rhost=lo calhost,raddr=127.0.0.1,rport=34848,mid=<001d01c34465$4cce9bb8$a38ebedc@ dxnd>,bayes=0.546644226347824,autolearn=unavailable,urihits=none Are you sure the logic is working properly there? D