Yes image spam can be a real pain. I have just implemented a new mailserver and 
image spam is certainly on the increase :-

mysql> select count(*) from maillog;
+----------+
| count(*) |
+----------+
|    15091 | 
+----------+
1 row in set (0.00 sec)

mysql> select count(*) from maillog where spamreport like '%FUZZY_OCR%';
+----------+
| count(*) |
+----------+
|     3438 | 
+----------+
1 row in set (0.04 sec)

mysql> select count(*) from maillog where spamreport like 
'%FUZZY_OCR_KNOWN_HASH%';
+----------+
| count(*) |
+----------+
|     1070 | 
+----------+
1 row in set (0.04 sec)


On Fri, 23 Mar 2007 06:46:50 -0700, Marc Perkel <[EMAIL PROTECTED]> wrote:
> Perhaps what I need to do is to get rid of autolearn and write my own
> learning system that strips out the body of messages with images and
> just learns the headers. My problem is that when users get image spam
> they put it in the spam folders and they get learned. But the text in
> the image spam causes ham type text to be learned as spam. That causes
> ham to get higher scores.
> 
> --
> This message has been scanned for viruses and dangerous content by
> MailScanner, and is
> believed to be clean.
-- 
--[ UxBoD ]--
// PGP Key: "curl -s http://www.splatnix.net/uxbod.asc | gpg --import"
// Fingerprint: 543A E778 7F2D 98F1 3E50 9C1F F190 93E0 E8E8 0CF8
// Keyserver: www.keyserver.net Key-ID: 0xE8E80CF8
// SIP Phone: [EMAIL PROTECTED]


-- 
This message has been scanned for viruses and dangerous content by MailScanner, 
and is
believed to be clean.

Reply via email to