On 24 Apr 2009 at 22:12 CEST, Igor Chudov wrote:
> I get plenty of these also, and cannot get them to score well. 
> 
> These advertise knockoffs of bestselling Pfizer products. The text is
> meaningless garbage text. The sales message is contained in a PNG
> image, but it could be other image types like jpeg. 
> 
>        http://igor.chudov.com/tmp/spam008.txt
> 
> Any ideas what I can do?

You can install the FuzzyOcr plugin, which runs OCR on image attachments
and scores the message on the words it finds:
<http://wiki.apache.org/spamassassin/FuzzyOcrPlugin>

,----
| X-Spam-Status: Yes, score=19.8 required=5.0 tests=BADRELAY,BAYES_99,FUZZY_OCR,
|       HK_IMGSPAM,HTML_MESSAGE,SAGREY autolearn=no version=3.2.5
| X-Spam-Relay-Country: US TR
| X-Spam-Report: =?ISO-8859-1?Q?
|       *  3.5 BAYES_99 BODY: Bayes spam probability is 99 to 100%
|       *      [score: 1.0000]
|       *  0.3 HTML_MESSAGE BODY: Message contains HTML
|       *  2.5 BADRELAY bad Relay
|       *  2.0 HK_IMGSPAM Inline image in message, Bayes think it's spam
|       *   10 FUZZY_OCR BODY:
|       *  1.0 SAGREY Adds 1.0 to spam from first-time senders
`----

,----[ fuzzyocr.log ]
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "cialis" with fuzz of 0.0000
|                       line: "ur prce viagra  cialis special offer"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "cialis" with fuzz of 0.0000
|                       line: "lgg cialis special offer"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "viagra" with fuzz of 0.0000
|                       line: "ur prce viagra  cialis special offer"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "viagra" with fuzz of 0.1667
|                       line: "l ls lo x vagra loo mg  lo x cals omg"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" found word "viagra" with fuzz of 0.0000
|                       line: " viagra hot offer"
| 2009-04-24 22:30:08 [9756] Scanset "ocrad" generates enough hits (5), skipping further scansets...
| 2009-04-24 22:30:08 [9756] Message is spam, score = 10.500
| 2009-04-24 22:30:08 [9756] Adding Hash to "/home/stefan/.fuzzyocr/FuzzyOcr.hashdb"
| 2009-04-24 22:30:08 [9756] Words found:
|                       "cialis" in 2 lines
|                       "viagra" in 3 lines
|                       (7.5 word occurrences found)
`----
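
To give a feel for the "fuzz" values in the log, here is a minimal Python
sketch. It assumes that fuzz is the edit distance between the word-list
entry and the closest OCR token, divided by the length of the entry. That
assumption reproduces the 0.1667 logged for "vagra" (one missing letter in
a six-letter word), but it is not taken from the plugin source. The names
edit_distance, best_fuzz and THRESHOLD are made up for illustration, and
the threshold value is hypothetical rather than a known FuzzyOcr default.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, free if equal
            ))
        prev = cur
    return prev[-1]


def best_fuzz(word: str, line: str) -> float:
    """Smallest edit distance from `word` to any token of `line`,
    normalised by len(word). Assumed definition, see the note above."""
    tokens = line.lower().split()
    if not tokens:
        return 1.0
    return min(edit_distance(word, t) for t in tokens) / len(word)


# Hypothetical cut-off: a word counts as found when its fuzz is at or below it.
THRESHOLD = 0.25

if __name__ == "__main__":
    line = "l ls lo x vagra loo mg  lo x cals omg"
    for word in ("viagra", "cialis"):
        fuzz = best_fuzz(word, line)
        verdict = "hit" if fuzz <= THRESHOLD else "no hit"
        print(f"{word}: fuzz {fuzz:.4f} ({verdict})")
```

Under these assumptions "viagra" matches that line at 0.1667, while
"cialis" against "cals" comes out at 0.3333 and is rejected. That is
consistent with the log, which reports only "viagra" for that line.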


Greets
Stefan
  
-- 
,-----------------------------------------------------------------------------.
|         Stefan Lütje        |   "The future will be better tomorrow."   |
|  stefan.lue...@t-online.de  |               George W. Bush               |
`----Key fingerprint = BCB2 48E4 9211 C975 5A3F  B192 9B6E CCCF 99CC 44FA-----'
