On Sat, 2006-10-28 at 16:21 -0700, Dennis Peterson wrote:
> Bill Randle wrote:
> > On Sat, 2006-10-28 at 16:54 -0400, Kris Deugau wrote:
> >>
> >> However, in the long run, OCR to feed the text to SpamAssassin's other
> >> rules is a better solution; it's much more flexible.
> >
> > Indeed. For those interested in the topic of OCR to feed SpamAssassin,
> > there's an active project with its own mailing list that does just this.
> > It turns out to be a non-trivial task because many of these image spam
> > are animated gifs, so you need to find the right frame to pass to the
> > OCR program.
> >
> > Start here: http://wiki.apache.org/spamassassin/FuzzyOcrPlugin then
> > subscribe to the Devel-Spam mailing list (there's a link on that page).
>
>
> You might want to consider the next level of image spam before you go
> too far down the OCR path:
>
> http://www.iss.net/threats/Animated%20GIF.html
Actually, the FuzzyOCR plugin already handles animated gifs using
various techniques to extract the hidden text. It also is able to
decode png and jpeg files.
-Bill
_______________________________________________
http://lurker.clamav.net/list/clamav-users.html