Johnson, S wrote: > Has anyone attempted to write an OCR filter (optical character > recognition) for jpg or gif files that contain spam words? >
Not that I'm aware of, but it's been mentioned MANY times. Really I think it largely boils down to being more CPU load than it's worth. Even the best OCR's are unreliable, and easily evaded if the sender is trying to confuse it. All adding OCR would do would cause Spammers with image-based spams to start using strange fonts which are hard to OCR but easy to read. Besides, image based spams aren't really much of a problem, at least not here. Most web-linked-image based spams are picked up quickly by SURBL and Razor's e8 hash. Most embedded-image based spams are quickly picked up by razor's e4 hash, dcc, and/or pyzor. Many also contain web links to the site they advertise and get hit by SURBL, etc too. *shrug*