Re: Stock spam in images

2006-10-04 Thread Jason Haar
I'm having marvelous luck with FuzzyOCR - but the spammers are learning too. When I first started using it just a couple of months ago, it really whacked the image-based spam. You could see why when gocr file.gif returned nice text that was easy to match against. However, now is a different
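The `gocr file.gif` check mentioned in this post can be scripted. What follows is only a minimal sketch in Python, assuming the `gocr` binary is on the PATH and prints the recognized text to standard output. The helper name `ocr_text` and its `command` parameter are hypothetical and are not part of FuzzyOCR or SpamAssassin.

```python
import shutil
import subprocess
from typing import Optional


def ocr_text(image_path: str, command: str = "gocr") -> Optional[str]:
    """Run an OCR command on an image and return whatever text it prints.

    Hypothetical helper: returns None when the command is not installed,
    times out, or exits with a non-zero status.
    """
    # Bail out early if the OCR binary cannot be found on the PATH.
    if shutil.which(command) is None:
        return None
    try:
        result = subprocess.run(
            [command, image_path],
            capture_output=True,
            text=True,
            errors="replace",  # OCR output may contain undecodable bytes
            timeout=30,
            check=False,
        )
    except (OSError, subprocess.TimeoutExpired):
        return None
    if result.returncode != 0:
        return None
    return result.stdout
```

Calling `ocr_text("file.gif")` on a few saved samples is a quick way to see whether the output is still clean enough to match against, which is the concern raised above.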

RE: Stock spam in images

2006-10-04 Thread Chris Santerre
Greetings list, The old timers on the list know I tend to try things outside the norm. Like my strong resistance to sitewide bayes. Well for months I've been using a simpler approach to these Stock Spams w/ images. I don't look at the image at all. Heresy I

Re: Stock spam in images

2006-10-04 Thread Jorge Valdes
Jason Haar wrote: I'm having marvelous luck with FuzzyOCR - but the spammers are learning too. When I first started using it just a couple of months ago, it really whacked the image-based spam. You could see why when gocr file.gif returned nice text that was easy to match against. However, now

RE: Stock spam in images

2006-10-03 Thread Balzi Andrea
For Debian users, I've found the following link, a step-by-step guide to implementing FuzzyOCR and ImageInfo with SpamAssassin. http://www200.pair.com/mecham/spam/image_spam.html Andrea

Stock spam in images

2006-10-02 Thread Dylan Bouterse
I'm a newbie to the list and have been scanning recent posts to see if what I'm about to ask about has been covered but I haven't seen anything yet. Lately I have been getting more and more of the stock alert spam but now all the good info is in an image and typically following the image is

RE: Stock spam in images

2006-10-02 Thread Fabien GARZIANO
@spamassassin.apache.org Subject: Stock spam in images I'm a newbie to the list and have been scanning recent posts to see if what I'm about to ask about has been covered but I haven't seen anything yet. Lately I have been getting more and more of the stock alert spam but now all the good info is in an image

RE: Stock spam in images

2006-10-02 Thread Bowie Bailey
Dylan Bouterse wrote: I'm a newbie to the list and have been scanning recent posts to see if what I'm about to ask about has been covered but I haven't seen anything yet. Lately I have been getting more and more of the stock alert spam but now all the good info is in an image and typically

R: Stock spam in images

2006-10-02 Thread Giampaolo Tomassoni
I'm a newbie to the list and have been scanning recent posts to see if what I'm about to ask about has been covered but I haven't seen anything yet. Lately I have been getting more and more of the stock alert spam but now all the good info is in an image and typically following the image is

RE: Stock spam in images

2006-10-02 Thread Dylan Bouterse
-Original Message- From: Bowie Bailey [mailto:[EMAIL PROTECTED] Sent: Monday, October 02, 2006 9:46 AM To: users@spamassassin.apache.org Subject: RE: Stock spam in images Dylan Bouterse wrote: I'm a newbie to the list and have been scanning recent posts to see if what I'm about to ask

R: Stock spam in images

2006-10-02 Thread Giampaolo Tomassoni
...omitted... How about the FuzzyOCR plugin? That has been discussed quite a bit here recently. http://wiki.apache.org/spamassassin/FuzzyOcrPlugin -- Bowie And, by the way, it seems to work! Actually, the only limit I see is the own-made FuzzyOcr.words (and, maybe, the fact that

RE: Stock spam in images

2006-10-02 Thread Randal, Phil
October 2006 14:38 To: users@spamassassin.apache.org Subject: Stock spam in images I'm a newbie to the list and have been scanning recent posts to see if what I'm about to ask about has been covered but I haven't seen anything yet. Lately I have been getting more and more of the stock alert

RE: Stock spam in images

2006-10-02 Thread Randal, Phil
Giampaolo Tomassoni wrote: And, by the way, it seems to work! Actually, the only limit I see is the own-made FuzzyOcr.words (and, maybe, the fact that script text may probably get undetected). Wouldn't it be better to inject the detected text back to SA? There should be enough variants

Re: Stock spam in images

2006-10-02 Thread Theo Van Dinter
On Mon, Oct 02, 2006 at 03:18:58PM +0100, Randal, Phil wrote: undetected). Wouldn't it be better to inject the detected text back to SA? There should be enough variants of spam worlds to let SA fuzzily catch the ones from images. I think so. Some of the words would be perfectly

R: Stock spam in images

2006-10-02 Thread Giampaolo Tomassoni
On Mon, Oct 02, 2006 at 03:18:58PM +0100, Randal, Phil wrote: undetected). Wouldn't it be better to inject the detected text back to SA? There should be enough variants of spam worlds to let SA fuzzily catch the ones from images. I think so. Some of the words would be perfectly

RE: Stock spam in images

2006-10-02 Thread Fabien GARZIANO
to sleep -Original Message- From: Randal, Phil [mailto:[EMAIL PROTECTED] Sent: Monday, 2 October 2006 16:19 To: users@spamassassin.apache.org Subject: RE: Stock spam in images Giampaolo Tomassoni wrote: And, by the way, it seems to work! Actually, the only limit I see is the own

Re: Stock spam in images

2006-10-02 Thread Stuart Johnston
Theo Van Dinter wrote: On Mon, Oct 02, 2006 at 03:18:58PM +0100, Randal, Phil wrote: undetected). Wouldn't it be better to inject the detected text back to SA? There should be enough variants of spam worlds to let SA fuzzily catch the ones from images. I think so. Some of the words would be

Re: Stock spam in images

2006-10-02 Thread Andreas Pettersson
Stuart Johnston wrote: Theo Van Dinter wrote: On Mon, Oct 02, 2006 at 03:18:58PM +0100, Randal, Phil wrote: undetected). Wouldn't it be better to inject the detected text back to SA? There should be enough variants of spam worlds to let SA fuzzily catch the ones from images. I think so.

RE: Stock spam in images

2006-10-02 Thread Randal, Phil
Engineer Herefordshire Council Hereford, UK -Original Message- From: Fabien GARZIANO [mailto:[EMAIL PROTECTED] Sent: 02 October 2006 16:11 To: users@spamassassin.apache.org Subject: RE: Stock spam in images Too bad, cause I agree with Giampaolo, it would be great. What about

RE: Stock spam in images

2006-10-02 Thread Brent Kennedy
. :) -Original Message- From: Dylan Bouterse [mailto:[EMAIL PROTECTED] Sent: Monday, October 02, 2006 9:38 AM To: users@spamassassin.apache.org Subject: Stock spam in images I'm a newbie to the list and have been scanning recent posts to see if what I'm about to ask about has been

R: Stock spam in images

2006-10-02 Thread Giampaolo Tomassoni
The real problem is the potentially fuzzy output from the OCR engine: sure, all the copies of the very same spam would be detected the same, but what about slightly different copies? Would the use of the sa force approach be feasible? The use of String::Approx in FuzzyOCR surely has a meaning,
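The concern about slightly different copies is the case approximate matching is meant for. The sketch below illustrates the idea in Python using the standard library's `difflib`. It is not String::Approx and not FuzzyOCR's actual code; the function name `fuzzy_hits`, the sample word list, and the 0.75 threshold are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher
from typing import Iterable, List


def fuzzy_hits(ocr_output: str, words: Iterable[str], threshold: float = 0.75) -> List[str]:
    """Return the words that approximately occur in the OCR text.

    Each word is compared against every whitespace-separated token of
    the OCR output; a word counts as a hit when its best similarity
    ratio reaches the (assumed) threshold.
    """
    tokens = [tok.lower() for tok in ocr_output.split()]
    hits = []
    for word in words:
        target = word.lower()
        # Best similarity between this word and any OCR token (0.0 if no tokens).
        best = max(
            (SequenceMatcher(None, target, tok).ratio() for tok in tokens),
            default=0.0,
        )
        if best >= threshold:
            hits.append(word)
    return hits


if __name__ == "__main__":
    # Simulated noisy OCR output, with digits misread in place of letters.
    print(fuzzy_hits("st0ck a1ert buy n0w", ["stock", "alert", "viagra"]))
```

A lower threshold tolerates more OCR noise but raises the chance of matching unrelated words, so the value would need tuning against real samples.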

R: Stock spam in images

2006-10-02 Thread Giampaolo Tomassoni
Randal Network Engineer Herefordshire Council Hereford, UK -Original Message- From: Fabien GARZIANO [mailto:[EMAIL PROTECTED] Sent: 02 October 2006 16:11 To: users@spamassassin.apache.org Subject: RE: Stock spam in images Too bad, cause I agree with Giampaolo

RE: Stock spam in images

2006-10-02 Thread Bret Miller
...omitted... How about the FuzzyOCR plugin? That has been discussed quite a bit here recently. http://wiki.apache.org/spamassassin/FuzzyOcrPlugin -- Bowie And, by the way, it seems to work! Actually, the only limit I see is the own-made FuzzyOcr.words (and, maybe, the fact

Re: Stock spam in images

2006-10-02 Thread decoder
Theo Van Dinter wrote: On Mon, Oct 02, 2006 at 03:18:58PM +0100, Randal, Phil wrote: undetected). Wouldn't it be better to inject the detected text back to SA? There should be enough variants of spam worlds to let SA fuzzily catch the ones from

Re: Stock spam in images

2006-10-02 Thread decoder
rulesets gets most of them. Cheers, Phil -- Phil Randal Network Engineer Herefordshire Council Hereford, UK -Original Message- From: Dylan Bouterse [mailto:[EMAIL PROTECTED] Sent: 02 October 2006 14:38 To: users@spamassassin.apache.org Subject: Stock spam in images I'm a newbie

Re: Stock spam in images

2006-10-02 Thread Theo Van Dinter
On Mon, Oct 02, 2006 at 11:05:38AM -0500, Stuart Johnston wrote: Would it also be possible to create a rule that matches on text rendered specifically from a non-text part and not the whole body? That way you You'd have to do that in a plugin, but otherwise, sure. There's currently no

RE: Stock spam in images

2006-10-02 Thread Chan, Wilson
-Original Message- From: Randal, Phil [mailto:[EMAIL PROTECTED] Sent: Monday, October 02, 2006 3:58 AM To: Dylan Bouterse; users@spamassassin.apache.org Subject: RE: Stock spam in images This has been covered so many times on this list. 1: if you're not on spamassassin 3.1.5

RE: Stock spam in images

2006-10-02 Thread Benny Pedersen
On Tue, October 3, 2006 00:01, Gary V wrote: For installing the ImageInfo plugin where do you put the ImageInfo.pm without defining a path? I'm running CentOS 4.4 and Fedora Core 5 as test machines. This should find your Plugin directory (which is where you place it): find /usr -type d -name