> -----Original Message-----
> From: Mark
> Sent: Friday, August 01, 2003 8:34 AM
[...]
>
> That is the good news. :) The bad news is, that the true background
> color, or I should say, background appearance, is almost impossible
> to determine. Consider table colors, <td> colors, etc. Not to mention
> that white, stretched gif used for background color. And that is just
> 'old' style HTML. :)
>
> Hence I gave my rules a low score. But still, you might find them useful.
Clearly, SpamAssassin should invoke a browser to render the HTML, convert the result to a graphic file format, and then run an OCR algorithm on that image. Then SA can run its body filters on the extracted text. What's the problem? <g>

_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk