> -----Original Message-----
> From: Mark
> Sent: Friday, August 01, 2003 8:34 AM
[...]
> 
> That is the good news. :) The bad news is, that the true 
> background color, or
> I should say, background appearance, is almost impossible to determine.
> Consider table colors, <td> colors, etc. Not to mention that white,
> stretched gif used for background color. And that is just 'old' 
> style HTML.
> :)
> 
> Hence I gave my rules a low score. But still, you might find them useful.

Clearly, Spamassassin should invoke a browser to render the html, convert
that to a graphic file format, and then run an OCR algorithm on the result.
Then SA can run it's body filters on the extracted text.

What's the problem? <g>




-------------------------------------------------------
This SF.Net email sponsored by: Free pre-built ASP.NET sites including
Data Reports, E-commerce, Portals, and Forums are available now.
Download today and enter to win an XBOX or Visual Studio .NET.
http://aspnet.click-url.com/go/psa00100003ave/direct;at.aspnet_072303_01/01
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to