On 08/23/2014 11:26 AM, Eric Shubert wrote:
It appears that these spams are using random text that's hidden inside
of html in order to beat the bayes filter. At least that's my guess.
I'm guessing that if we write a filter/editor that strips out all
unviewable text from html content in a message before sending it to
sa-learn, the bayes filter will be effective once again.
Thoughts on this? Anyone know of a filter we can pipe messages through
on their way to sa-learn?
It looks as though search engines also consider hidden text to be spam.
http://www.seologic.com/faq/hidden-text
--
-Eric 'shubes'
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]