On Wed, Aug 04, 2010 at 01:23:32AM -0700, Happy Chap wrote:
>
> Hi,
>
> We started getting (over the last 2 months say) lots of spam, which
> Spamassassin isn't picking up as spam. Analysing these, they all seem to be
> of the same type where many paragraphs of random text are "hidden" inside an
> HTML comment (either contained in <!-- --> or inbetween /* and */ "tags").
>
> Because of this "hidden" text, these messages are triggering BAYES_00 which,
> I think, is the major influence on them not being correctly identified by
> Spamassassin as spam.

Instead of speculating, try:

  cat msg | spamassassin -t -D bayes 2>&1 | grep bayes:

It will tell you exactly what tokens are considered.
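
If the debug output is long, it may help to sort the token lines by probability so the tokens pulling the score towards ham show up first. What follows is only a rough sketch: bayes_tokens is a made-up helper name, and it assumes the debug lines look like "bayes: token 'TOKEN' => PROBABILITY", which may differ between SpamAssassin versions.

```shell
# Hypothetical helper (not part of SpamAssassin): reads debug output on
# stdin and prints the Bayes token lines, lowest probability first.
# Assumption: token lines have the form
#   ... bayes: token 'TOKEN' => PROBABILITY
bayes_tokens() {
    grep "bayes: token" |
        # Move the probability to the front so sort can order on it.
        awk -F' => ' '{ print $2 "\t" $1 }' |
        sort -n
}
```

It would be used in place of the grep, for example:

  cat msg | spamassassin -t -D bayes 2>&1 | bayes_tokens | head -20

If the hidden comment text really is being tokenised, its words should turn up near the top of that list with low probabilities.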