On Wed, 7 Nov 2018 12:33:35 -0700
Amir Caspi wrote:

> In many cases, it would appear that these spams have either very
> little (real) text (besides the usual attempt at Bayes poisoning)
> and/or are using HTML-entity encoding to try to bypass Bayes. Here
> are a couple of spamples:
>
> https://pastebin.com/peiXZivJ
> https://pastebin.com/3h3r7r7j
>
> Does SA decode HTML entities as part of normalize_charset? If
> not ... can this be added?

Ordinarily yes, but these messages don't actually have a separate HTML part because the MIME structure is broken: the boundary separators don't match. Presumably the raw HTML is being treated as plain text.
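
For anyone who wants to see the effect in isolation, here is a rough sketch using Python's standard email parser, not SpamAssassin's own MIME code, so it only illustrates the general failure mode. It assumes the breakage is a boundary declared in the Content-Type header that differs from the separators used in the body. The build() helper, the AAA/BBB boundary strings and the sample body are all made up for the illustration.

```python
from email import message_from_string
from email.errors import StartBoundaryNotFoundDefect
from html import unescape


def build(declared, used):
    """Hypothetical helper: header declares one boundary, body uses another."""
    return (
        "From: sender@example.com\n"
        "MIME-Version: 1.0\n"
        f'Content-Type: multipart/alternative; boundary="{declared}"\n'
        "\n"
        f"--{used}\n"
        "Content-Type: text/plain\n"
        "\n"
        "filler text\n"
        f"--{used}\n"
        "Content-Type: text/html\n"
        "\n"
        "<html><body>&#72;&#101;llo</body></html>\n"
        f"--{used}--\n"
    )


# Well-formed: declared boundary and separators agree.
good = message_from_string(build("AAA", "AAA"))
# Broken: separators in the body do not match the declared boundary.
bad = message_from_string(build("AAA", "BBB"))

# The well-formed message splits into a text/plain and a text/html part.
good_types = [part.get_content_type() for part in good.get_payload()]

# The broken message never splits: the parser keeps the whole body as one
# string, HTML tags and entities included, and records a defect.
bad_body = bad.get_payload()
bad_has_defect = any(
    isinstance(d, StartBoundaryNotFoundDefect) for d in bad.defects
)

print(good.is_multipart(), good_types)
print(bad.is_multipart(), bad_has_defect)
# Entities stay encoded unless something decodes them explicitly.
print("&#72;" in bad_body, "Hello" in unescape(bad_body))
```

Whether SA ends up with exactly the same result for the pastebin samples is not something this shows; it only demonstrates why a parser that cannot find the declared boundary has no HTML part to hand to an HTML decoder.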