Hi,

I'm using SpamAssassin 4.0.

Is there a simple way to give a penalty to messages containing non latin
UTF-8 characters?

I'm asking because we are receiving quite a lot of Chinese junk mail
with subjects in Chinese (or more generally non-latin) characters, but:

- The body is too short for 'ok_languages' to detect and discard the
unwanted language.

- The charset is UTF-8, and therefore 'ok_locales en' doesn't mind.

- I shouldn't blacklist domains such as @163.com (a major source of
spam) because there is legitimate traffic coming from this domain, for
example e-mails sent to the LKML, which most of us subscribe to.

I'm seeing fairly elaborate solutions on the net, but it surprises me
that an apparently simple problem doesn't have a simple solution yet.

Thank you in advance for your insights.

Cheers,

Michael.


-- 
Michael Opdenacker, CEO, Free Electrons
Embedded Linux, Kernel and Android engineering
http://free-electrons.com
+33 484 258 098

Reply via email to