Re: [Mailman-Users] Chinese characters spam filter?

Yasuhito FUTATSUKI Sat, 09 Jul 2016 08:53:05 -0700

On 07/09/16 10:04, I wrote:
> How about using 'backslashreplace' instead of 'replace' to encode to
> list's preferred language in Mailman/Handlers/SpamDetect.py ?
> 
> Then, desirable pattern in this case seems to be
> 
> ~Subject.*(\\u[0-9a-f]{4}){4}
^Subject:.*(\\u[0-9a-f]{4,5}){4}


(CJK Unified Ideographs Extension uses code point scope >= u+20000)

Of course, you can specify code point scope more pricisely, not
to match other than Chinese characters.
--
Yasuhito FUTATSUKI <[email protected]>
------------------------------------------------------
Mailman-Users mailing list [email protected]
https://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://wiki.list.org/x/AgA3
Security Policy: http://wiki.list.org/x/QIA9
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
https://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Re: [Mailman-Users] Chinese characters spam filter?

Reply via email to