Unicode will help the regular expression:
\p{Hebrew} for example, will detect a singe hebrew character Thomas Von: K Post <nntp.p...@gmail.com> An: ASSP development mailing list <assp-test@lists.sourceforge.net> Datum: 08.01.2016 17:36 Betreff: [Assp-test] Multiple languages / charactersets One of our users is bilingual and apparently switches between English and Hebrew character set frequently This is the only user like this that I know of. I worry about not having enough HM / Bayes statistics on hebrew and I'd hate to reduce the amount of English data if we were to start gathering Hebrew for this one user. Is there a way to tell ASSP to essentially redlist and no-process mail send with the hebrew characters in the email (even if it's a mixed message)? That way his english emails would enjoy the same antispam protection as the other users. If not, I guess I need to redlist and no-process (or no collect, bayes lover, hm lover) his email address if I want to keep hebrew out? Is there a better way of thinking about this? Thanks Ken ------------------------------------------------------------------------------ _______________________________________________ Assp-test mailing list Assp-test@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/assp-test DISCLAIMER: ******************************************************* This email and any files transmitted with it may be confidential, legally privileged and protected in law and are intended solely for the use of the individual to whom it is addressed. This email was multiple times scanned for viruses. There should be no known virus in this email! *******************************************************
------------------------------------------------------------------------------
_______________________________________________ Assp-test mailing list Assp-test@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/assp-test