Re: [sa] Re: ruleset for German Bettchen and Schlafzimmer spam

2010-03-16 Thread Henrik K
On Mon, Mar 15, 2010 at 11:17:09PM +0100, Karsten Bräckelmann wrote:
 On Mon, 2010-03-15 at 11:15 -0400, Charles Gregory wrote:
  H. I guess this goes back to my inquiry about the Brazilian spam
  
  I'm still looking for a way (hopefully) to simply identify the *language* 
  of the mail (when not determined from CHARSET_FARAWAY rules), so that our 
  users may opt-in for additional filtering based on language
 
 The TextCat plugin. Even part of stock SA, though not enabled by
 default. Supports per-user settings.

Though given the current bugs, I really would do any hard filtering..



Re: [sa] Re: ruleset for German Bettchen and Schlafzimmer spam

2010-03-15 Thread Charles Gregory

On Sun, 14 Mar 2010, Jörg Frings-Fürst wrote:

take a look at http://wiki.apache.org/spamassassin/CustomRulesets
and search to German Language Ruleset.


H. I guess this goes back to my inquiry about the Brazilian spam

I'm still looking for a way (hopefully) to simply identify the *language* 
of the mail (when not determined from CHARSET_FARAWAY rules), so that our 
users may opt-in for additional filtering based on language


- Charles

Re: [sa] Re: ruleset for German Bettchen and Schlafzimmer spam

2010-03-15 Thread Karsten Bräckelmann
On Mon, 2010-03-15 at 11:15 -0400, Charles Gregory wrote:
 H. I guess this goes back to my inquiry about the Brazilian spam
 
 I'm still looking for a way (hopefully) to simply identify the *language* 
 of the mail (when not determined from CHARSET_FARAWAY rules), so that our 
 users may opt-in for additional filtering based on language

The TextCat plugin. Even part of stock SA, though not enabled by
default. Supports per-user settings.

But you just forked (to avoid the word hijacked) this thread, which is
about a very specific, on-going spam run. The OP really doesn't want to
identify German spam for scoring, cause that's likely his first
language. ;)


-- 
char *t=\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4;
main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;il;i++){ i%8? c=1:
(c=*++x); c128  (s+=h); if (!(h=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}