Pawe³ Têcza wrote on Thu, 16 Aug 2007 14:28:05 +0200:

> 1. try to speed up my MySQL server
> 2. decrease a number of unique tokens for "punctuation spam"
> 
> The first of them is a task for me, of course.

Well, it's nevertheless a good question for this list as others may have 
the same problem. I just wanted to stress that your problem is not 
detection as others seem to have overlooked this and give hints on better 
detection. But you are detecting just fine.

But the second
> is rather Spamassassin's job.
> 
> I'm thinking whether it's really necessary to keep *all* tokens
> for that kind of spam...  Maybe Spamassassin could save only
> some part of them?  What's your opinion about it?

I really don't know enough about Bayes and SA to say much about it. I 
think it would be difficult for SA to determine what are "good" and "bad" 
tokens.

Kai

-- 
Kai Schätzl, Berlin, Germany
Get your web at Conactive Internet Services: http://www.conactive.com



Reply via email to