On 02.03.2012 00:57, Vlad Sedov wrote: > >>> I was thinking about using maildrop to deliver tagged mail to the >>> customers' SPAM folders, and let them use the spam/not spam buttons in >>> webmail to correct false positives and missed spams. This is basically >>> how I'm doing it now with qmail. >> I would advice against this. Maildrop would add an additional dependency >> for nothing. Rather than using maildrop I would suggest you to look at >> sieve. And if you like buttons then why not taking it to the next level >> and use something like the dovecot antispam plugin. That plugin allows >> you to offer the same functionality without the need of buttons. You can >> do all of that just by dragging and dropping messages from/out of the >> spam folder. > > Wow, I was not aware that you could do that with dovecot! So, if I > understand this right, you can re-train messages by just moving them > from one folder to another via any IMAP client? > Yes and more. Read here -> http://johannes.sipsolutions.net/Projects/dovecot-antispam
>>> I'm still a bit in the dark about the tokenizers. I've read several >>> dspam HOW-TOs, and it seems everyone has their preference, but they >>> don't explain why. >> This is often a problem with how-to's. They help you to get something >> working but they lack the profound knowledge and explanation why to do >> something. Anyway... if you want to understand tokenizers than have a >> look here -> >> http://sourceforge.net/apps/mediawiki/dspam/index.php?title=Tokenizers >> >> This should help you understand what each tokenizer is producing. >> > > Ok, this explains a lot!I was using the osb tokenizer with TEFT mode > for everyone.. This probably explains the huge database. > Ohhh boy. Don't use TEFT. It is okay but if you really are thinking long term and want very good accuracy then go with TOE. TEFT will deliver good result from the beginning while TOE will use slightly longer (in the beginning) to get to very good results. But in the long term TOE will outperform TEFT. > It's hard to guesstimate the amount of mail coming in.. If i had to > guess, we get 1-3 new SMTP connections per second, sometimes more > during busy hours. This is after all the RBL checks. I think I will > experiment with a few tokenizers and see which one will yield the best > accuracy with the least amount of tokens. > If you want then send me your dspam.conf and I will look over it and send you my suggestions. I can do the same for your main.cf/master.cf. > > Thanks again for your help, > > Vlad > -- Kind Regards from Switzerland, Stevan Bajić ------------------------------------------------------------------------------ Virtualization & Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ _______________________________________________ Dspam-user mailing list Dspam-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspam-user