Hello Eric, > Maybe a more idea situation would be to have several people from within your > organization all extract a subsection of their historical emails,
Sure, except this is not much of an option in case of a hosting company (i.e. us) with a few thousand customers and some 5 thousand e-mail addresses. > or if you > have access to them on a central server, grab them from there. Certainly - that is, if you know MTA (Exim in our case) well enough to be able to capture _only_ SASL-authenticated mail. I don't know it (yet) well enough. :-( > I wouldn't > expect there to be privacy issues as you aren't reading them - you are just > running them through a spam analyzer. As for finding a good source of spam, > I don't think that is a particularly big issue, or we wouldn't here > discussing it already. :) I've set up a spam trap and it grew to almost half of a GB in several days. =:-) > In my case, it seems to have worked fairly well in filtering out spam and > notspam right now. There is still some manual sorting to do, as well as > figuring out which emails belong on redlists and/or NP lists, but am slowly > getting there. The volume of mail in our case is so immense that manual redlisting/whitelisting is not an option really... Automatic whitelisting is the main reason I chose ASSP. Volume of mail is one of reasons I wrote that script - I needed reduction of manual workload associated with training the filters, while I found the hard way that spam and ham characteristics on every machine is different and just copying one's own spam/ham corpus to machine serving a different set of domains and customers ends up with many false positives and false negatives. Ouch. I also have to admit we have problems with sustained throughput of ASSP on some machines - it's barely able to keep up with the flow of mail in rush hours and delays in response time of mailservers when processing SMTP commands are significant (up to several seconds in worst cases). When I turn it off, Exim is reacting much faster than when preceded with ASSP. Regards, Marcin Krol ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Assp-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/assp-user
