Dickson, Paul wrote:
I enjoy success with the following: MaxFiles: 30000 MaxBytes: 5000 Another *very* important aspect to all of this, is controlling what is actually getting into your corpus. Is it good/relevant to your organization, or is it bad/irrelevant and make you more susceptible to spammy content? I *highly* recommend using your redRe to weed out personal emails that people send from polluting your corpus. To give an example: I don't allow any messages that have been forwarded (fw|fwd): to enter my corpus. I found via observation that >75% was garbage and added no value to the corpus. |
------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________ Assp-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/assp-user
