Dickson, Paul wrote:

At one time I had asked around what others were using for their max files.  If I remember correctly, Fritz responded to me that he had his set @ 3000, and considering his organization is about the same size as mine, I went ahead and set it at that.  For some reason later on, I decided to up it to 30k.


I enjoy success with the following:

MaxFiles: 30000
MaxBytes: 5000

Another *very* important aspect to all of this, is controlling what is actually getting into your corpus.  Is it good/relevant to your organization, or is it bad/irrelevant and make you more susceptible to spammy content?

I *highly* recommend using your redRe to weed out personal emails that people send from polluting your corpus.  To give an example: I don't allow any messages that have been forwarded (fw|fwd): to enter my corpus.  I found via observation that >75% was garbage and added no value to the corpus.
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Assp-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-user

Reply via email to