> Andrea, Hi there, Thomas, we are on the public list, aren't we :) ? > your request was very logical.
Well... to tell it all, I reported about such a behavior here and there, but then, I didn't really pay attention to it... until I was forced to setup a script, scheduled at intervals, to "trim" the corpus and restore it to "normal" and, sincerely, given that ASSP has options to deal with this, I think ASSP *should* deal with this :) and keep the corpus balanced > Why is assp not able to produce a fine corpusnorm/spamdb/HMM, if all > information is available and the folders are full of files? > Had a sleepness night. I think I've found a way to fix this. Now ... you make me feel somewhat guilty !! Sleep is a need and sincerely, causing a sleepless night isn't exactly something I like to cause (ok, given that the night went wasted thinking to code <grin>) > After the error folders are processed, a temporary corpusnorm is > calculated. The files in the spam and notspam folder are counted - > and depending on the temp-corpusnorm, the spam-file-count and > notspam-file-count, the apx. required count of spam files is > calcuated. If these spam files are finished processed - based on the > needed notspam word count - the apx. required count of notspam files > is calculated. > > So (I hope), even if a machine gets too many or too less spams over a > time , this logic will be able to ensure a fine corpusnorm. I see, so, basically, you're saying that the weight reported in the "rebuild report" isn't correct ?!? Not that it's an issue, I can live with that but... did I get it right ? (sorry if I didn't but last night I slept 2 hours +/- [yeah, I know, but I was dealing with some *darn* UTM issues and had to "protect the innocent"] and today I had to travel @ a customer site... just got back) If so, then, maybe slightly changing the rebuild code to emit correct values may be a good idea :) ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ Assp-test mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/assp-test
