See attached...
S

-----Original Message-----
From: Thomas Eckardt [mailto:thomas.ecka...@thockar.com] 
Sent: Monday, November 21, 2011 3:38 PM
To: ASSP development mailing list
Subject: [Assp-test] Antwort: Re: Antwort: Rebuildspamdb has deleted most of 
the spam....

>I'm running 2.1.2 build 11321

So assp has not removed the files. If so - there should be log or rebuild-log 
entries to find about that.

Thomas




Von:    Steve Moffat <st...@optimum.bm>
An:     ASSP development mailing list <assp-test@lists.sourceforge.net>
Datum:  21.11.2011 20:33
Betreff:        Re: [Assp-test] Antwort: Rebuildspamdb has deleted most of 
the spam....





I'm running 2.1.2 build 11321
S

-----Original Message-----
From: Thomas Eckardt [mailto:thomas.ecka...@thockar.com]
Sent: Monday, November 21, 2011 3:16 PM
To: ASSP development mailing list
Subject: [Assp-test] Antwort: Rebuildspamdb has deleted most of the spam....

2011-11-15
fixed in assp 2.1.2 build 11319:
 
.....

- the rebuildspamdb has removed too young files from the long time corpus 

Thomas


Von:    Steve Moffat <st...@optimum.bm>
An:     "'assp-test@lists.sourceforge.net'" 
<assp-test@lists.sourceforge.net>
Datum:  21.11.2011 19:18
Betreff:        [Assp-test] Rebuildspamdb has deleted most of the spam....




Hi
Yesterday's rebuildspamdb....

Nov-20-11 00:18:46 Spam Weight:               3,697,885
Nov-20-11 00:18:46 Not-Spam Weight:   4,710,446

Nov-20-11 00:18:46 Corpus norm:             0.7850 - (ok - slighly ham 
heavy)
Nov-20-11 00:18:46 Corpus confidence: 0.67744728
Nov-20-11 00:18:46 Recommendation: RebuildSpamDB will limit the number of used 
messages in your corpus. Excess files will be ingored.


This afternoons....

Nov-21-11 13:58:10 Spam Weight:               518,146
Nov-21-11 13:58:10 Not-Spam Weight:   4,583,191

Nov-21-11 13:58:10 Corpus norm:             0.1131 - (warning: extremely 
ham heavy)
Nov-21-11 13:58:10 Corpus confidence: 0.07887905
Nov-21-11 13:58:10 Corpus norm should be between 0.6 and 1.4

That's definitely a bummer..
Steve
------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure 
contains a definitive record of customers, application performance, 
security threats, fraudulent activity, and more. Splunk takes this 
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test




DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally 
privileged and protected in law and are intended solely for the use of the 


individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no 
known virus in this email!
*******************************************************



------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure 
contains a definitive record of customers, application performance, 
security threats, fraudulent activity, and more. Splunk takes this 
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test




DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally 
privileged and protected in law and are intended solely for the use of the 

individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no 
known virus in this email!
*******************************************************


File rebuildrun.txt follows:



Nov-21-11 00:00:00 RebuildSpamDB-thread rebuildspamdb-version 3.10 started in 
ASSP version 2.1.2(11321)

Nov-21-11 00:00:00 ---ASSP Settings---
Nov-21-11 00:00:00 Do Not Collect RedRe Messages: Enabled **Messages matching 
the RedRe will be removed from the corpus!**

Nov-21-11 00:00:00 Use Subject as Maillog Names: True
Nov-21-11 00:00:00 Maxbytes: 4000 

Nov-21-11 00:00:00 c:/assp/errors/spam
Nov-21-11 00:00:00 File Count:  55
Nov-21-11 00:00:00 Processing... errors/spam with 55 files
Nov-21-11 00:00:01 Imported Files:      55
Nov-21-11 00:00:01 Finished in 1 second(s)

Nov-21-11 00:00:01 c:/assp/errors/notspam
Nov-21-11 00:00:01 File Count:  42
Nov-21-11 00:00:01 Processing... errors/notspam with 42 files
Nov-21-11 00:00:04 Imported Files:      42
Nov-21-11 00:00:04 Finished in 3 second(s)

Nov-21-11 00:00:04 c:/assp/spam
Nov-21-11 00:00:05 File Count:  9,865
Nov-21-11 00:00:05 Processing... spam with 9865 files
Nov-21-11 00:00:25 remove 
c:/assp/spam/BEST_quality_generic_CIALIS_wi--18407.eml WhiteList: 
'lupelucie...@egroups.com'
Nov-21-11 00:00:31 remove c:/assp/spam/FW_David_Cameron--18360.eml WhiteList: 
'zant...@tiscali.co.uk'
Nov-21-11 00:00:48 remove 
c:/assp/spam/tickets_for_Cinderalla_Pantomi--18433.eml WhiteList: 
'luc...@northrock.bm'
Nov-21-11 00:00:52 Removed White:       3
Nov-21-11 00:00:52 Removed Old: 8,795
Nov-21-11 00:00:52 Imported Files:      1,067
Nov-21-11 00:00:52 Finished in 48 second(s)

Nov-21-11 00:00:52 c:/assp/notspam
Nov-21-11 00:00:55 File Count:  24,291
Nov-21-11 00:00:55 Processing... notspam with 24291 files
Nov-21-11 00:08:39 Imported Files:      24,291
Nov-21-11 00:08:39 Folder contents exceeded `MaxFiles`(12000). 
Nov-21-11 00:08:39 Finished in 467 second(s)

Nov-21-11 00:08:39 Generating weighted Bayesian tuplets
Nov-21-11 00:12:02 cleaning old Spamdb records
Nov-21-11 00:12:43 done - cleaning old Spamdb records - removed 101871 from 
51761
Nov-21-11 00:12:43 done - Generating weighted Bayesian tuplets

Nov-21-11 00:12:43 Bayesian Pairs: 22,118 new, 22,118 now in list
Nov-21-11 00:12:43 generating Spamdb.helo records from 1770 collected HELO's
Nov-21-11 00:12:43 cleaning old Spamdb.helo records
Nov-21-11 00:12:44 done - cleaning old Spamdb.helo records

Nov-21-11 00:12:44 HELO Blacklist: 0 new, 16 now in list

Nov-21-11 00:12:44 Spam Weight:    502,718
Nov-21-11 00:12:44 Not-Spam Weight:   4,710,986

Nov-21-11 00:12:44 Corpus norm: 0.1067 - (warning: extremely ham heavy)
Nov-21-11 00:12:44 Corpus confidence:   0.07782751
Nov-21-11 00:12:44 Recommendation: RebuildSpamDB will limit the number of used 
messages in your corpus. Excess files will be ingored.
Nov-21-11 00:12:44 Corpus norm should be between 0.6 and 1.4

Nov-21-11 00:12:44 Recommendation: You need more spam messages in the corpus.

Nov-21-11 00:12:44 starting auto correction for corpus - delete old ham files 
from notspam

Nov-21-11 00:13:09 info: starting cleanup for to much (old) files in folder 
c:/assp/notspam
info: deleted 9716 old files from folder c:/assp/notspam

Nov-21-11 00:13:09 Recommendation: You should reduce now MaxBytes to 2500!  

Nov-21-11 00:13:09 Total processing time: 789 second(s)

Nov-21-11 00:13:09 Total processing data: 93.49 MByte

Nov-21-11 00:13:09 building new GripList records and bounce report
Nov-21-11 00:13:09 processing Logfile c:/assp/logs/maillog.txt
Nov-21-11 00:13:13 processing Logfile c:/assp/logs/11-11-16.maillog.txt

Nov-21-11 00:13:18 bounce report for the last two days: no bounces received

Nov-21-11 00:13:19 Uploading Griplist via Direct Connection
Nov-21-11 00:13:19 Submitted 2730 bytes: 0 IPv6 addresses, 302 IPv4 addresses

Nov-21-11 00:13:19 Trashlist was saved to c:/assp/trashlist.db
------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure 
contains a definitive record of customers, application performance, 
security threats, fraudulent activity, and more. Splunk takes this 
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test

Reply via email to