See attached...
S
-----Original Message-----
From: Thomas Eckardt [mailto:thomas.ecka...@thockar.com]
Sent: Monday, November 21, 2011 3:38 PM
To: ASSP development mailing list
Subject: [Assp-test] Antwort: Re: Antwort: Rebuildspamdb has deleted most of
the spam....
>I'm running 2.1.2 build 11321
So assp has not removed the files. If so - there should be log or rebuild-log
entries to find about that.
Thomas
Von: Steve Moffat <st...@optimum.bm>
An: ASSP development mailing list <assp-test@lists.sourceforge.net>
Datum: 21.11.2011 20:33
Betreff: Re: [Assp-test] Antwort: Rebuildspamdb has deleted most of
the spam....
I'm running 2.1.2 build 11321
S
-----Original Message-----
From: Thomas Eckardt [mailto:thomas.ecka...@thockar.com]
Sent: Monday, November 21, 2011 3:16 PM
To: ASSP development mailing list
Subject: [Assp-test] Antwort: Rebuildspamdb has deleted most of the spam....
2011-11-15
fixed in assp 2.1.2 build 11319:
.....
- the rebuildspamdb has removed too young files from the long time corpus
Thomas
Von: Steve Moffat <st...@optimum.bm>
An: "'assp-test@lists.sourceforge.net'"
<assp-test@lists.sourceforge.net>
Datum: 21.11.2011 19:18
Betreff: [Assp-test] Rebuildspamdb has deleted most of the spam....
Hi
Yesterday's rebuildspamdb....
Nov-20-11 00:18:46 Spam Weight: 3,697,885
Nov-20-11 00:18:46 Not-Spam Weight: 4,710,446
Nov-20-11 00:18:46 Corpus norm: 0.7850 - (ok - slighly ham
heavy)
Nov-20-11 00:18:46 Corpus confidence: 0.67744728
Nov-20-11 00:18:46 Recommendation: RebuildSpamDB will limit the number of used
messages in your corpus. Excess files will be ingored.
This afternoons....
Nov-21-11 13:58:10 Spam Weight: 518,146
Nov-21-11 13:58:10 Not-Spam Weight: 4,583,191
Nov-21-11 13:58:10 Corpus norm: 0.1131 - (warning: extremely
ham heavy)
Nov-21-11 13:58:10 Corpus confidence: 0.07887905
Nov-21-11 13:58:10 Corpus norm should be between 0.6 and 1.4
That's definitely a bummer..
Steve
------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure
contains a definitive record of customers, application performance,
security threats, fraudulent activity, and more. Splunk takes this
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test
DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally
privileged and protected in law and are intended solely for the use of the
individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no
known virus in this email!
*******************************************************
------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure
contains a definitive record of customers, application performance,
security threats, fraudulent activity, and more. Splunk takes this
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test
DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally
privileged and protected in law and are intended solely for the use of the
individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no
known virus in this email!
*******************************************************
File rebuildrun.txt follows:
Nov-21-11 00:00:00 RebuildSpamDB-thread rebuildspamdb-version 3.10 started in
ASSP version 2.1.2(11321)
Nov-21-11 00:00:00 ---ASSP Settings---
Nov-21-11 00:00:00 Do Not Collect RedRe Messages: Enabled **Messages matching
the RedRe will be removed from the corpus!**
Nov-21-11 00:00:00 Use Subject as Maillog Names: True
Nov-21-11 00:00:00 Maxbytes: 4000
Nov-21-11 00:00:00 c:/assp/errors/spam
Nov-21-11 00:00:00 File Count: 55
Nov-21-11 00:00:00 Processing... errors/spam with 55 files
Nov-21-11 00:00:01 Imported Files: 55
Nov-21-11 00:00:01 Finished in 1 second(s)
Nov-21-11 00:00:01 c:/assp/errors/notspam
Nov-21-11 00:00:01 File Count: 42
Nov-21-11 00:00:01 Processing... errors/notspam with 42 files
Nov-21-11 00:00:04 Imported Files: 42
Nov-21-11 00:00:04 Finished in 3 second(s)
Nov-21-11 00:00:04 c:/assp/spam
Nov-21-11 00:00:05 File Count: 9,865
Nov-21-11 00:00:05 Processing... spam with 9865 files
Nov-21-11 00:00:25 remove
c:/assp/spam/BEST_quality_generic_CIALIS_wi--18407.eml WhiteList:
'lupelucie...@egroups.com'
Nov-21-11 00:00:31 remove c:/assp/spam/FW_David_Cameron--18360.eml WhiteList:
'zant...@tiscali.co.uk'
Nov-21-11 00:00:48 remove
c:/assp/spam/tickets_for_Cinderalla_Pantomi--18433.eml WhiteList:
'luc...@northrock.bm'
Nov-21-11 00:00:52 Removed White: 3
Nov-21-11 00:00:52 Removed Old: 8,795
Nov-21-11 00:00:52 Imported Files: 1,067
Nov-21-11 00:00:52 Finished in 48 second(s)
Nov-21-11 00:00:52 c:/assp/notspam
Nov-21-11 00:00:55 File Count: 24,291
Nov-21-11 00:00:55 Processing... notspam with 24291 files
Nov-21-11 00:08:39 Imported Files: 24,291
Nov-21-11 00:08:39 Folder contents exceeded `MaxFiles`(12000).
Nov-21-11 00:08:39 Finished in 467 second(s)
Nov-21-11 00:08:39 Generating weighted Bayesian tuplets
Nov-21-11 00:12:02 cleaning old Spamdb records
Nov-21-11 00:12:43 done - cleaning old Spamdb records - removed 101871 from
51761
Nov-21-11 00:12:43 done - Generating weighted Bayesian tuplets
Nov-21-11 00:12:43 Bayesian Pairs: 22,118 new, 22,118 now in list
Nov-21-11 00:12:43 generating Spamdb.helo records from 1770 collected HELO's
Nov-21-11 00:12:43 cleaning old Spamdb.helo records
Nov-21-11 00:12:44 done - cleaning old Spamdb.helo records
Nov-21-11 00:12:44 HELO Blacklist: 0 new, 16 now in list
Nov-21-11 00:12:44 Spam Weight: 502,718
Nov-21-11 00:12:44 Not-Spam Weight: 4,710,986
Nov-21-11 00:12:44 Corpus norm: 0.1067 - (warning: extremely ham heavy)
Nov-21-11 00:12:44 Corpus confidence: 0.07782751
Nov-21-11 00:12:44 Recommendation: RebuildSpamDB will limit the number of used
messages in your corpus. Excess files will be ingored.
Nov-21-11 00:12:44 Corpus norm should be between 0.6 and 1.4
Nov-21-11 00:12:44 Recommendation: You need more spam messages in the corpus.
Nov-21-11 00:12:44 starting auto correction for corpus - delete old ham files
from notspam
Nov-21-11 00:13:09 info: starting cleanup for to much (old) files in folder
c:/assp/notspam
info: deleted 9716 old files from folder c:/assp/notspam
Nov-21-11 00:13:09 Recommendation: You should reduce now MaxBytes to 2500!
Nov-21-11 00:13:09 Total processing time: 789 second(s)
Nov-21-11 00:13:09 Total processing data: 93.49 MByte
Nov-21-11 00:13:09 building new GripList records and bounce report
Nov-21-11 00:13:09 processing Logfile c:/assp/logs/maillog.txt
Nov-21-11 00:13:13 processing Logfile c:/assp/logs/11-11-16.maillog.txt
Nov-21-11 00:13:18 bounce report for the last two days: no bounces received
Nov-21-11 00:13:19 Uploading Griplist via Direct Connection
Nov-21-11 00:13:19 Submitted 2730 bytes: 0 IPv6 addresses, 302 IPv4 addresses
Nov-21-11 00:13:19 Trashlist was saved to c:/assp/trashlist.db
------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure
contains a definitive record of customers, application performance,
security threats, fraudulent activity, and more. Splunk takes this
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test