I've been running DSPAM for approximately 2 weeks and looking at the output of dspam_stats, I'm curious how long training normally takes.
A script is run nightly to check .Junk mailboxes for false negatives and .NotJunk mailboxes for false positives and retrains on error. (Richard Valk's http://switch.richard5.net/serverinstall/train.dspam) Here's sample output from dspam_stats -H x...@dop.com: TP True Positives: 0 TN True Negatives: 19 FP False Positives: 0 FN False Negatives: 348 SC Spam Corpusfed: 0 NC Nonspam Corpusfed: 0 TL Training Left: 2481 SHR Spam Hit Rate 0.00% HSR Ham Strike Rate: 0.00% PPV Positive predictive value: 100.00% OCA Overall Accuracy: 5.18% y...@dop.com: TP True Positives: 0 TN True Negatives: 0 FP False Positives: 0 FN False Negatives: 3035 SC Spam Corpusfed: 0 NC Nonspam Corpusfed: 0 TL Training Left: 2500 SHR Spam Hit Rate 0.00% HSR Ham Strike Rate: 100.00% PPV Positive predictive value: 100.00% OCA Overall Accuracy: 0.00% z...@dop.com: TP True Positives: 0 TN True Negatives: 0 FP False Positives: 0 FN False Negatives: 358 SC Spam Corpusfed: 0 NC Nonspam Corpusfed: 0 TL Training Left: 2500 SHR Spam Hit Rate 0.00% HSR Ham Strike Rate: 100.00% PPV Positive predictive value: 100.00% OCA Overall Accuracy: 0.00% te...@dop.com: TP True Positives: 0 TN True Negatives: 3 FP False Positives: 0 FN False Negatives: 5108 SC Spam Corpusfed: 0 NC Nonspam Corpusfed: 0 TL Training Left: 2497 SHR Spam Hit Rate 0.00% HSR Ham Strike Rate: 0.00% PPV Positive predictive value: 100.00% OCA Overall Accuracy: 0.09% Is so much "Training Left" normal? Do I have something misconfigured? Will DSPAM start tagging email as SPAM only after 2500 successfully classified emails? $ dspam --version DSPAM Anti-Spam Suite 3.9.0 (agent/library) Copyright (c) 2002-2009 DSPAM Project http://dspam.sourceforge.net. DSPAM may be copied only under the terms of the GNU General Public License, a copy of which can be found with the DSPAM distribution kit. $ cat /usr/local/dspam.conf | grep -v ^# | grep -v ^$ Home /usr/local/var/dspam StorageDriver /usr/local/lib/dspam/libmysql_drv.dylib TrustedDeliveryAgent "/usr/bin/procmail" DeliveryHost 127.0.0.1 DeliveryPort 10026 DeliveryIdent localhost DeliveryProto SMTP OnFail error Trust root Trust dspam Trust apache Trust mail Trust mailnull Trust smmsp Trust daemon Trust _dspam Trust _postfix Trust _www TrainingMode toe TestConditionalTraining on Feature whitelist Algorithm graham burton Tokenizer osb PValue bcr WebStats on Preference "trainingMode=TOE" # { TOE | TUM | TEFT | NOTRAIN } -> default:teft Preference "spamAction=tag" # { quarantine | tag | deliver } -> default:quarantine Preference "spamSubject=[SPAM]" # { string } -> default:[SPAM] Preference "statisticalSedation=5" # { 0 - 10 } -> default:0 Preference "enableBNR=on" # { on | off } -> default:off Preference "enableWhitelist=on" # { on | off } -> default:on Preference "signatureLocation=headers" # { message | headers } -> default:message Preference "tagSpam=off" # { on | off } Preference "tagNonspam=off" # { on | off } Preference "showFactors=on" # { on | off } -> default:off Preference "optIn=off" # { on | off } Preference "optOut=off" # { on | off } Preference "whitelistThreshold=10" # { Integer } -> default:10 Preference "makeCorpus=off" # { on | off } -> default:off Preference "storeFragments=off" # { on | off } -> default:off Preference "localStore=" # { on | off } -> default:username <---- ** okay to be blank? ** Preference "processorBias=on" # { on | off } -> default:on Preference "fallbackDomain=off" # { on | off } -> default:off Preference "trainPristine=off" # { on | off } -> default:off Preference "optOutClamAV=off" # { on | off } -> default:off Preference "ignoreRBLLookups=off" # { on | off } -> default:off Preference "RBLInoculate=off" # { on | off } -> default:off AllowOverride enableBNR AllowOverride enableWhitelist AllowOverride fallbackDomain AllowOverride ignoreGroups AllowOverride ignoreRBLLookups AllowOverride localStore AllowOverride makeCorpus AllowOverride optIn AllowOverride optOut AllowOverride optOutClamAV AllowOverride processorBias AllowOverride RBLInoculate AllowOverride showFactors AllowOverride signatureLocation AllowOverride spamAction AllowOverride spamSubject AllowOverride statisticalSedation AllowOverride storeFragments AllowOverride tagNonspam AllowOverride tagSpam AllowOverride trainPristine AllowOverride trainingMode AllowOverride whitelistThreshold AllowOverride dailyQuarantineSummary MySQLServer /var/mysql/mysql.sock MySQLUser * MySQLPass * MySQLDb * MySQLCompress false MySQLVirtualTable dspam_virtual_uids MySQLVirtualUIDField uid MySQLVirtualUsernameField username MySQLUIDInSignature on HashRecMax 98317 HashAutoExtend on HashMaxExtents 0 HashExtentSize 49157 HashPctIncrease 10 HashMaxSeek 10 HashConnectionCache 10 Notifications off PurgeSignatures 14 # Stale signatures PurgeNeutral 90 # Tokens with neutralish probabilities PurgeUnused 90 # Unused tokens PurgeHapaxes 30 # Tokens with less than 5 hits (hapaxes) PurgeHits1S 15 # Tokens with only 1 spam hit PurgeHits1I 15 # Tokens with only 1 innocent hit LocalMX 127.0.0.1 SystemLog on UserLog on Opt out ParseToHeaders on ChangeModeOnParse on ChangeUserOnParse full ServerPID /var/run/dspam.pid ServerParameters "--deliver=innocent,spam" ServerIdent "localhost.local" ProcessorURLContext on ProcessorBias on StripRcptDomain off ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Dspam-user mailing list Dspam-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspam-user