On Apr 20, 2012, at 11:39 AM, Stevan Bajić wrote:

> On 20.04.2012 07:32, Steve Fatula wrote:
>> [...]
> 
>> If you give me the SPAM corpus, I can just run dspam_train on it (and I'd 
>> even add my 80). But it will be pretty unbalanced since I have few HAM 
>> messages since I only keep a month (maybe a few thousand messages). I am not 
>> sure that matters much? In the end, won't the detection still work, maybe 
>> biased towards SPAM at first, but, surely, it woudln't take too long to stop 
>> false positives?
>> 
> Lets say you want to make that merged global group. Then this is what you 
> should do:
> 
> 1) Create a new DSPAM user. If you can create a flat user (no 
> localp...@domain.tld) because a flat user name will be easier to recognize on 
> your setup where you usually have full blown up email addresses as user name. 
> Lets say that new created user is called "SpamHitRate".
> 
> 2) Change preferences for that user to:
> dspam_admin change preference "SpamHitRate" "dailyQuarantineSummary" "off"
> dspam_admin change preference "SpamHitRate" "enableBNR" "on"
> dspam_admin change preference "SpamHitRate" "enableWhitelist" "off"
> dspam_admin change preference "SpamHitRate" "fallbackDomain" "off"
> dspam_admin change preference "SpamHitRate" "ignoreGroups" "on"
> dspam_admin change preference "SpamHitRate" "ignoreRBLLookups" "on"
> dspam_admin change preference "SpamHitRate" "makeCorpus" "off"
> dspam_admin change preference "SpamHitRate" "optIn" "on"
> dspam_admin change preference "SpamHitRate" "optOut" "off"
> dspam_admin change preference "SpamHitRate" "optOutClamAV" "on"
> dspam_admin change preference "SpamHitRate" "processorBias" "off"
> dspam_admin change preference "SpamHitRate" "showFactors" "off"
> dspam_admin change preference "SpamHitRate" "signatureLocation" "headers"
> dspam_admin change preference "SpamHitRate" "spamAction" "deliver"
> dspam_admin change preference "SpamHitRate" "spamSubject" ""
> dspam_admin change preference "SpamHitRate" "statisticalSedation" "0"
> dspam_admin change preference "SpamHitRate" "storeFragments" "off"
> dspam_admin change preference "SpamHitRate" "tagNonspam" "off"
> dspam_admin change preference "SpamHitRate" "tagSpam" "off"
> dspam_admin change preference "SpamHitRate" "trainingMode" "TOE"
> dspam_admin change preference "SpamHitRate" "trainPristine" "off"
> dspam_admin change preference "SpamHitRate" "whitelistThreshold" "9999999"
> 
> Basically you want that user to not use ClamAV, nor any groups, nor any RBL, 
> nor do you want whitelisting or any other mambo jambo. Usually you would not 
> turn off that many helper mechanism on a normal user but this is not a normal 
> user. You want that user to be as hard as possible. You don't care about 
> false positive or false negative on that user. In fact this is exactly what 
> you want. You want that user to generate as much false positive / negative as 
> needed. Because the more FP/FN you have the more you can make DSPAM to learn. 
> And this is what you are going to do mainly with that user. You are going to 
> use dspam_train with Spam/Ham corpi.
> 
> 3) Now go on and train with dspam_train: dspam_train SpamHitRate [spam_corpus 
> maildir or mbox] [nonspam_corpus maildir or mbox]


Would something like

/usr/local/bin/dspam --client --mode=toe --source=innoculation --class=spam 
--user SpamHitRate --deliver=summary < msgs

work just as well?    

I'm curious to learn the difference between source=corpus and 
source=inoculation.  If I'm setting up a new dspam system and have messages 
that are nothing but 100% spam... would inoculation be better?


Thank you,
Chad




------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user

Reply via email to