[dspam-users] Different results on different platforms

Peter Larkowski Tue, 13 Nov 2007 14:41:32 -0800

Hello:

I'm in the process of converting my small freebsd server fromsendmail/crm114/mbox to exim/dspam/maildir. Anyway, this is provingto be a bigger job than I originally planned (aren't they all). Iapologize for the long email, but I need to explain the situation.

I first installed all the necessary programs on my desktop pc(freebsd/i386) so that I could get my test configuration workingbefore I took everything "live".

I installed dspam from freebsd's ports. I'm using a mysql 5.0backend, teft, not in daemon mode with exim as the LDA and procmailputting my spam into a mailbox based on headers. I decided to traindspam with my last 1000 hams and 1000 spam messages, so I filteredout my CRM114 headers with grep, converted each mbox to maildir andfed the resulting directories to dspam_train. I'm still not sure ifwant "pretraining" or not, but it at least confirmed dspam wasworking. The results were as follows:


                TP True Positives:            977
                TN True Negatives:            998
                FP False Positives:             2
                FN False Negatives:            23
                SC Spam Corpusfed:              0
                NC Nonspam Corpusfed:           0
                TL Training Left:            1500
                SHR Spam Hit Rate          97.70%
                HSR Ham Strike Rate:        0.20%
                OCA Overall Accuracy:      98.75%

Not bad I thought, so then, I felt happy with everything and Iinstalled everything the same way (from the ports tree with the sameoptions) on my server which incidentally is freebsd/sparc64. Theresults of my identical training are as follows:


                TP True Positives:            913
                TN True Negatives:           1000
                FP False Positives:             0
                FN False Negatives:            87
                SC Spam Corpusfed:              1
                NC Nonspam Corpusfed:           0
                TL Training Left:            1500
                SHR Spam Hit Rate          91.30%
                HSR Ham Strike Rate:        0.00%
                OCA Overall Accuracy:      95.65%

Why is dspam so much better on my athlon than it is on myultrasparc. The versions of freebsd are identical, with the sameversion of dspam, the same build variables, the same training corpus(and the logs indicate the messages were processed in the sameorder. I've run the training several times now (starting with anempty mysql db and the same X-CRM114 header stripped mbox files) andthe results from each machine are reproducible.

Any ideas? I don't want to put the less accurate dspam intoproduction (especially if I've found a bug). I can send the logfiles if that would help, just let me know what other info is relevant.


Thanks,
-Peter

[dspam-users] Different results on different platforms

Reply via email to