[Dspam-user] Training seems to be ineffective

David Rees Wed, 10 Apr 2013 12:56:59 -0700

I've been running dspam for a while and have a few users using it with good
filtering rates, although I had noticed that on my account despite repeated
training there are certain newsletters that continue to get tagged as false
positives. I chalked it up to using TEFT mode for quite a while after some
research and switched to TUM.


But now I've started a new user which receives a good volume of extremely
predictable SPAM and despite training these mails continue to be delivered
with a typical Confidence of 0.9899 and Probability of 0.0000.

The killer is that even despite training, some of these emails have been
Whitelisted despite a recent retraining as SPAM!

So it appears that the training is completely ineffective as I understand
that once an email as been marked as spam, it should no longer consider the
email for the whitelist.

If you look at the dspam log or look at the dspam webui, the history page
seems to indicate that the email is indeed being retrained successfully.

Now, if I take one of these emails, remove the dspam headers and train then
as an inoculation source, after retraining around 5 times the email will be
successfully marked as spam.

How can I debug this issue further?

The system in question is running dspam 3.10.2 with a PostgreSQL backend.

Thanks

Dave

------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter

_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user

[Dspam-user] Training seems to be ineffective

Reply via email to