RE: [qmailtoaster] dspam

Helmut Fritz Sun, 23 Mar 2014 13:38:23 -0700

Just observing the topic - this 'seems' a lot more complicated than
spamassasin.

From: Eric Broch [mailto:[email protected]] 
Sent: Sunday, March 23, 2014 8:32 AM
To: [email protected]
Subject: Re: [qmailtoaster] dspam

On 3/23/2014 8:05 AM, Angus McIntyre wrote:

On Mar 20, 2014, at 6:34 PM, Eric Broch <[email protected]> wrote:

Your welcome. Since November, I've created a much easier automated install
here <ftp://ftp.whitehorsetc.com/pub/dspam/> . Be sure to look at the Readme
file. And, as always, check the script.

Hmm. That seems to be an FTP link. I tried logging on as 'guest', but it
doesn't seem to want to talk to me.

I'm not really convinced by dspam yet. Untrained, it classifies everything
as 'Innocent'. I fed it a massive corpus of spam and it then classified
everything as 'Spam'. So I blew everything away and started over. This time,
I've been feeding it an unrecognized spam (which is to say, all of it) in
correction mode (i.e. --source=error). This is having a limited effect.
After feeding it many hundreds of spams, it still believes that all my spam
is actually 'Innocent', but at least I've shaken its confidence a bit - it's
now only 85% convinced that 'Pro Viagra for Men' is a valid message.

It looks like I will have a lot more training to do before I can persuade it
to successfully recognize any spam at all . and then only for the particular
user that I've trained. I'm also concerned that many of the messages I see
are filled with hash buster text, which is designed specifically to dodge
and poison statistical filters like dspam.

Apologies if this is slightly off-topic, but given that dspam is under
consideration for future QMT releases, I felt that I should share my
experience. It's certainly not looking like a magic bullet to me at the
moment.

Angus

Angus,

The FTP site should work now. My firewall was blocking it for some reason
(testing fail2ban). 

Anyway, for my set ups I did not train on the 'corpus' setting only on
'error.' In order to train on 'error' the message must have a DSPAM header
which I configured to be in the email header NOT them message. On my own
machine I trained (as error) about 30 spam messages marked by DSPAM as
innocent. And, now I get 1 spam a month, if that.
On my client's site which receives about 60,000 emails a month on average I
trained between 100 and 200 messages the same way with similar results.

I read through the users email directory and catenate (cat) each spam marked
as Innocent through the dspam client program as follows:

cat $email | dspamc --user user@domain --mode=teft --class=spam
--source=error

The results have been excellent. 

EricB

RE: [qmailtoaster] dspam

Reply via email to