On Tue, 15 Jun 2010 12:42:54 +0300
Jari Juslin <z...@iki.fi> wrote:

> Stevan Bajić wrote:
> > 10 years? DSPAM does not exist that long :)
> 
> That's why I said "almost" ;-).
> 
> > See? The whole message just has an attachment called latterly.rtf and DSPAM 
> > does not tokenize attachments.
> > 
> > So you limit DSPAM to just be able to tokenize stuff it finds in the 
> > headers. I don't have show factors on and I add my signature just to the 
> > headers but for the test I am quickly going to turn them on.
> 
> I understand that it is hard for DSpam to correctly classify messages 
> like that.
> 
> But the bug is not about the classification; the bug is about DSpam not 
> finding the signature while it exists on both message body and the headers.
> 
Let's process that message:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # dspam --user ste...@bajic.ch --process < jari.juslin.eml
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

And this is the mail as I get it after doing that:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Return-Path: <lawbreak...@seawolfchile.cl>
X-Original-To: z...@localhost
Delivered-To: z...@localhost
Received: from terra.nblnetworks.fi (localhost [127.0.0.1])
        by terra.nblnetworks.fi (Postfix) with ESMTP id C42669B243
        for <z...@localhost>; Mon, 14 Jun 2010 15:52:03 +0300 (EEST)
Received: from mail.netsonic.fi [194.29.192.90]
        by terra.nblnetworks.fi with IMAP (fetchmail-6.3.9-rc2)
        for <z...@localhost> (single-drop); Mon, 14 Jun 2010 15:52:03 +0300 (EEST)
Received: from netsonic.fi ([unix socket])
         by mail.netsonic.fi (Cyrus v2.3.7-Invoca-RPM-2.3.7-4mke) with LMTPA;
         Mon, 14 Jun 2010 16:51:58 +0300
X-Sieve: CMU Sieve 2.3
Received: from leimasin.iki.fi (leimasin.iki.fi [212.16.98.49])
        by netsonic.fi (Postfix) with ESMTP id 07D901F11AA0
        for <nanos...@netsonic.fi>; Mon, 14 Jun 2010 16:51:57 +0300 (EEST)
Received: from ikiaikainen.iki.fi (r...@ikiaikainen.iki.fi [212.16.98.54])
        by leimasin.iki.fi (8.13.8/8.13.4) with ESMTP id o5ECoqxk002492
        for <jari.jus...@asetus1.silppuri.iki.fi>; Mon, 14 Jun 2010 15:50:52 +0300 (EEST):
Received: from jrkh.qtrduaf.com ([83.153.36.71])
        by ikiaikainen.iki.fi (8.14.4/8.14.4) with SMTP id o5ECoogw018334
        for <jari.jus...@iki.fi>; Mon, 14 Jun 2010 15:50:51 +0300 (EEST)
Message-ID: <4c1624e1.9060...@horngshiue.com>
Date: Mon, 14 Jun 2010 14:49:21 +0200
From: Brindamour Siew <lawbreak...@seawolfchile.cl>
MIME-Version: 1.0
To: Baillet Segerson <jari.jus...@iki.fi>
Subject: [SPAM] "We will keep the sun
Content-Type: application/octet-stream; name="latterly.rtf"
Content-Transfer-Encoding: base64
X-Spam-Status: No, score=2.2 required=5.0 tests=RCVD_IN_BL_SPAMCOP_NET
        autolearn=disabled version=3.2.5
X-Spam-Level: **
X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on leimasin.iki.fi
X-DSPAM-Result: Spam
X-DSPAM-Processed: Tue Jun 15 11:56:24 2010
X-DSPAM-Confidence: 0.9884
X-DSPAM-Improbability: 1 in 8501 chance of being ham
X-DSPAM-Probability: 0.9931
X-DSPAM-Signature: 4,4c174e48303731540914790

e1xydGYxXGFuc2lcYW5zaWNwZzEyNTFcZGVmZjBcZGVmbGFuZzEwNDl7XGZvbnR0Ymx7XGYw
XGZzd2lzc1xmcHJxMlxmY2hhcnNldDIwNHtcKlxmbmFtZSBBcmlhbDt9QXJpYWwgQ1lSO317
XGYxXGZzd2lzc1xmY2hhcnNldDIwNHtcKlxmbmFtZSBBcmlhbDt9QXJpYWwgQ1lSO319DQp7
XGNvbG9ydGJsIDtccmVkMFxncmVlbjBcYmx1ZTI1NTtccmVkMFxncmVlbjEyOFxibHVlMDt9
DQp7XCpcZ2VuZXJhdG9yIE1zZnRlZGl0IDQuNS4zMC4zOTc0O31cdmlld2tpbmQ0XHVjMVxw
YXJkXHNhMjAwXHNsMjc2XHNsbXVsdDFcbGFuZzlcZjBcZnMzMntcZmllbGR7XCpcZmxkaW5z
dHtIWVBFUkxJTksgImh7XCpcZGQgNC41LjMwLjM5NzQ7fXR0cDovL2NsdWJraW5nLmluZm8i
fX17XGZsZHJzbHR7XHVsXGNmMSBodHRwOi8vY2x1YmtpbmcuaW5mb319fVxmMFxjZjFcYlxm
czMyICAtIE9OTElORSBDQVNJTk8hXHBhcg0KXGxpbmVcY2YyXGJcZjBcZnMyOCBWSVAgQ0xV
QiBDYXNpbm8gaXMgYSBncmVhdCBvbmxpbmUgY2FzaW5vIHRoYXQgb2ZmZXJzIHRoZSB1bmlx
dWUgY29tYmluYXRpb24gb2YgdG9wIHF1YWxpdHkgZ2FtZXMsIGhpZ2ggcGF5b3V0cyBhbmQg
YSAyNC83IHByb2Zlc3Npb25hbCBjdXN0b21lciBzdXBwb3J0LlxwYXINClxwYXIxMDAgcHJv
Z3Jlc3NpdmUgZ2FtZXMgd2l0aCB0b3dlcmluZyBqYWNrcG90cywgd2hpY2ggYXJlIHJlYWR5
IHRvIGV4cGxvZGUgYW5kIGNhbiBtYWtlIG11bHRpLW1pbGxpb25haXJlcyBvdXQgb2YgVklQ
IENMVUIgcGxheWVycyEgRG93bmxvYWQgdGhlIHNvZnR3YXJlIGZvciBmcmVlLCBwaWNrIHVw
IHRoZSBpbmNyZWRpYmxlICQ3NzcgV2VsY29tZSBCb251cyBvbiB5b3Ugd2F5IGluIGFuZCBz
dGFydCBwbGF5aW5nICYgd2lubmluZyFccGFyDQp9DQoA
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
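To see what is hidden from the tokenizer in that base64 body, the first line of it can be decoded by hand. This is only a small sketch in Python, not something DSPAM does; the function name `decode_chunk` is made up for the illustration, and the input is the first body line copied from the message above.

```python
import base64

# First line of the base64 body from the message above, copied verbatim.
FIRST_BODY_LINE = (
    "e1xydGYxXGFuc2lcYW5zaWNwZzEyNTFcZGVmZjBcZGVmbGFuZzEwNDl7XGZvbnR0Ymx7XGYw"
)


def decode_chunk(chunk: str) -> bytes:
    """Decode one base64 chunk (hypothetical helper, for illustration only)."""
    return base64.b64decode(chunk)


decoded = decode_chunk(FIRST_BODY_LINE)
# Show the leading bytes of the decoded attachment.
print(decoded[:6])
```

The decoded bytes begin with an RTF header, so the actual spam text sits inside the attachment and only the headers remain for tokenizing.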

Let me check if the signature is in MySQL:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # mysql --user=root --password=$(cat /root/.mysql.pwd) 
--socket=/var/run/mysqld/mysqld.sock -e "select 
count(uid),uid,signature,length,created_on from 
sysdb_dspam.dspam_signature_data where uid=4 and 
signature='4,4c174e48303731540914790'"
+------------+-----+---------------------------+--------+------------+
| count(uid) | uid | signature                 | length | created_on |
+------------+-----+---------------------------+--------+------------+
|          1 |   4 | 4,4c174e48303731540914790 |    456 | 2010-06-15 |
+------------+-----+---------------------------+--------+------------+
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Yes. It's there. Let me check the DSPAM log:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # grep "4,4c174e48303731540914790" /var/spool/dspam/system.log
1276595784      S       Brindamour Siew <lawbreak...@seawolfchile.cl>   
4,4c174e48303731540914790       "We will keep the sun     0.034437        
ste...@bajic.ch Tagged  <4c1624e1.9060...@horngshiue.com>
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Yes. The message is logged in the global DSPAM system log. Since you wrote that you use 
a script for retraining, let me try the script that you can find in Git:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # ./dspam-retrain-forward.pl --debug=yes --user ste...@bajic.ch 
--class=innocent --source=error --full-email=yes --headers-only=yes 
--bodies-only=no --first-only=no < jari.juslin.classified.eml
DEBUG: Found DSPAM signature '4,4c174e48303731540914790' in header.
DEBUG: /usr/bin/dspam --source=error --class=innocent 
--signature=4\,4c174e48303731540914790 --user stev...@bajic\.ch
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
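The header lookup that the debug output reports can be illustrated roughly like this. It is a sketch in Python using the standard `email` module, not the way dspam-retrain-forward.pl is actually written; the function name `find_dspam_signature` and the shortened sample message are assumptions made for the example.

```python
from email import message_from_string


def find_dspam_signature(raw_message: str):
    """Return the X-DSPAM-Signature header value, or None if it is absent.

    Hypothetical helper: it only looks at the headers, not at the body.
    """
    msg = message_from_string(raw_message)
    value = msg.get("X-DSPAM-Signature")
    return value.strip() if value else None


# Shortened stand-in for the classified message shown earlier.
sample = (
    'Subject: [SPAM] "We will keep the sun\n'
    "X-DSPAM-Result: Spam\n"
    "X-DSPAM-Signature: 4,4c174e48303731540914790\n"
    "\n"
    "body\n"
)

signature = find_dspam_signature(sample)
missing = find_dspam_signature("Subject: no signature here\n\nbody\n")
print(signature)
```

A message without the header gives `None`, which is the case a retraining script has to report instead of silently doing nothing.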

Looks good. Okay, this time let me not use the debug switch:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # ./dspam-retrain-forward.pl --user ste...@bajic.ch --class=innocent 
--source=error --full-email=yes --headers-only=yes --bodies-only=no 
--first-only=no < jari.juslin.classified.eml
nyx ~ # echo ${?}
0
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Execution was okay, with no error. Let me check again what the global DSPAM 
system log is telling me:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # grep "4,4c174e48303731540914790" /var/spool/dspam/system.log
1276595784      S       Brindamour Siew <lawbreak...@seawolfchile.cl>   
4,4c174e48303731540914790       "We will keep the sun     0.034437        
ste...@bajic.ch Tagged  <4c1624e1.9060...@horngshiue.com>
1276596403      F       <None Specified>        4,4c174e48303731540914790       
<None Specified>        0.049404  ste...@bajic.ch Retrained
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Not bad. The script retrained DSPAM the proper way. Let me try retraining 
as spam:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # ./dspam-retrain-forward.pl --user ste...@bajic.ch --class=spam 
--source=error --full-email=yes --headers-only=yes --bodies-only=no 
--first-only=no < jari.juslin.classified.eml
nyx ~ # echo ${?}
0
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

No error. Good. And now let me see what the global DSPAM system log is telling 
me:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # grep "4,4c174e48303731540914790" /var/spool/dspam/system.log
1276595784      S       Brindamour Siew <lawbreak...@seawolfchile.cl>   
4,4c174e48303731540914790       "We will keep the sun     0.034437        
ste...@bajic.ch Tagged  <4c1624e1.9060...@horngshiue.com>
1276596403      F       <None Specified>        4,4c174e48303731540914790       
<None Specified>        0.049404  ste...@bajic.ch Retrained
1276597044      M       <None Specified>        4,4c174e48303731540914790       
<None Specified>        0.054198  ste...@bajic.ch Retrained
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
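For reading those log lines, a rough parser could look like the following. It is a sketch in Python under a few assumptions: that the fields in system.log are tab-separated (the mail client has turned the tabs into spaces and wrapped the lines above), that the field order is the one visible in the output, and that the codes mean what the three steps in this mail suggest (S after tagging, F after retraining as innocent, M after retraining as spam). The names `FIELDS`, `CODE_MEANING` and `parse_log_line` are made up for the example.

```python
from datetime import datetime, timezone

# Field order as it appears in the log output above (assumed, not authoritative).
FIELDS = ("timestamp", "code", "sender", "signature", "subject",
          "elapsed", "user", "status", "message_id")

# Meanings inferred from the three steps in this mail.
CODE_MEANING = {
    "S": "tagged as spam",
    "F": "retrained as innocent (false positive)",
    "M": "retrained as spam (missed spam)",
}


def parse_log_line(line: str) -> dict:
    """Split one tab-separated log line into named fields."""
    parts = line.rstrip("\n").split("\t")
    entry = dict(zip(FIELDS, parts))  # trailing fields may be missing
    entry["when"] = datetime.fromtimestamp(int(entry["timestamp"]), tz=timezone.utc)
    entry["meaning"] = CODE_MEANING.get(entry["code"], "unknown")
    return entry


# The three lines from above, re-joined with tabs; addresses kept as shown.
sample_log = [
    "1276595784\tS\tBrindamour Siew <lawbreak...@seawolfchile.cl>\t"
    "4,4c174e48303731540914790\t\"We will keep the sun\t0.034437\t"
    "ste...@bajic.ch\tTagged\t<4c1624e1.9060...@horngshiue.com>",
    "1276596403\tF\t<None Specified>\t4,4c174e48303731540914790\t"
    "<None Specified>\t0.049404\tste...@bajic.ch\tRetrained",
    "1276597044\tM\t<None Specified>\t4,4c174e48303731540914790\t"
    "<None Specified>\t0.054198\tste...@bajic.ch\tRetrained",
]

entries = [parse_log_line(line) for line in sample_log]
for entry in entries:
    print(entry["when"].isoformat(), entry["code"], entry["meaning"])
```

The first timestamp converts to 09:56:24 UTC on 15 June 2010, which would agree with the X-DSPAM-Processed header (11:56:24) if the host runs at UTC+2.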

It all looks fine and dandy.



>       -Jari
> 
-- 
Kind Regards from Switzerland,

Stevan Bajić

_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user
