On Tue, 15 Jun 2010 12:42:54 +0300
Jari Juslin <z...@iki.fi> wrote:

> Stevan Bajić wrote:
> > 10 years? DSPAM does not exist that long :)
>
> That's why I said "almost" ;-).
>
> > See? The whole message just has an attachment called latterly.rtf and DSPAM
> > does not tokenize attachments.
> >
> > So you limit DSPAM to just be able to tokenize stuff it finds in the
> > headers. I don't have show factors on and I add my signature just to the
> > headers but for the test I am quickly going to turn them on.
>
> I understand that it is hard for DSpam to correctly classify messages
> like that.
>
> But the bug is not about the classification; the bug is about DSpam not
> finding the signature while it exists on both message body and the headers.
>
Let's process that message:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # dspam --user ste...@bajic.ch --process < jari.juslin.eml
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

And this is the mail as I get it after doing that:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Return-Path: <lawbreak...@seawolfchile.cl>
X-Original-To: z...@localhost
Delivered-To: z...@localhost
Received: from terra.nblnetworks.fi (localhost [127.0.0.1])
        by terra.nblnetworks.fi (Postfix) with ESMTP id C42669B243
        for <z...@localhost>; Mon, 14 Jun 2010 15:52:03 +0300 (EEST)
Received: from mail.netsonic.fi [194.29.192.90]
        by terra.nblnetworks.fi with IMAP (fetchmail-6.3.9-rc2)
        for <z...@localhost> (single-drop); Mon, 14 Jun 2010 15:52:03 +0300 (EEST)
Received: from netsonic.fi ([unix socket])
        by mail.netsonic.fi (Cyrus v2.3.7-Invoca-RPM-2.3.7-4mke) with LMTPA;
        Mon, 14 Jun 2010 16:51:58 +0300
X-Sieve: CMU Sieve 2.3
Received: from leimasin.iki.fi (leimasin.iki.fi [212.16.98.49])
        by netsonic.fi (Postfix) with ESMTP id 07D901F11AA0
        for <nanos...@netsonic.fi>; Mon, 14 Jun 2010 16:51:57 +0300 (EEST)
Received: from ikiaikainen.iki.fi (r...@ikiaikainen.iki.fi [212.16.98.54])
        by leimasin.iki.fi (8.13.8/8.13.4) with ESMTP id o5ECoqxk002492
        for <jari.jus...@asetus1.silppuri.iki.fi>; Mon, 14 Jun 2010 15:50:52 +0300 (EEST):
Received: from jrkh.qtrduaf.com ([83.153.36.71])
        by ikiaikainen.iki.fi (8.14.4/8.14.4) with SMTP id o5ECoogw018334
        for <jari.jus...@iki.fi>; Mon, 14 Jun 2010 15:50:51 +0300 (EEST)
Message-ID: <4c1624e1.9060...@horngshiue.com>
Date: Mon, 14 Jun 2010 14:49:21 +0200
From: Brindamour Siew <lawbreak...@seawolfchile.cl>
MIME-Version: 1.0
To: Baillet Segerson <jari.jus...@iki.fi>
Subject: [SPAM] "We will keep the sun
Content-Type: application/octet-stream;
        name="latterly.rtf"
Content-Transfer-Encoding: base64
X-Spam-Status: No, score=2.2 required=5.0 tests=RCVD_IN_BL_SPAMCOP_NET
        autolearn=disabled version=3.2.5
X-Spam-Level: **
X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on leimasin.iki.fi
X-DSPAM-Result: Spam
X-DSPAM-Processed: Tue Jun 15 11:56:24 2010
X-DSPAM-Confidence: 0.9884
X-DSPAM-Improbability: 1 in 8501 chance of being ham
X-DSPAM-Probability: 0.9931
X-DSPAM-Signature: 4,4c174e48303731540914790

e1xydGYxXGFuc2lcYW5zaWNwZzEyNTFcZGVmZjBcZGVmbGFuZzEwNDl7XGZvbnR0Ymx7XGYw
XGZzd2lzc1xmcHJxMlxmY2hhcnNldDIwNHtcKlxmbmFtZSBBcmlhbDt9QXJpYWwgQ1lSO317
XGYxXGZzd2lzc1xmY2hhcnNldDIwNHtcKlxmbmFtZSBBcmlhbDt9QXJpYWwgQ1lSO319DQp7
XGNvbG9ydGJsIDtccmVkMFxncmVlbjBcYmx1ZTI1NTtccmVkMFxncmVlbjEyOFxibHVlMDt9
DQp7XCpcZ2VuZXJhdG9yIE1zZnRlZGl0IDQuNS4zMC4zOTc0O31cdmlld2tpbmQ0XHVjMVxw
YXJkXHNhMjAwXHNsMjc2XHNsbXVsdDFcbGFuZzlcZjBcZnMzMntcZmllbGR7XCpcZmxkaW5z
dHtIWVBFUkxJTksgImh7XCpcZGQgNC41LjMwLjM5NzQ7fXR0cDovL2NsdWJraW5nLmluZm8i
fX17XGZsZHJzbHR7XHVsXGNmMSBodHRwOi8vY2x1YmtpbmcuaW5mb319fVxmMFxjZjFcYlxm
czMyICAtIE9OTElORSBDQVNJTk8hXHBhcg0KXGxpbmVcY2YyXGJcZjBcZnMyOCBWSVAgQ0xV
QiBDYXNpbm8gaXMgYSBncmVhdCBvbmxpbmUgY2FzaW5vIHRoYXQgb2ZmZXJzIHRoZSB1bmlx
dWUgY29tYmluYXRpb24gb2YgdG9wIHF1YWxpdHkgZ2FtZXMsIGhpZ2ggcGF5b3V0cyBhbmQg
YSAyNC83IHByb2Zlc3Npb25hbCBjdXN0b21lciBzdXBwb3J0LlxwYXINClxwYXIxMDAgcHJv
Z3Jlc3NpdmUgZ2FtZXMgd2l0aCB0b3dlcmluZyBqYWNrcG90cywgd2hpY2ggYXJlIHJlYWR5
IHRvIGV4cGxvZGUgYW5kIGNhbiBtYWtlIG11bHRpLW1pbGxpb25haXJlcyBvdXQgb2YgVklQ
IENMVUIgcGxheWVycyEgRG93bmxvYWQgdGhlIHNvZnR3YXJlIGZvciBmcmVlLCBwaWNrIHVw
IHRoZSBpbmNyZWRpYmxlICQ3NzcgV2VsY29tZSBCb251cyBvbiB5b3Ugd2F5IGluIGFuZCBz
dGFydCBwbGF5aW5nICYgd2lubmluZyFccGFyDQp9DQoA
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Let me check if the signature is in MySQL:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # mysql --user=root --password=$(cat /root/.mysql.pwd) --socket=/var/run/mysqld/mysqld.sock -e "select count(uid),uid,signature,length,created_on from sysdb_dspam.dspam_signature_data where uid=4 and signature='4,4c174e48303731540914790'"
+------------+-----+---------------------------+--------+------------+
| count(uid) | uid | signature                 | length | created_on |
+------------+-----+---------------------------+--------+------------+
|          1 |   4 | 4,4c174e48303731540914790 |    456 | 2010-06-15 |
+------------+-----+---------------------------+--------+------------+
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Yes. It's there. Let me check the DSPAM log:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # grep "4,4c174e48303731540914790" /var/spool/dspam/system.log
1276595784 S Brindamour Siew <lawbreak...@seawolfchile.cl> 4,4c174e48303731540914790 "We will keep the sun 0.034437 ste...@bajic.ch Tagged <4c1624e1.9060...@horngshiue.com>
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Yes. The message is logged in the global DSPAM system log.

Since you wrote that you use a script for retraining, let me try the script
that you can find in Git:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # ./dspam-retrain-forward.pl --debug=yes --user ste...@bajic.ch --class=innocent --source=error --full-email=yes --headers-only=yes --bodies-only=no --first-only=no < jari.juslin.classified.eml
DEBUG: Found DSPAM signature '4,4c174e48303731540914790' in header.
DEBUG: /usr/bin/dspam --source=error --class=innocent --signature=4\,4c174e48303731540914790 --user stev...@bajic\.ch
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Looks good. Okay. This time, let me run it without the debug switch:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # ./dspam-retrain-forward.pl --user ste...@bajic.ch --class=innocent --source=error --full-email=yes --headers-only=yes --bodies-only=no --first-only=no < jari.juslin.classified.eml
nyx ~ # echo ${?}
0
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Execution was okay, with no error.

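For illustration, the header lookup that the debug run reports ("Found DSPAM
signature ... in header") can be sketched in a few lines of shell. This is
not the code from dspam-retrain-forward.pl; the function name
extract_dspam_signature is made up for this example, and the sketch assumes
the signature sits on a single, unfolded X-DSPAM-Signature header line, as in
the message shown earlier:

```shell
# Hypothetical helper (not part of DSPAM): print the value of the first
# X-DSPAM-Signature header found in a message read from stdin or a file.
# Assumption: the header is on one line and is not folded.
extract_dspam_signature() {
    awk '
        # A blank line (optionally with a trailing CR) ends the header block,
        # so anything that looks like the header inside the body is ignored.
        /^\r?$/ { exit }

        # Header names are case-insensitive, so compare in lower case.
        tolower($0) ~ /^x-dspam-signature:/ {
            sub(/^[^:]*:[ \t]*/, "")   # drop the header name and leading blanks
            sub(/[ \t\r]+$/, "")       # drop trailing blanks and CR
            print
            exit
        }
    ' "$@"
}
```

Run as "extract_dspam_signature < jari.juslin.classified.eml", it would be
expected to print whatever follows "X-DSPAM-Signature:" in the header block,
and nothing at all when the header is missing. That makes it a quick way to
check whether a forwarded message still carries the signature in its headers
before handing it to a retrain script.
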
Let me check again what the global DSPAM system log is telling me:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # grep "4,4c174e48303731540914790" /var/spool/dspam/system.log
1276595784 S Brindamour Siew <lawbreak...@seawolfchile.cl> 4,4c174e48303731540914790 "We will keep the sun 0.034437 ste...@bajic.ch Tagged <4c1624e1.9060...@horngshiue.com>
1276596403 F <None Specified> 4,4c174e48303731540914790 <None Specified> 0.049404 ste...@bajic.ch Retrained
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

Not bad. The script retrained DSPAM the proper way.

Let me try retraining it as spam:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # ./dspam-retrain-forward.pl --user ste...@bajic.ch --class=spam --source=error --full-email=yes --headers-only=yes --bodies-only=no --first-only=no < jari.juslin.classified.eml
nyx ~ # echo ${?}
0
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

No error. Good. And now let me see what the global DSPAM system log is telling me:
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
nyx ~ # grep "4,4c174e48303731540914790" /var/spool/dspam/system.log
1276595784 S Brindamour Siew <lawbreak...@seawolfchile.cl> 4,4c174e48303731540914790 "We will keep the sun 0.034437 ste...@bajic.ch Tagged <4c1624e1.9060...@horngshiue.com>
1276596403 F <None Specified> 4,4c174e48303731540914790 <None Specified> 0.049404 ste...@bajic.ch Retrained
1276597044 M <None Specified> 4,4c174e48303731540914790 <None Specified> 0.054198 ste...@bajic.ch Retrained
nyx ~ #
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

It all looks fine and dandy.

> -Jari
>

-- 
Kind Regards from Switzerland,

Stevan Bajić

_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user