Hello,
I'm trying to figure out if a corpus classification is working
properly. I've setup a Global Group as described in the README but it
doesn't appear to be working properly. I have a user named corpususer
and have trained it on 120756 spam and 20581 ham.
I've added the following to the group file:
corpususer:classification:*corpususer
When sending mail to a test user it does not appear to mark any mail as
spam. Looking at the debug output there is also no indication that the
corpususer tokens are being consulted. (I've included the debug outputs
below)
As a test I setup a merged group with the same corpususer by adding this
to the group file:
corpususer:merged:*
This appears to classify mail better and the debug output (listed below)
also shows the corpususer being consulted. At the moment I am using the
merged group but would much prefer to use the classification group
becuase I don't like that because I'm using a large corpus which would
override anything the user did to train their own email.
Trying to combine the 2 ideas above I tried this in the group file:
corpususer:classification:*
But unfortunately this causes a double free or corruption error in glibc
when trying to classify any message. (I saw a ticket on that
http://sourceforge.net/tracker/?func=detail&atid=1126467&aid=2990455&group_id=250683
<http://sourceforge.net/tracker/?func=detail&atid=1126467&aid=2990455&group_id=250683>
and will be posting to that ticket right after this email)
My questions are:
How should a Global Group be setup to get the results describe in the
README?
Is there any way to tell if a Global Group is being used?
I'm using dspam 3.9.0 with a postgresql backend, compiled from source
with the following options:
./configure --prefix=/usr/local/dspam --sysconfdir=/usr/local/dspam/etc
--with-storage-driver=mysql_drv,pgsql_drv
--with-mysql-includes=/usr/include/mysql
--with-pgsql-includes=/usr/include/postgresql --enable-daemon
--enable-debug -
-enable-virtual-users --enable-preferences-extension --enable-clamav
Thanks,
Ed
corpususer:classification:*corpususer debug output:
6565: [04/29/2010 14:47:37] No QuarantineAgent option found. Using
standard quarantine.
6565: [04/29/2010 14:47:37] DSPAM Instance Startup
6565: [04/29/2010 14:47:37] input args: /usr/local/dspam/bin/dspam
--stdout --deliver=innocent,spam --user testu...@testdomain.com --debug
6565: [04/29/2010 14:47:37] pass-thru args:
6565: [04/29/2010 14:47:37] processing user testu...@testdomain.com
6565: [04/29/2010 14:47:37] uid = 0, euid = 0, gid = 0, egid = 8
6565: [04/29/2010 14:47:37] loading preferences for user
testu...@testdomain.com
6565: [04/29/2010 14:47:37] _pgsql_drv_getpwnam: successful returning
struct for name: testu...@testdomain.com
6565: [04/29/2010 14:47:37] Loading preferences for uid 3856
6565: [04/29/2010 14:47:37] Loading preferences for uid 0
6565: [04/29/2010 14:47:37] Loading preferences for uid 0
6565: [04/29/2010 14:47:37] default preferences empty. reverting to
dspam.conf preferences.
6565: [04/29/2010 14:47:37] Loading preferences from dspam.conf
6565: [04/29/2010 14:47:37] using
/usr/local/dspam/var/dspam/opt-in/testu...@testdomain.com.dspam as path
6565: [04/29/2010 14:47:37] using
/usr/local/dspam/var/dspam/opt-out/testu...@testdomain.com.nodspam as path
6565: [04/29/2010 14:47:37] sedation level set to: 5
6565: [04/29/2010 14:47:37] _pgsql_drv_getpwnam: successful returning
struct for name: testu...@testdomain.com
6565: [04/29/2010 14:47:37] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
6565: [04/29/2010 14:47:39] Loading 7 BNR patterns
6565: [04/29/2010 14:47:39] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
6565: [04/29/2010 14:47:39] Whitelist threshold: 10
<snip tokens>
6565: [04/29/2010 14:47:39] Graham-Bayesian Probability: 0.002278
Samples: 15
6565: [04/29/2010 14:47:39] Burton-Bayesian Probability: 0.000018
Samples: 27
6565: [04/29/2010 14:47:39] no factors specified; using default
6565: [04/29/2010 14:47:39] Result Confidence: 1.00
6565: [04/29/2010 14:47:39] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
6565: [04/29/2010 14:47:39] Control: [10 10] [10 11] Delta: [0 1]
6565: [04/29/2010 14:47:40] total processing time: 3.01203s
6565: [04/29/2010 14:47:40] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
6565: [04/29/2010 14:47:40] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
6565: [04/29/2010 14:47:40] saving signature as
3856,4bd9d44c65657166613715
6565: [04/29/2010 14:47:40] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
6565: [04/29/2010 14:47:40] libdspam returned probability of 0.002278
6565: [04/29/2010 14:47:40] message result: NOT SPAM
6565: [04/29/2010 14:47:40] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
6565: [04/29/2010 14:47:40] delivering message
6565: [04/29/2010 14:47:40] DSPAM Instance Shutdown. Exit Code:
0
corpususer:merged:* output:
21270: [04/29/2010 13:58:49] No QuarantineAgent option found. Using
standard quarantine.
21270: [04/29/2010 13:58:49] DSPAM Instance Startup
21270: [04/29/2010 13:58:49] input args: /usr/local/dspam/bin/dspam
--stdout --deliver=innocent,spam --user testu...@testdomain.com --debug
21270: [04/29/2010 13:58:49] pass-thru args:
21270: [04/29/2010 13:58:49] processing user testu...@testdomain.com
21270: [04/29/2010 13:58:49] uid = 0, euid = 0, gid = 0, egid = 8
21270: [04/29/2010 13:58:49] loading preferences for user
testu...@testdomain.com
21270: [04/29/2010 13:58:49] _pgsql_drv_getpwnam: successful returning
struct for name: testu...@testdomain.com
21270: [04/29/2010 13:58:49] Loading preferences for uid 3856
21270: [04/29/2010 13:58:49] Loading preferences for uid 0
21270: [04/29/2010 13:58:49] Loading preferences for uid 0
21270: [04/29/2010 13:58:49] default preferences empty. reverting to
dspam.conf preferences.
21270: [04/29/2010 13:58:49] Loading preferences from dspam.conf
21270: [04/29/2010 13:58:49] using
/usr/local/dspam/var/dspam/opt-in/testu...@testdomain.com.dspam as path
21270: [04/29/2010 13:58:49] using
/usr/local/dspam/var/dspam/opt-out/testu...@testdomain.com.nodspam as path
21270: [04/29/2010 13:58:49] adding user to merged group corpususer
21270: [04/29/2010 13:58:49] sedation level set to: 5
21270: [04/29/2010 13:58:49] _pgsql_drv_getpwnam: successful returning
struct for name: testu...@testdomain.com
21270: [04/29/2010 13:58:49] _pgsql_drv_getpwnam: successful returning
struct for name: corpususer
21270: [04/29/2010 13:58:49] _pgsql_drv_getpwnam: successful returning
struct for name: testu...@testdomain.com
21270: [04/29/2010 13:58:49] _pgsql_drv_getpwnam: successful returning
struct for name: corpususer
21270: [04/29/2010 13:58:58] Loading 1033 BNR patterns
21270: [04/29/2010 13:58:58] _pgsql_drv_getpwnam: successful returning
struct for name: testu...@testdomain.com
21270: [04/29/2010 13:58:58] _pgsql_drv_getpwnam: successful returning
struct for name: corpususer
21270: [04/29/2010 13:58:58] Whitelist threshold: 10
<snip tokens>
21270: [04/29/2010 13:58:58] Graham-Bayesian Probability: 0.000000
Samples: 15
21270: [04/29/2010 13:58:58] Burton-Bayesian Probability: 0.000000
Samples: 27
21270: [04/29/2010 13:58:58] no factors specified; using default
21270: [04/29/2010 13:58:58] Result Confidence: 0.99
21270: [04/29/2010 13:58:58] _pgsql_drv_getpwnam: successful returning
struct for name: testu...@testdomain.com
21270: [04/29/2010 13:58:58] Control: [10 10] [10 11] Delta: [0 1]
21270: [04/29/2010 13:58:58] total processing time: 9.26809s
21270: [04/29/2010 13:58:58] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
21270: [04/29/2010 13:58:58] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
21270: [04/29/2010 13:58:58] saving signature as
3856,4bd9c8e2212701461562737
21270: [04/29/2010 13:58:58] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
21270: [04/29/2010 13:58:58] libdspam returned probability of 0.000000
21270: [04/29/2010 13:58:58] message result: NOT SPAM
21270: [04/29/2010 13:58:58] _pgsql_drv_getpwnam returning cached name
testu...@testdomain.com.
21270: [04/29/2010 13:58:58] delivering message
21270: [04/29/2010 13:58:58] DSPAM Instance Shutdown. Exit Code: 0
--
Ed Szynaka
Network/Systems Manager
LocalNet Corp./CoreComm Internet Services
------------------------------------------------------------------------------
_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user