-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 29/03/11 22:55, Stevan Bajić wrote: > On Tue, 29 Mar 2011 22:40:30 +0200 Tom Hendrikx <t...@whyscream.net> > wrote: > > On 29/03/11 20:44, Kenneth Marshall wrote: >>>> On Tue, Mar 29, 2011 at 09:40:19PM +0300, Ibrahim Harrani >>>> wrote: >>>>> Hi Ken, >>>>> >>>>> I added the list to dspam.conf. Should/Can I delete from >>>>> database those pattern as well? >>>>> >>>>> Thanks >>>>> >>>> >>>> Unfortunately, only the hash of the header is stored so there >>>> is not a way to identify the applicable tokens. You may want >>>> to regenerate the corpus. If you create the table with a <100% >>>> fillfactor, the training should go faster. >>>> > > That is incorrect. You can easily recreate the hash with dspam_crc, > f you know the original token: > > $ dspam_crc "Received*from+userid" TOKEN: 'Received*from+userid' CRC: > 2977181303331328604 > > vmail=# select * from dspam_token_data where token = > '2977181303331328604'; uid | token | spam_hits | > innocent_hits | last_hit > -----+---------------------+-----------+---------------+------------ > 1 | 2977181303331328604 | 4 | 2203 | 2011-03-29 > > >> That is right however one needs to know as well how the tokenizer >> has assembled those tokens. For the dull tokenizers (aka word and >> chain) things are easy but as soon as you enter the more >> intelligent tokenizers (aka osb and sbph) then things can quicky >> get complicated. >
That is true, but the OP had questions about tokens that appeared in his logfile, these are ready to use with dspam_crc. I agree that you should not try to manually reconstruct the tokens from original data: I created my example copying a tokn from the X-DSPAM-Factors mail header. Regards, Tom -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iEYEARECAAYFAk2SShsACgkQeEmCqmj6IjY5UwCghfCfnj+E8DTQ54d2TWftQyfq 9EoAn3ZuKM6bvTl4/1xjNawLki+9FId4 =+Rg7 -----END PGP SIGNATURE----- ------------------------------------------------------------------------------ Enable your software for Intel(R) Active Management Technology to meet the growing manageability and security demands of your customers. Businesses are taking advantage of Intel(R) vPro (TM) technology - will your software be a part of the solution? Download the Intel(R) Manageability Checker today! http://p.sf.net/sfu/intel-dev2devmar _______________________________________________ Dspam-user mailing list Dspam-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspam-user