-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On 29/03/11 22:55, Stevan Bajić wrote:
> On Tue, 29 Mar 2011 22:40:30 +0200 Tom Hendrikx <t...@whyscream.net>
> wrote:
> 
> On 29/03/11 20:44, Kenneth Marshall wrote:
>>>> On Tue, Mar 29, 2011 at 09:40:19PM +0300, Ibrahim Harrani
>>>> wrote:
>>>>> Hi Ken,
>>>>> 
>>>>> I added the list to dspam.conf. Should/Can  I delete from
>>>>> database those pattern as well?
>>>>> 
>>>>> Thanks
>>>>> 
>>>> 
>>>> Unfortunately, only the hash of the header is stored so there 
>>>> is not a way to identify the applicable tokens. You may want
>>>> to regenerate the corpus. If you create the table with a <100% 
>>>> fillfactor, the training should go faster.
>>>> 
> 
> That is incorrect. You can easily recreate the hash with dspam_crc,
> f you know the original token:
> 
> $ dspam_crc "Received*from+userid" TOKEN: 'Received*from+userid' CRC:
> 2977181303331328604
> 
> vmail=# select * from dspam_token_data where token =
> '2977181303331328604'; uid |        token        | spam_hits |
> innocent_hits |  last_hit 
> -----+---------------------+-----------+---------------+------------ 
> 1 | 2977181303331328604 |         4 |          2203 | 2011-03-29
> 
> 
>> That is right however one needs to know as well how the tokenizer
>> has assembled those tokens. For the dull tokenizers (aka word and
>> chain) things are easy but as soon as you enter the more
>> intelligent tokenizers (aka osb and sbph) then things can quicky
>> get complicated.
> 

That is true, but the OP had questions about tokens that appeared in his
logfile, these are ready to use with dspam_crc.

I agree that you should not try to manually reconstruct the tokens from
original data: I created my example copying a tokn from the
X-DSPAM-Factors mail header.

Regards,
        Tom
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAk2SShsACgkQeEmCqmj6IjY5UwCghfCfnj+E8DTQ54d2TWftQyfq
9EoAn3ZuKM6bvTl4/1xjNawLki+9FId4
=+Rg7
-----END PGP SIGNATURE-----

------------------------------------------------------------------------------
Enable your software for Intel(R) Active Management Technology to meet the
growing manageability and security demands of your customers. Businesses
are taking advantage of Intel(R) vPro (TM) technology - will your software 
be a part of the solution? Download the Intel(R) Manageability Checker 
today! http://p.sf.net/sfu/intel-dev2devmar
_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user

Reply via email to