Bartlomiej Moczulski skrev, on 10-09-2007 08:43:

short question to those, who mastered dspam's source: what's exactly
stored in dspam_signature_data?

Looking at my MySQL data with phpMyAdmin, I see the fields:

uid, signature, data (blob), length, created on.

Or rather: is it (in theory) possible
to recover a full message from data stored in that table?

I reckon not. If one runs debug on a retrain, one sees that the data consists of chained token hashes that are classed either as spam or ham, plus the incidence of these in all tokens. Retraining simply reverses them in the DB.

Since not all the data in a message is used for judging the spamminess of a message, and since certain parts can be excluded from the judgment, iy wouldn't be possible to reconstruct a message from the hashes.

I'm preparing a privacy policy for my users and I'm not perfectly sure
if I can write "no copies of clean messages are stored in antispam
system".

Can't parse this ...

--Tonni

--Tonni

--
Tony Earnshaw
Email: tonni at hetnet dot nl

Reply via email to