Re: [dspam-users] multiple use?

Dov Zamir Mon, 15 Jan 2007 11:05:42 -0800

????? Tom Allison:

Tony Earnshaw wrote:
Tom Allison wrote:
[...]

What about the tokens and the signature from the first instance?
What are the chances that I could do this without doing dataintegrity damage or suffering other inconsistencies in performance?
If it had been such a good idea, don't you reckon that Jonathan (whoafter all wrote dspam in the first place) would have "stumbled" on itlong ago?
--Tonni
If that was the case then why would I consider the wheel since someonemight have stumbled on that one too...
I was working on a few assumptions:
a token is a representation of essentially a regex match in eithercase, CRM114 or Bayes. Any overlap is purely coincidental.
How you manipulate the tokens, based on history, is dependent upon themethod of calculation, markov/chi-square/naive, but they are dependenton the same base history of good/bad messages and good/bad tokens.
So a signature can consist of both naive derived tokens and SPBHderived tokens.Any learning or correction of that token will be to apply a correctionto the historical count (+1/-1) in either case. So the data and it'shistory remains consistent.
The more variations you can deploy in checking for spam the better thechances that something will get trapped.
The biggest advantage that dspam can provide is a lighter weight naiveor chi-square determination, removing some of the more obvious spamvia quarantine, followed by the slower CRM114 methodology to furtherdetermine what's left over from the bayes determination.
It probably won't work because there just isn't enough data capturedabout the tokens. But if it was truely a bad idea then why do so manypeople use multiple filters to capture spam?
_________________________________________________________________________
This message has been scanned by Kibbutz Beit Kama's Anti Virus software,
and is believed to be clean of any viruses.
_________________________________________________________________________

!DSPAM:500,45aba9a6309065939618124!

IMHO you would have serious problem retraining, since you may be tryingto retrain the "wrong" instance of dspam on about half the cases. Youwould also have two dspam signatures and double headers, etc. I can seeusing two seperate anti-spam suites (and I do), but not two instances ofdspam.

_________________________________________________________________________
This message has been scanned by Kibbutz Beit Kama's Anti Virus software,
and is believed to be clean of any viruses.
_________________________________________________________________________

Re: [dspam-users] multiple use?

Reply via email to