Bill Stewart
Mon, 06 Sep 2004 13:37:58 -0700
> On Behalf Of Marcel Popescu ... > My problem is that I don't know what happens with the email in transit > (this, I believe, is an observation in the hashcash FAQ). I > am worried that some mail server might dislike ASCII characters with .... > > Hence my question: is there some "approximate" hash function > (which I could > use instead of SHA-1) which can verify that a > text hashes "very close" to a value?
nilsimsa Computes nilsimsa codes of messages and compares the codes and finds clusters of similar messages so as to trash spam.
Check out Vipul's Razor, which uses an approach similar to this. You'll find information at Cloudmark and on Sourceforge.
Vipul's Razor and related approaches try to calculate a unique id for each message so that if a human detects that a message is spam, the id can be published so everybody else trashes it. This usually needs more than one human rating something as spam to prevent abuse, and there's some tuning, but it's a good start.
--------------------------------------------------------------------- The Cryptography Mailing List Unsubscribe by sending "unsubscribe cryptography" to [EMAIL PROTECTED]