[cdr] Re: Random musing about words and spam

2003-09-06 Thread Jim Choate
On Sat, 6 Sep 2003, Eric Murray wrote: On Fri, Sep 05, 2003 at 09:01:51AM -0700, Major Variola (ret) wrote: Can we assume that the spam is generated by regexp-type programs? If so, are there good methods for inferring the regexp from examples, and using this to infer spamfiltering

Re: Random musing about words and spam

2003-09-05 Thread Eric Murray
On Thu, Sep 04, 2003 at 09:02:30PM -0400, Steve Furlong wrote: On Tuesday 02 September 2003 19:00, Thomas Shaddack wrote: Spammers recently adopted tactics of using randomly generated words, eg. wryqf, in both the subject and the body of the message. ... Could the pseudowords be easily

Re: Random musing about words and spam

2003-09-05 Thread Steve Furlong
On Tuesday 02 September 2003 19:00, Thomas Shaddack wrote: Spammers recently adopted tactics of using randomly generated words, eg. wryqf, in both the subject and the body of the message. .. Could the pseudowords be easily detected by their characteristics, .. Presence of pseudowords then

Re: Random musing about words and spam

2003-09-05 Thread Major Variola (ret)
At 09:09 PM 9/4/03 -0700, Eric Murray wrote: (it's one of about 200 checks my program makes). Can we assume that the spam is generated by regexp-type programs? If so, are there good methods for inferring the regexp from examples, and using this to infer spamfiltering rules? Good project for a

Re: Random musing about words and spam

2003-09-04 Thread Thomas Shaddack
On Wed, 3 Sep 2003, John Kozubik wrote: I must reiterate that, given the relentless efficiency of spam-spiders, merely publishing a shadow email address on all web documents that your real email address reside on, and deleting all email sent to both accounts is my current favorite anti-spam

Re: Random musing about words and spam

2003-09-03 Thread John Kozubik
Hello, On Wed, 3 Sep 2003, Thomas Shaddack wrote: Spammers recently adopted tactics of using randomly generated words, eg. wryqf, in both the subject and the body of the message. These pseudowords are random, which makes them different from real words that are made of syllables. Could the

Re: Random musing about words and spam

2003-09-03 Thread Thomas Shaddack
On Wed, 3 Sep 2003, John Kozubik wrote: Try to write the logic that distinguishes this: if_gre in the tree passes the mbuf to netisr_dispatch(), which in turn calls if_handoff(), which does something similar. ([EMAIL PROTECTED]) from this: dyeiluykxoer dyeiluykcqkutknig dyeiluykkrpmhrku

Random musing about words and spam

2003-09-03 Thread Thomas Shaddack
Spammers recently adopted tactics of using randomly generated words, eg. wryqf, in both the subject and the body of the message. These pseudowords are random, which makes them different from real words that are made of syllables. Could the pseudowords be easily detected by their characteristics,