On Sun, 2004-02-22 at 15:39, Jeff Waugh wrote:
> Well, that was certainly an interesting post.

Although he didn't provide it, his web site is remarkably informative:

http://www.nuclearelephant.com/projects/dspam/

> So, who's been having fun with their anti-spam tools recently? I'm still
> using spamassassin and bogofilter [1] these days, but finding more and more
> crap in my real inbox, thanks to all this random-text crap. Gar.

I use bogofilter [only]. When the heap-of-random-dictionary-words
technique cropped up, I was really worried - it seemed a good
workaround. For a while I started getting 15-20% false negatives.

I thought I'd have to ditch and go to a full blown SpamAssassin or
whatever setup, but I faithfully kept training for a week or two, and
suddenly, my false negatives are right back down to 1-2 per 1000.

My guess [this is entirely unscientific] is that it backfired on them.
The dictionary is relatively big, but the set of words commonly used is
*really* small in comparison. Because they use words that I and my
correspondents *never* use, the score on  uncommon words (take
"lanthanide" and "dispensary". Who are they kidding?) goes up, and they
become clear markers for spam.

[I wonder how many people are spam blocking this thread? :)]

AfC

-- 
Andrew Frederick Cowie
Operational Dynamics Consulting Pty Ltd

Australia: +61 2 9977 6866  North America: +1 646 472 5054

http://www.operationaldynamics.com/
-- 
SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/
Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html

Reply via email to