--On Tuesday, February 10, 2004 4:24 PM -0700 Mark <[EMAIL PROTECTED]> wrote:

Here I am thinking out loud while running through my spam log, would a
perl interface to a spell checker with some sort of weighting help kill
more of this junk?

i.e if there are 100 words in the body, and 75% are misspelled, add it to
the score...

This comes up regularly on the SpamAssassin lists. Another approach I'm seeing lately is to use Bayes but assign high spam probabilities to tokens that haven't been seen before. So misspelled words have high spam probabilities, and spam with lots of misspelled words gets a much higher Bayes vote.



_______________________________________________ Visit http://www.mimedefang.org and http://www.canit.ca MIMEDefang mailing list [EMAIL PROTECTED] http://lists.roaringpenguin.com/mailman/listinfo/mimedefang

Reply via email to