So, we had a brief discussion on training spam filters the other day, and
about whether you can train a filter on someone else's spam data.

I borrowed Chris's spam data, and trained my spam filter (spamoracle) on
that. I also trained it on a couple of thousand of my own messages (good
ones --- so the filter learns to tell the difference).

Happily, all my spam is now being correctly diverted to my spam folder.
Guess this means it's OK to use a database of known spams as long as you
use lots of your own email for the good examples.
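
For anyone who wants to try the same experiment, here is a minimal sketch
of the idea in Python. It is not how spamoracle works internally; it is a
plain naive Bayes word-presence classifier written for illustration, and
the class name, tokenizer, and sample messages are all made up. The point
it shows is that the spam examples can be borrowed while the good examples
come from your own mail.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split a message into lowercase word tokens (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z0-9$']+", text.lower())


class NaiveBayesFilter:
    """Toy word-presence naive Bayes filter (hypothetical, for illustration only)."""

    def __init__(self):
        # Number of training messages of each kind that contain each word.
        self.counts = {"spam": Counter(), "good": Counter()}
        # Number of training messages of each kind.
        self.messages = {"spam": 0, "good": 0}

    def train(self, text, label):
        # Count each word once per message, however often it appears.
        self.counts[label].update(set(tokenize(text)))
        self.messages[label] += 1

    def spam_score(self, text):
        """Return the log-odds that the message is spam; positive means spam."""
        score = math.log((self.messages["spam"] + 1) / (self.messages["good"] + 1))
        for word in set(tokenize(text)):
            # Add-one smoothing so unseen words do not produce log(0).
            p_spam = (self.counts["spam"][word] + 1) / (self.messages["spam"] + 2)
            p_good = (self.counts["good"][word] + 1) / (self.messages["good"] + 2)
            score += math.log(p_spam / p_good)
        return score

    def is_spam(self, text):
        return self.spam_score(text) > 0


# Spam borrowed from someone else (made-up examples).
borrowed_spam = [
    "cheap pills buy now",
    "win money now click here",
]
# Good messages taken from your own mailbox (made-up examples).
own_good = [
    "lecture notes for the compilers course",
    "meeting moved to thursday afternoon",
    "draft of the usability paper attached",
]

spam_filter = NaiveBayesFilter()
for message in borrowed_spam:
    spam_filter.train(message, "spam")
for message in own_good:
    spam_filter.train(message, "good")
```

With this toy training set, spam_filter.is_spam("buy cheap pills now")
comes out true and spam_filter.is_spam("notes for thursday lecture") comes
out false. A real filter would need thousands of messages of each kind, as
described above.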

Tim Wright

Assistant Lecturer
Department of Computer Science
University of Canterbury

"Language, like terrorism, targets civilians and generates fear to
effect political change."

  -- "Collateral Language" John Collins and Ross Glover ed.
