Sidney Markowitz <[EMAIL PROTECTED]> writes:
> Does this make sense to people, or should we just dedicate ourselves
> to making sure that Bayes processing is so efficient that there will
> be no need to treat it as a special case?
There are other slow rules. Language guessing, for example.
I'd rather devote time to:
- making code generally more efficient
- ways to make message checks more efficient in general (early exit is
one option if it actually speeds things up)
I just had an interesting idea of how to make checks much faster. What
if we did decision tree, but only to determine whether or not all rules
would be evaluated?
[DECISION TREE] -> definitely spam OR maybe spam
(there is no "maybe ham" or "ham" output from the tree, so no free
pass if a spammer figures out a safe path through the tree)
if maybe spam, then
[PERCEPTRON] -> spam or ham
if definitely spam, then
no more work to do
Daniel
--
Daniel Quinlan anti-spam (SpamAssassin), Linux,
http://www.pathname.com/~quinlan/ and open source consulting