http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5041





------- Additional Comments From [EMAIL PROTECTED]  2006-08-11 05:15 -------
Profiling on the second example showed that most of the time is spent on
check_unique_words, and then various DRUG_PAIN and FUZZY_ rules. The problem is
that no one of them takes all that much time, but they each take enough that it
adds up to a lot. The thing they have in common is that they are matching regexp
patterns with multiple things like [_\W]{0,3}. It isn't the \W in particular,
but something that gets a lot of matches per line and is variable length.
Experimenting, I didn't see enough difference to be worth bothering with by
timing matches using .{0,3} or \.{0,3} so it isn't just the character class that
is being used.

Unless someone knows of a way to do that kind of matching significantly faster,
I don't see what we can do to make the rules run faster on full message bodies,
which leaves Justin's suggestion from comment #15




------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to