http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4773





------- Additional Comments From [EMAIL PROTECTED]  2006-01-31 20:49 -------
(In reply to comment #1)
> Hi,
> 
> Thank you for your suggestion.  You may want to take a look at the FUZZY_*
> rules in 3.1 which utilize the ReplaceTags plugin to do this kind of thing
> on a generic level.  There's a FUZZY_PHARMACY rule already, but I don't
> know if we tried pharmaceutical.

I'm sorry, I was working from the SA version 3.0.4 rules.  I took a look at the
new ReplaceTags plugin and it looks like an outstanding addition.  Unfortunately,
I won't have time to test these new capabilities in the near future.  I will
upgrade to the new version as soon as I can and see how things go.

As you mentioned, I don't see any rules for 'Pharmaceutical' so it probably
could be added to the FUZZY_* rules.  I think something like:

body FUZZY_PHARMACEUTICAL /<inter W2><post P2>(?!pharmaceutical)<P><H><A><R><M><A><C><E><U><T><I><C><A><L>/i
describe FUZZY_PHARMACEUTICAL   Attempt to obfuscate words in spam
replace_rules FUZZY_PHARMACEUTICAL

added to the 25_replace.cf file would do it.

I admit, I don't completely understand how the ReplaceTags plugin works, but it
looks like it still requires every letter of the word to be present, in the
correct order.  The rules I suggested instead look for common misspellings of
the original words, where a letter has been added or dropped:

Pharmacy -> Pharamacy
Pharmaceutical -> Phrmaceutical, Pharamaceutical
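
To illustrate the difference, here is a minimal Python sketch.  It is not
SpamAssassin code: in_order_pattern() is a made-up helper and only a simplified
stand-in for what I assume the ReplaceTags expansion does (each letter in order,
with up to two non-word characters allowed in between), and MISSPELLED is a
hypothetical pattern covering just the three misspellings listed above.

```python
import re


def in_order_pattern(word):
    """Build a regex that needs every letter of `word`, in order.

    Assumption: up to two non-word characters (or underscores) may sit
    between letters.  This is a simplification for illustration, not the
    real ReplaceTags tag definitions.
    """
    gap = r"[\W_]{0,2}"
    return re.compile(gap.join(re.escape(c) for c in word), re.IGNORECASE)


# Hypothetical pattern for the misspellings listed above:
# Pharamacy, Pharamaceutical, Phrmaceutical.
MISSPELLED = re.compile(
    r"\bph(?:aramac(?:y|eutical)|rmaceutical)\b", re.IGNORECASE
)

if __name__ == "__main__":
    fuzzy = in_order_pattern("pharmacy")
    for text in ("Ph.a.r.m.a.c.y", "Pharamacy", "Phrmaceutical"):
        print(text, bool(fuzzy.search(text)), bool(MISSPELLED.search(text)))
```

In this sketch the in-order pattern matches the punctuated spelling but not
"Pharamacy", because the extra letter breaks the letter sequence, while the
misspelling pattern matches "Pharamacy" and leaves the correct spelling alone.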

So in conclusion, I will do my best to upgrade to the current version of SA and
evaluate how the new rules catch these words.  If I find that the current
version of SA does not do a good job of finding these, I will re-post to this
report (or start a new report).




------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
