http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4892





------- Additional Comments From [EMAIL PROTECTED]  2006-05-03 11:45 -------
The problem is not the word Oxon by itself. The example contains an address that
has the lines

Abingdon,
Oxon.
OX14 3JF.

which in HTML look like

Abingdon,<BR>Oxon.<BR>OX14 3JF.<BR>

When the HTML tags are removed to process the text in the body, the string

Oxon.
OX14

is a fuzzy match for 'xanax' in the FUZZY_XPILL_BODY rule. The initial 'O' and
the final '14' are ignored, as are newlines and spaces, so the text that
actually fuzzily matches 'xanax' is xon.OX.
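
To make the mechanism concrete, here is a minimal Python sketch. The pattern
below is NOT the real FUZZY_XPILL_BODY rule; it is a hypothetical, simplified
stand-in that assumes 'o' is accepted as a lookalike for 'a' and that any run
of non-word characters may sit between letters. The helper name strip_tags is
also just for illustration.

```python
import re

# Hypothetical stand-in for a fuzzy 'xanax' pattern (an assumption, not the
# actual SpamAssassin rule): letters may be separated by non-word characters,
# and 'a' may be written as a lookalike such as 'o', '0' or '@'.
SEP = r"[\W_]*"
FUZZY_XANAX = re.compile(
    SEP.join(["x", "[ao0@]", "n", "[ao0@]", "x"]),
    re.IGNORECASE,
)

def strip_tags(html: str) -> str:
    """Naively drop every HTML tag, joining the surrounding text directly."""
    return re.sub(r"<[^>]*>", "", html)

html = "Abingdon,<BR>Oxon.<BR>OX14 3JF.<BR>"
text = strip_tags(html)            # the address lines run together
match = FUZZY_XANAX.search(text)   # matches across the former line break

print(repr(text))
print(match.group(0) if match else None)
```

With this stand-in pattern, the match covers the end of one address line and
the start of the next, which is the same shape of false positive described
above.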

Whether that should be a match I leave for someone with more familiarity with
the fuzzy match rules to decide now that the problem has been narrowed down.

I do wonder whether <br> should be replaced with a newline, and whether the
fuzzy match should be kept from spanning lines, if that is possible given the
way we extract text from HTML.
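
As a rough illustration of that idea, the Python sketch below turns <br> into
a newline and then applies a fuzzy pattern to each line separately. Everything
here is an assumption for the sake of the example: the pattern is a
hypothetical simplification rather than the real FUZZY_XPILL_BODY rule, and
html_to_lines is an invented helper, not how SpamAssassin renders HTML.

```python
import re

# Hypothetical simplified fuzzy 'xanax' pattern (an assumption, not the real
# rule): optional non-word separators, 'o'/'0'/'@' accepted in place of 'a'.
SEP = r"[\W_]*"
FUZZY_XANAX = re.compile(
    SEP.join(["x", "[ao0@]", "n", "[ao0@]", "x"]),
    re.IGNORECASE,
)

def html_to_lines(html: str) -> list[str]:
    """Turn <br> into a newline, drop other tags, return non-empty lines."""
    with_breaks = re.sub(r"<br\s*/?>", "\n", html, flags=re.IGNORECASE)
    text = re.sub(r"<[^>]*>", "", with_breaks)
    return [line for line in text.split("\n") if line.strip()]

def fuzzy_hits(lines: list[str]) -> list[str]:
    """Return only the lines that match on their own, never across lines."""
    return [line for line in lines if FUZZY_XANAX.search(line)]

address_lines = html_to_lines("Abingdon,<BR>Oxon.<BR>OX14 3JF.<BR>")
address_hits = fuzzy_hits(address_lines)        # address no longer matches
obfuscated_hits = fuzzy_hits(["buy x a n a x"]) # in-line obfuscation still does

print(address_lines, address_hits, obfuscated_hits)
```

Under these assumptions the address stops matching because no single line
contains the whole pattern, while an obfuscated spelling confined to one line
is still caught. The obvious trade-off is that a spammer could split the word
across lines with <br> to evade the rule.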




------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
