https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6317
Adam Katz <[email protected]> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |[email protected] --- Comment #1 from Adam Katz <[email protected]> 2010-02-01 10:46:00 UTC --- This stems from a list conversation archived at http://old.nabble.com/forum/ViewPost.jtp?post=27384882&framed=y and my tests were also mentioned in another thread from last week at http://old.nabble.com/forum/ViewPost.jtp?post=27328212&framed=y I'm not sure I agree with the full concept though, and I think my participatory remarks may have been misread. Bayesian rules already examine From and Subject fields in addition to the body, and they rightly mark the collected words with the field name (e.g. "from:adam" is a word plucked by Bayes when it sees "Adam Katz" in the From header, with the colon being a forbidden character in standard word parsing. This is not necessarily the exact mechanism SA uses to delimit, but it is close.) The topic that spurred this request was related to spamvertised websites that appear in the From header rather than the body and thus are immune to SA's uri detection. Martin has abstracted this idea to all body tests, which may not be as wise. Furthermore, URI detection for the From header may be a frivolous exercise, as my tests at http://ruleqa.spamassassin.org/?rule=/FROM_W&srcpath=khop seem to indicate that *any* URI in this location is itself a strong an indicator of spam. Further parsing is therefore unnecessary. Publishing this rule with SA before legit mail starts clutching this concept might deter its adoption. -- Configure bugmail: https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
