https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6675
Bug #: 6675
Summary: HTML_TITLE_SUBJ_DIFF hits subject "<3"
Product: Spamassassin
Version: 3.4.0
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P2
Component: Rules
AssignedTo: [email protected]
ReportedBy: [email protected]
Classification: Unclassified
I just got a false positive on an email largely because its entire subject was
"<3" (ascii heart), which hits HTML_TITLE_SUBJ_DIFF, which has a score of 2.2.
Doesn't seem like it should have hit that rule.
In 20_html_tests.cf:
meta HTML_TITLE_SUBJ_DIFF __HTML_TITLE_SUBJ_DIFF && !__MIME_ATTACHMENT
body __HTML_TITLE_SUBJ_DIFF eval:html_title_subject_ratio('3.5')
No description.
html_title_subject_ratio seems to come from the HTMLEval plugin, lacking a man
page ( Mail::SpamAssassin::Plugin::HTMLEval ). Looks like it's converting "<3"
to html, coming up with a result of an empty string by ignoring the fact that
there is no ">", and deciding that because the length ratio of an empty string
to the original 2 character string is bad.
http://ruleqa.spamassassin.org/?daterev=20111008-r1180336-n&rule=HTML_TITLE_SUBJ_DIFF&srcpath=&g=Change
The ham to spam ratio of this rule is terrible. Why is it in the default rule
set, and why does it have such a high score?
--
Configure bugmail:
https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.