https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7193
Bug ID: 7193
Summary: Misfiring of SUBJECT_DRUG_GAP_C
Product: Spamassassin
Version: 3.4 SVN branch
Hardware: PC
OS: Windows 7
Status: NEW
Severity: normal
Priority: P2
Component: Rules
Assignee: [email protected]
Reporter: [email protected]
Jari Fredriksson reports that Capitalist is hitting SUBJECT_DRUG_GAP_C
>> The Subject is in this case:
>>
>> Subject: DealBook: European Antitrust Investigation to Affect U.S.
>> Tech Firms | Fears About Bond Market Volatility | Netflix Objects
>> to AT&T-DirecTV Merger | Value of Celebrity Venture Capitalists
>
> header SUBJECT_DRUG_GAP_C Subject =~
> /\bc.{0,2}i.{0,2}a.{0,2}l.{0,2}i.{0,2}s\b/i describe
> SUBJECT_DRUG_GAP_C Subject contains a gappy version of 'cialis'
>
> "Capitalists" looks like the word "cialis" with extra letters mixed
> in. Maybe this test should look for non-alphabetic characters between
> the letters?
>
Changed to /\bc[\sc]{0,2}i[\si]{0,2}a[\sa]{0,2}l[\sl]{0,2}i[\si]{0,2}s{1,3}\b/i
This hits on a real test cases below but doesn't hit on capitalist:
cciiaalliisss Tyenes TYENES
Great - C II A L I S - Woldrwide shipping - Interneet storre
PLANKS C I A L I S
Regards,
KAM
--
You are receiving this mail because:
You are the assignee for the bug.