https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7193

            Bug ID: 7193
           Summary: Misfiring of SUBJECT_DRUG_GAP_C
           Product: Spamassassin
           Version: 3.4 SVN branch
          Hardware: PC
                OS: Windows 7
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Rules
          Assignee: [email protected]
          Reporter: [email protected]

Jari Fredriksson reports that Capitalist is hitting SUBJECT_DRUG_GAP_C 

>> The Subject is in this case:
>>
>> Subject:  DealBook: European Antitrust Investigation to Affect U.S.
>> Tech Firms | Fears About Bond Market Volatility | Netflix Objects
>> to AT&T-DirecTV Merger | Value of Celebrity Venture Capitalists
>
> header SUBJECT_DRUG_GAP_C   Subject =~
> /\bc.{0,2}i.{0,2}a.{0,2}l.{0,2}i.{0,2}s\b/i describe
> SUBJECT_DRUG_GAP_C Subject contains a gappy version of 'cialis'
>
> "Capitalists" looks like the word "cialis" with extra letters mixed
> in. Maybe this test should look for non-alphabetic characters between
> the letters?
>


Changed to /\bc[\sc]{0,2}i[\si]{0,2}a[\sa]{0,2}l[\sl]{0,2}i[\si]{0,2}s{1,3}\b/i

This hits on a real test cases below but doesn't hit on capitalist:

cciiaalliisss Tyenes TYENES 
Great - C II A L I S - Woldrwide shipping - Interneet storre 
PLANKS C I A L I S 

Regards,
KAM

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to