On 5/8/2015 6:42 AM, Jari Fredriksson wrote:
On 5/6/2015 10:11 AM, Jari Fredriksson wrote:
>> The Subject is in this case:
>>
>> Subject: DealBook: European Antitrust Investigation to Affect U.S.
>> Tech Firms | Fears About Bond Market Volatility | Netflix Objects
>> to AT&T-DirecTV Merger | Value of Celebrity Venture Capitalists
>
> header SUBJECT_DRUG_GAP_C Subject =~
> /\bc.{0,2}i.{0,2}a.{0,2}l.{0,2}i.{0,2}s\b/i describe
> SUBJECT_DRUG_GAP_C Subject contains a gappy version of 'cialis'
>
> "Capitalists" looks like the word "cialis" with extra letters mixed
> in. Maybe this test should look for non-alphabetic characters between
> the letters?
>
Good call on capitalists.
/\bc.{0,2}i.{0,2}a.{0,2}l.{0,2}i.{0,2}s\b/i
Is this what you are thinking?
/\bc[^a-z0-9]{0,2}i[^a-z0-9]{0,2}a[^a-z0-9]{0,2}l[^a-z0-9]{0,2}i[^a-z0-9]{0,2}s\b/i