On Jun 25, 2014, at 3:00 PM, Axb <[email protected]> wrote:

> On 06/25/2014 10:37 PM, Philip Prindeville wrote:
>> 
>> On Jun 25, 2014, at 3:09 AM, Axb <[email protected]> wrote:
>> 
>>> On 06/25/2014 03:07 AM, Philip Prindeville wrote:
>>> 
>>>> Anyone have rules to catch these they could point me at?  Or any empirical 
>>>> evidence about how successful they’ve been with such?
>>> 
>>> Wouldn't use this for a rule unless you meta it with lots of other traits
>>> 
>>> the rawbody /href\=\"#\"/ plus other traits could be combined.
>>> 
>>> Can you pastebin a sample ?
>>> 
>> 
>> 
>> Sure:
>> 
>> http://pastebin.com/4QFUZ6vd
> 
> 
> the href template bork + the Base8 hashes are giveaways.
> meta those rawbody traits together and you're rocking (for a while)
> 

Sorry, which base8 hashes?

Also, I’m noticing the tracking info following the href…

F1B9215E-B1D0-40BC-92D1-F13D501596B7;F1B9215E-B1D0-40BC-92D1-F13D501596B7;F1B9215E-B1D0-40BC-92D1-F13D501596B7;F1B9215E-B1D0-40BC-92D1-F13D501596B7;F1B9215E-B1D0-40BC-92D1-F13D501596B7;F1B9215E-B1D0-40BC-92D1-F13D501596B7

Including 6 distinct UUID’s would seem to be useful.  Including the same UUID 6 
times seems broken.

Perhaps a pattern like:

body /((;[A-F0-9]{8}-[A-F0-9]{4}-[A-F0-9]{4}-[A-F0-9]{4}-[A-F0-9]{12})){4,}/

would be… no, wait… we’d need to save the first one, and then check for 3 or 
more recurrences of the exact same literal string.

rawbody L_REPEATING_UUIDS       /<a href="\#" 
.*(;[A-F0-9]{8}-[A-F0-9]{4}-[A-F0-9]{4}-[A-F0-9]{4}-[A-F0-9]{12}){4,}>/i
describe L_REPEATING_UUIDS      Seeing the same tracking info repeated
score L_REPEATING_UUIDS         0.1

Reply via email to