On Mon, 2009-10-05 at 11:21 -0700, John Hardin wrote: > On Mon, 5 Oct 2009, Warren Togami wrote: > > > Did the old rule decode %2E%63%6E as .cn though? > > The URI parser does that for you: > > [11433] dbg: rules: ran uri rule ALL_URI ======> got hit: > "http://fnord:b...@321%2e%63%6e" > [11433] dbg: rules: ran uri rule ALL_URI ======> got hit: "http://321.cn" > [11433] dbg: rules: ran uri rule ALL_URI ======> got hit: > "http://fnord:b...@321.cn"
Didn't I say that? ;) The list of URIs does contain cleaned and decoded versions, in addition to the raw URI. -- char *t="\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4"; main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1: (c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}