On Thu, 21 Sep 2006, mouss wrote:
> Theo Van Dinter wrote:
> > On Tue, Sep 19, 2006 at 10:58:46PM +0200, mouss wrote:
> >
> >> URI_NOVOWEL fires with things like href="#id" where id is a string that
> >> starts with 7 "no-vowel" chars.
> >>
> >> uri URI_NOVOWEL m%^https?://[^/?]*[bcdfghjklmnpqrstvwxz]{7}%i
> >> uri URI_NOVOWEL m%^https?://[^/?\#]*[bcdfghjklmnpqrstvwxz]{7}%i
> >>
> >> is this correct?
> >
> > That depends on your definition of "correct". The RE looks ok, but the
> > hitrate could change dramatically. It's hard to say without testing.
>
> my understanding is that the rule looks for "dummy" hostnames in the
> server part. unfortunately, the way URIs are "exposed" by SA, this rule
> also applies to any thing that resembles a URI. This is a problem with
> relative URIs (aka href="foo.html" if foo matches the rule).
Erm. How can it match relative and "#gibberish" URIs at all if the RE
is explicitly anchored to "https?://" at the start of the URI?
--
John Hardin KA7OHZ ICQ#15735746 http://www.impsec.org/~jhardin/
[EMAIL PROTECTED] FALaholic #11174 pgpk -a [EMAIL PROTECTED]
key: 0xB8732E79 - 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
If someone has a gun and is trying to kill you, it would be
reasonable to shoot back with your own gun.
-- the Dalai Lama, May 15, 2001
-----------------------------------------------------------------------