On Feb 19,  5:17pm, Dan Harkless wrote:
> Subject: Re: how to filter only certain URL's?
> 
> [EMAIL PROTECTED] (Gary Funck) writes:
> > Dan, sorry to trouble you
> 
> Please post to [EMAIL PROTECTED] rather that putting the onus on one person to
> answer you (and depriving everyone else of the information).

Okay. no prob.

> 
> > - but that RPM URL that I mentioned,
> > appeared to be version that has pattern matching in it, but now it
> > appears that this version has some sort of shell-like globbing,
> > but doesn't have the regex stuff.  I actually would prefer the
> > regex version for what      I'm trying to do, and there's no docs.
> > on how the globbing works, how much of a pathname I can use it
> > on, etc.  (ie, does it only match the part of the URL after the
> > rightmost slash?)

I experimented with the above mentioned "globbing", but couldn't
figure out how it works (though admittedly I didn't try firing up
the debugger to see what's going on).

One thing that the matching did appear to doing however, is
first *downloading the entire page* before making the decision
as to whether to keep the page or not.  This is decidely not the
preferred implementationn -- it wastes bandwidth.

> > 
> > If you can point me at a copy of the regex version, that'd be
> > great.  thanks, - Gary
> 
> The sum of my knowledge on 1.5.3gold is what I read in the author's post.
> You could try emailing him.  Also, perhaps the patch in the email I
> forwarded successfully applies to the RPM'd version and converts it to
> regexp.

Thanks - I didn't notice the patchset attachment when I first read
your e-mail.  I'll give it a shot.

Reply via email to