On Feb 19, 5:17pm, Dan Harkless wrote:
> Subject: Re: how to filter only certain URL's?
>
> [EMAIL PROTECTED] (Gary Funck) writes:
> > Dan, sorry to trouble you
>
> Please post to [EMAIL PROTECTED] rather than putting the onus on one person to
> answer you (and depriving everyone else of the information).
Okay. no prob.
>
> > - but that RPM URL that I mentioned
> > appeared to be a version that has pattern matching in it, but now it
> > appears that this version has some sort of shell-like globbing,
> > but doesn't have the regex stuff. I actually would prefer the
> > regex version for what I'm trying to do, and there are no docs
> > on how the globbing works, how much of a pathname I can use it
> > on, etc. (i.e., does it only match the part of the URL after the
> > rightmost slash?)
I experimented with the above-mentioned "globbing", but couldn't
figure out how it works (though admittedly I didn't try firing up
the debugger to see what's going on).
One thing that the matching did appear to be doing, however, is
first *downloading the entire page* before making the decision
as to whether to keep the page or not. This is decidedly not the
preferred implementation -- it wastes bandwidth.
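
To make the two open questions concrete (what part of the URL a glob is
tested against, and whether the decision can be made before downloading),
here is a minimal Python sketch. It is not wget's code and says nothing
about how 1.5.3gold actually behaves; every function name and the
example.com URL are made up for illustration.

```python
import fnmatch
import posixpath
import re
from urllib.parse import urlsplit


def glob_matches_basename(url, pattern):
    """Test the glob against only the part after the rightmost slash."""
    path = urlsplit(url).path
    return fnmatch.fnmatchcase(posixpath.basename(path), pattern)


def glob_matches_path(url, pattern):
    """Test the glob against the whole path component of the URL.

    Note: unlike a shell glob, Python's fnmatch lets '*' match '/' too.
    """
    return fnmatch.fnmatchcase(urlsplit(url).path, pattern)


def regex_matches_url(url, regex):
    """Test a regular expression against the full URL string."""
    return re.search(regex, url) is not None


def urls_to_fetch(candidate_urls, accept_regex):
    """Filter on the URL text alone, so rejected URLs are never requested."""
    return [u for u in candidate_urls if regex_matches_url(u, accept_regex)]


if __name__ == "__main__":
    url = "http://example.com/pub/rpms/wget-1.5.3.rpm"  # hypothetical URL
    print(glob_matches_basename(url, "*.rpm"))   # basename-only matching
    print(glob_matches_basename(url, "pub/*"))   # directory part is not seen
    print(glob_matches_path(url, "/pub/*"))      # whole-path matching
    print(urls_to_fetch([url, "http://example.com/index.html"], r"\.rpm$"))
```

The point of urls_to_fetch is only that the accept/reject test needs
nothing but the URL string, so in principle it can run before any request
is made. A crawler may still have to fetch HTML pages it intends to discard
in order to find the links inside them.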
> >
> > If you can point me at a copy of the regex version, that'd be
> > great. thanks, - Gary
>
> The sum of my knowledge on 1.5.3gold is what I read in the author's post.
> You could try emailing him. Also, perhaps the patch in the email I
> forwarded successfully applies to the RPM'd version and converts it to
> regexp.
Thanks - I didn't notice the patchset attachment when I first read
your e-mail. I'll give it a shot.