I am trying to solve a similar problem: I want to extract only the <a href=""> URLs from the response content. I am not interested in parsing other tags or collecting their URI information.
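
For that narrower case, a small hand-rolled scanner may be enough. The following is only a minimal sketch in plain C, assuming the whole response body is already available as one NUL-terminated string. The names extract_a_hrefs, URL_MAX, starts_with_ci and copy_value are hypothetical helpers made up for illustration; they are not part of APR, httpd, or libxml. It is not a real HTML parser: it does not skip comments or <script> blocks, and it does not decode entities such as &amp;.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>

#define URL_MAX 256  /* hypothetical cap on the stored URL length */

/* Case-insensitive test: does the text at p begin with `word`? */
static int starts_with_ci(const char *p, const char *word)
{
    while (*word) {
        if (tolower((unsigned char)*p) != tolower((unsigned char)*word))
            return 0;
        p++;
        word++;
    }
    return 1;
}

/* Copy [start, end) into dst, truncating to URL_MAX - 1 characters. */
static void copy_value(char *dst, const char *start, const char *end)
{
    size_t len = (size_t)(end - start);
    if (len >= URL_MAX)
        len = URL_MAX - 1;
    memcpy(dst, start, len);
    dst[len] = '\0';
}

/* Store the first href of each <a ...> tag in out[]; return how many were found. */
size_t extract_a_hrefs(const char *html, char out[][URL_MAX], size_t max_out)
{
    size_t count = 0;
    const char *p = html;

    assert(html != NULL);

    while (count < max_out && (p = strchr(p, '<')) != NULL) {
        p++;  /* step past '<' */
        /* Accept only "<a" followed by whitespace, so <abbr> or <area> are skipped. */
        if (!((p[0] == 'a' || p[0] == 'A') && isspace((unsigned char)p[1])))
            continue;
        p++;  /* step past 'a' */

        /* Walk the attributes until the tag closes. */
        while (*p && *p != '>') {
            if (*p == '"' || *p == '\'') {
                /* Skip other quoted values so "href" or '>' inside them is ignored. */
                const char *close = strchr(p + 1, *p);
                if (close == NULL)
                    return count;  /* unterminated quote: stop scanning */
                p = close + 1;
                continue;
            }
            if (isspace((unsigned char)p[-1]) && starts_with_ci(p, "href")) {
                const char *q = p + 4;
                while (isspace((unsigned char)*q))
                    q++;
                if (*q == '=') {
                    const char *start, *end;
                    q++;
                    while (isspace((unsigned char)*q))
                        q++;
                    if (*q == '"' || *q == '\'') {
                        start = q + 1;
                        end = strchr(start, *q);
                        if (end == NULL)
                            return count;  /* unterminated value */
                    } else {
                        /* Unquoted value: runs to whitespace or '>'. */
                        start = end = q;
                        while (*end && !isspace((unsigned char)*end) && *end != '>')
                            end++;
                    }
                    copy_value(out[count++], start, end);
                    p = end;
                    break;  /* only the first href per tag is recorded */
                }
            }
            p++;
        }
    }
    return count;
}
```

A caller would declare something like char urls[16][URL_MAX], pass it with the buffered body, and log the first n entries. Inside an output filter the body normally arrives in several pieces, so a tag can be split across them; this sketch assumes you have collected the pieces first. For anything beyond well-behaved markup, a real parser such as libxml is probably the safer choice.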

-Mani

----- Original Message -----
From: "Tim Bray" <[EMAIL PROTECTED]>
To: <modules-dev@httpd.apache.org>
Sent: Friday, August 17, 2007 10:17 PM
Subject: Re: Extract Url from response page


In general it's a hard problem. How much do you know about the possible format of the response page? Does it use AJAX magic to pull in pieces via XMLHttpRequest? Do you only care about URIs in <a href="">, or are you also interested in <link> or even URIs embedded in running text? -Tim

On Aug 17, 2007, at 7:00 AM, prasanna wrote:

I am looking for something from APR or the core Apache libraries. If APR doesn't provide one, then we will go with libxml.


Ray Morris wrote:
   See libxml.
--
Ray B. Morris
[EMAIL PROTECTED]

Strongbox - The next generation in site security:
http://www.bettercgi.com/strongbox/



On 08/17/2007 04:03:24 AM, prasana wrote:
Hi,

Are there any API functions to extract URLs from a response page? I
need to log all the URLs in the response page.

Thanks in Advance.







