Question concerning parse_html

bobmin Thu, 21 Sep 2000 01:47:14 -0700

Greetings, 
 I am wondering if someone can shed some light on this problem. 

 The following code fragment has been sucessful in extracting links 
to nearly every url I've tried in the last few weeks.

# Parse the html, breaking out the links

$tree=HTML::Parse::parse_html($body);
$link_refs=$tree->extract_links(); # pull out ALL links, images etc.
@links=sort(@$link_refs);

$tree=$tree->delete();

 Unfortunately this evening while continuing my testing, I ran into a 
problem concerning the html found at www.trainyardsoftware.com. 
Even though this looks like perfectly good html to me, the code is 
failing to detect any links at all.

 Could my server be using an out of date html module? And if so, 
how can I found out the version number of any of the modules that 
are installed?

        Thanks
        Bob Minnick, Northern Webs
      http://www.northernwebs.com

Question concerning parse_html

Reply via email to