Greetings,
I am wondering if someone can shed some light on this problem.
The following code fragment has been successful in extracting links
from nearly every URL I've tried in the last few weeks:

    # Parse the HTML, breaking out the links
    $tree = HTML::Parse::parse_html($body);
    $link_refs = $tree->extract_links();  # pull out ALL links, images etc.
    @links = sort(@$link_refs);
    $tree = $tree->delete();

Unfortunately, while continuing my testing this evening, I ran into a
problem with the HTML found at www.trainyardsoftware.com.
Even though it looks like perfectly good HTML to me, the code
fails to detect any links at all.
Could my server be using an out-of-date HTML module? And if so,
how can I find out the version number of any of the modules that
are installed?
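
For reference, one common way to check an installed module's version is
to load it and ask for its VERSION. The sketch below assumes each module
sets a $VERSION variable, which most CPAN modules do but which is not
guaranteed. module_version() is a hypothetical helper name, and
File::Spec is included only as an example of a module that ships with
Perl.

```perl
use strict;
use warnings;

# Return the version of an installed module, or undef if the module
# cannot be loaded or does not define $VERSION.
# module_version() is a hypothetical helper, not part of any library.
sub module_version {
    my ($module) = @_;

    # Accept only plain package names before passing the string to eval.
    return undef unless $module =~ /\A\w+(?:::\w+)*\z/;

    # Try to load the module; a failed require yields undef.
    return undef unless eval "require $module; 1";

    # UNIVERSAL::VERSION returns the package's $VERSION, if any.
    return $module->VERSION;
}

for my $module (qw(HTML::Parse HTML::Element File::Spec)) {
    my $version = module_version($module);
    printf "%-15s %s\n", $module,
        defined $version ? $version : 'not installed, or no $VERSION';
}
```

The same check can usually be done from the shell with a one-liner such
as perl -MHTML::Parse -e 'print $HTML::Parse::VERSION', again assuming
the module defines $VERSION.
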
Thanks
Bob Minnick, Northern Webs
http://www.northernwebs.com