Esteban Soto's bits of Sat, 13 Jul 2002 translated to:

>i'm trying to index a site that has pages generated by php, those pages
>contain links to other html docs but htdig seems to be ignoring those links.
>
>in www.lansys.cl for instance: it doesn't follow the link to
>
>010402 PDFs vs HTMLs para publicación WEB
>
>in fact it doesn't follow any link that is generated by php.
>
>how can i make it index those links?

The problem is the JavaScript, not the PHP. The less-than signs (<) in the
if statements are interpreted as the start of new tags. This throws the
parser off, so it fails to match the closing script tag with the opening
one. Because htdig then processes the rest of the page as if it were still
inside the script tag, the remaining URLs on the page are not followed.

Your JavaScript should really be enclosed in HTML comments for the sake of
browsers that, either by design or by configuration, cannot process
JavaScript. This would also allow htdig to process the page correctly.

Jim

_______________________________________________
htdig-general mailing list
FAQ: http://htdig.sourceforge.net/FAQ.html
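
As a rough illustration of the fix Jim describes, an inline script can be wrapped in an HTML comment along the lines below. This is only a sketch: the script body, the variable name `count`, and the link target `docs/example.html` are made up for the example and are not taken from www.lansys.cl.

```html
<html>
<body>

<script type="text/javascript">
<!-- start of HTML comment: hides the script body from non-JavaScript parsers
// Hypothetical script body. The "<" in the comparison below is the kind of
// character the reply says gets mistaken for the start of a new tag.
var count = 0;
if (count < 10) {
  count = count + 1;
}
// end of HTML comment; the leading "//" keeps JavaScript from running this line -->
</script>

<!-- Hypothetical link placed after the script, standing in for the
     PHP-generated links that were not being followed. -->
<a href="docs/example.html">Example document</a>

</body>
</html>
```

Browsers that run JavaScript treat the opening `<!--` line as a one-line comment, and the `//` in front of `-->` hides the closing marker from the script engine, so the script itself behaves the same as before. Avoid a double hyphen (for example a `--` decrement) inside the wrapped script, since some parsers may take it as the end of the comment.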

