Esteban Soto's bits of Sat, 13 Jul 2002 translated to:

>i'm trying to index a site that has pages genereted by php, those pages
>contain
>links to other html docs but htdig seems to be ignoring those links.
>
>in www.lansys.cl for instance: it doesn't follow the link to
>
>010402 PDFs vs HTMLs para publicaci�n WEB
>
>in fact it doesn't follow any link that is generated by php.
>
>how can i make it index those links?

The problem is the JavaScript. The less-than signs (<) in the if
statements are interpreted as starting new tags. This throws
things off, resulting in a failure to match the closing script
tag with the opening tag. Since htdig processes the rest of the
page as if it were still inside the script tag, remaining URLs on
the page are not followed.

Your JavaScript should really be enclosed in HTML comments for
the sake of browsers that are either by design or configuration
not able to process JavaScript. This would also allow htdig to
correctly process the page.

Jim



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to