I am running the latest version, 3.1.2.  I just changed one of the links to an
absolute URL, and the results are the same: htdig sees the HREF in the main
index.html file (the one that links to the page with all the tables), but it
never "pushes" it into the queue.  It tries to "resolve" it, but never pushes it.

I am running some more tests now....

Gilles Detillieux wrote:

> According to scottb:
> > The main URL is in the form of:
> >
> > http://www.somewhere.com
> >
> > The page that is getting overlooked is:
> >
> > http://www.somewhere.com/MyDir/index.html
> >
> > The page in question (the second URL above) contains a table with about 70
> > links to other subdirs and pages beneath MyDir:
> >
> > http://www.somewhere.com/MyDir/A/index.html
> > http://www.somewhere.com/MyDir/B/index.html
> > ..
> >
> > The link at the top level index page (first URL above) is a relative URL, not
> > absolute (href=/MyDir/index.html).
> >
> > My "limit_urls_to" keyword is simply set to the "start_url":
> >
> > start_url: http://www.somewhere.com
> > limit_urls_to: ${start_url}
> >
> > Question: will htdig convert the relative URLs to absolute URLs using the FQDN,
> > or do I need to add "MyDir" or something to limit_urls_to?
>
> htdig will convert relative URLs to absolute, so that should work fine,
> as long as the HTML for the link is properly structured.  You don't need
> (and likely don't want) to add MyDir to limit_urls_to.
>
> You haven't mentioned which version of htdig you're running.  If it's
> an older version, it may be missing the link in your main index page
> (particularly if the closing </a> tag is missing).  What does the link
> look like?  It is an HTML link, and not JavaScript, right?
>
> --
> Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
> Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
> Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
> Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
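
For anyone following the thread, the relative-to-absolute conversion discussed
above can be checked by hand.  The snippet below is only a rough sketch in
Python, not htdig's actual code: resolve_and_filter is a hypothetical helper,
and a plain prefix test stands in for the limit_urls_to check (htdig's real
matching rules may differ).

```python
from urllib.parse import urljoin


def resolve_and_filter(base_url, href, limit_prefix):
    """Resolve href against base_url and report whether it passes the limit.

    Hypothetical helper for illustration only; the prefix comparison is an
    assumed stand-in for htdig's limit_urls_to handling.
    """
    absolute = urljoin(base_url, href)  # standard relative-reference resolution
    allowed = absolute.startswith(limit_prefix)
    return absolute, allowed


start_url = "http://www.somewhere.com"

# The link from the top-level index page, as described in the thread.
print(resolve_and_filter(start_url, "/MyDir/index.html", start_url))

# A link to a different host would fail the prefix test.
print(resolve_and_filter(start_url, "http://other.example.org/x", start_url))
```

If the resolved URL comes out with a different host name or scheme than
start_url, that would be one thing to look at when a link is resolved but
never queued.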
