According to Thomas Guettler:
> On Sun, Jan 06, 2002 at 03:29:22PM -0600, Geoff Hutchison wrote:
> > At 11:56 AM +0100 1/6/02, Thomas Guettler wrote:
> > >I got the following error:
> > >...
> > >It took me some time to find this. Can someone add this to the FAQ, or
> > >patch it?
> > 
> > What about the FAQ: <http://www.htdig.org/FAQ.html#q5.28>. Is this unclear?
> 
> Yes, it is unclear it. It doesn't tell me to put a slash at the end of
> http://foo/~foo.

Of course it doesn't tell you to put the slash at the end of your URLs
because that's not the root cause of the error you were getting, but just
a side issue.  There are many reasons why URLs might get rejected, and FAQ
5.28 refers to 3 other FAQs that help figure out why.  In the specifics of
your case, question 5.27 would tell you about the "URL not in the limit!"
error.

> If you don't do this you get redirected from ~foo to
> ~foo/ which is not in the limits:
...
> redirect: http://sonne.heaven/~guettli/
> 
>    Rejected: URL not in the limits!pick: localhost, # servers = 1

htdig does handle redirects properly, and redirects are standard practice
for directory URLs that are missing the required trailing slash.  The
problem is your start_url is set to "http://localhost/~guettli";, so
presumably the limit_urls_to value will default to the same setting,
but the redirect uses a different host name, so it doesn't match your
limit_urls_to setting.

Adding the trailing slash to your start_url neatly avoids the problem, but
it will happen again if htdig finds a link elsewhere in your documents
that triggers another redirect.  The proper solution is to make sure the
correct host name is used in start_url or added to limit_urls_to.  See
http://www.htdig.org/attrs.html for information on all config attributes.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to