Thank you so much, Gilles, for helping me out. When I changed
limit_urls_to to include our web domain name, http://www.jhuccp.org/,
it worked like a charm. I guess I just kept thinking that "localhost"
was an alias for "www.jhuccp.org", which it is in a certain sense, but
not here.

Thank you again for all your help.

-Kevin Zembower

>>> Gilles Detillieux <[EMAIL PROTECTED]> 01/29/02 11:25AM >>>
According to KEVIN ZEMBOWER:
> I'm having some problems indexing some files on my site. If I run
> htdig with -v -v -v, I get lines like these in the output:
> +A tag: pos = 2, position =
> ="http://www.jhuccp.org/centerpubs/impact/number13/pdf/Impact13.p 
> image: http://localhost/centerpubs/impact/pdf-icon.gif 
> href: http://www.jhuccp.org/centerpubs/impact/number13/pdf/Impact13.pdf
> (PDF format )
> 
>    Rejected: URL not in the limits!
> url rejected: (level 1)http://www.jhuccp.org/centerpubs/impact/number13/pdf/Impact13.pdf
> 
> I can't figure out why this file would be rejected.

See http://www.htdig.org/FAQ.html#q5.27 for explanations of all these
rejection messages.

> My htdig.conf file
> has these pertinent lines:
> start_url:              http://localhost/ 
> limit_urls_to:          ${start_url}

OK, the limit_urls_to list takes on the value of start_url, so any URL
that doesn't include the string http://localhost/ will be rejected.
The URL for the PDF above, which was rejected, wasn't a "localhost" URL.
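
As a rough sketch of one way to fix this (an illustration only, assuming
the same site answers under both host names), limit_urls_to can list
more than one pattern, separated by spaces, and a URL is accepted if it
matches any of them:

    # Hypothetical htdig.conf fragment, not the poster's actual file
    start_url:              http://localhost/
    # Accept URLs under either host name
    limit_urls_to:          ${start_url} http://www.jhuccp.org/

Alternatively, start_url itself could point at http://www.jhuccp.org/,
in which case limit_urls_to: ${start_url} would already cover the
absolute links in the pages.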

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
