According to Ted Rogers:
> here is what's in my htdig.conf and then a part of ./rundig -vvvv -but
> first, here's my problem... below contains one example of a URL .../~G634-xx
> REJECTED... i have a lot of those that i need not to be rejected.
> further you will see, toward the end, these URLs...
> URL: http://slis-two.lis.fsu.edu/websitedev/fall2001/dev.html
> URL: http://slis-two.lis.fsu.edu/websitedev/fall2001/
> URL: http://slis-two.lis.fsu.edu/websitedev/fall2001/pre.html
> URL: http://slis-two.lis.fsu.edu/websitedev/fall2001/maint.html
>
> i don't know if it got those but i need those too.
> HELP!? Please. (this stuff got changed on me at the last minute, i'm a
> beginner and thought i was all set!
>
> Thank you,
> Ted Rogers
>
> htdig.conf excerpt:
>
> local_user_urls:
> http://slis-two.lis.fsu.edu/=/usr2/websitedev/fall2001/,/public_html/
>
> start_url: http://slis-two.lis.fsu.edu/websitedev/fall2001/
>
> # This attribute limits the scope of the indexing process. The default is
> to
> # set it to the same as the start_url above. This way only pages that are
> on
> # the sites specified in the start_url attribute will be indexed and it will
> # reject any URLs that go outside of those sites.
> #
> # Keep in mind that the value for this attribute is just a list of string
> # patterns. As long as URLs contain at least one of the patterns it will be
> # seen as part of the scope of the index.
> #
> limit_urls_to: ${start_url}
It seems to me you need to pay more attention to the comments in the
configuration file! If your start_url is set to
http://slis-two.lis.fsu.edu/websitedev/fall2001/
and limit_urls_to is set to the value of start_url, then of course htdig
will limit the URLs it accepts to those under websitedev/fall2001/ on
your site, rejecting URLs like http://slis-two.lis.fsu.edu/~G...
> part of my ./rundig -vvvv output:
>
> Rejected: URL not in the limits!
> url rejected: (level 1)http://slis-two.lis.fsu.edu/~G634-19/
If you had looked up that message in FAQ 5.27, you'd see that the cause
of the rejection is limit_urls_to's value.
So, can I suggest you calm down and _carefully_ read the docs and FAQ?
I'm sure you'd save a lot more time that way than by waiting for replies
from the list. If you set
limit_urls_to: ${start_url} http://slis-two.lis.fsu.edu/~
you'll probably be able to index what you want.
See http://www.htdig.org/attrs.html#start_url,
http://www.htdig.org/attrs.html#limit_urls_to and
http://www.htdig.org/FAQ.html#q5.27
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html