I'm sure I must be doing something wrong here, but I've looked
through the FAQ and the list archives to no avail.
I can't get htdig (v3.2.b3) to accept the URLs I specify. Here's what I
have:
start_url: http://www.shaksper.net/~eluhrs/sites.html
limit_urls_to: ${start_url}
I expect htdig to spider each link listed in sites.html and index
everything ON those sites, but not follow links to OTHER sites. Instead,
htdig -vvv gives me a bunch of errors like this:
href: http://www.motleyltd.com.au/ (http://www.motleyltd.com.au/)
Rejected: URL not in the limits!
url rejected: (level 1)http://www.motleyltd.com.au/
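The rejection above can be reproduced with a minimal sketch, assuming limit_urls_to is applied as a plain substring test against each candidate URL, which is how the htdig documentation describes it. The function name in_limits is hypothetical, and this is an illustration rather than htdig's actual code.

```python
def in_limits(url, limit_patterns):
    """Return True if any limit pattern occurs as a substring of url.

    Hypothetical helper: models the documented behaviour of
    limit_urls_to, not the real htdig implementation.
    """
    return any(pattern in url for pattern in limit_patterns)


start_url = "http://www.shaksper.net/~eluhrs/sites.html"

# limit_urls_to: ${start_url} expands to the full URL of the link page,
# so only URLs containing that exact string can pass.
limits = [start_url]

print(in_limits(start_url, limits))                       # True
print(in_limits("http://www.motleyltd.com.au/", limits))  # False
```

Under that assumption, every off-site link on sites.html fails the test, because none of them contains the full sites.html URL.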
I tried replacing ${start_url} with a list of strings from the link list,
but then I get a bunch of (seemingly indiscriminate) sites indexed along
with the ones I want. I also tried using max_hop_count, but that was
unsuccessful too.
I only want to index pages WITHIN the sites that I specify, and nothing
else. What am I missing here?
Eric
--
Eric Luhrs | [EMAIL PROTECTED] | http://www.shaksper.net/tech.html