I'm sure that I must be doing something wrong here, but I've looked
through the faq and the list archive to no avail.

I can't get htdig (v3.2.b3) to accept the URLs I specify. Here's what I
have:

        start_url: http://www.shaksper.net/~eluhrs/sites.html
        limit_urls_to: ${start_url}

I expect htdig to spider each link specified in sites.html, and index
everything ON those site, but not links to OTHER sites.  Instead,
htdig -vvv gives me a bunch of errors like this:

        href: http://www.motleyltd.com.au/ (http://www.motleyltd.com.au/)
        Rejected: URL not in the limits!
        url rejected: (level 1)http://www.motleyltd.com.au/

I tried replacing ${start_url} with a list of strings from the link list,
but then I get a bunch of (seemingly indiscriminant) sites indexed along
with the ones I want.  I also tried using max_hope_count:, but was also
unsuccessful.

I only want to index pages WITHIN the sites that I specify, and nothing
else. What am I missing here?

  Eric

--

Eric Luhrs | [EMAIL PROTECTED] | http://www.shaksper.net/tech.html



_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to