On Tuesday, March 19, 2002, at 07:47 PM, Jim Cole wrote:

>> I only want to index pages WITHIN the sites that I specify, and nothing
>> else. What am I missing here?
>
> limit_urls_to should consist of patterns that define the sites to
> which you want to limit the dig. Any URL that does not have a
> substring matching something in limit_urls_to will be rejected as
> "not in the limits!".

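To add to Jim's excellent answer: limit_urls_to defaults to the value 
of start_url. So if you specify a single page as the start_url and leave 
limit_urls_to at its default, htdig will index only that one page, 
because no other URL contains that page's full URL as a substring.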
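
As a rough sketch of that situation (www.example.com and the paths are 
placeholders, not taken from your config):

```
# Hypothetical single-page start URL.
start_url:      http://www.example.com/docs/index.html

# limit_urls_to is left unset here, so it takes its default of
# ${start_url}. Links found on index.html do not contain that full
# URL as a substring, so they are rejected as "not in the limits!".

# To dig the whole site instead, a site-wide pattern could be used:
# limit_urls_to:  http://www.example.com/
```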

I think what you're trying to do is index a list of URLs. In that case 
it's much better to put the URLs in a text file (e.g. one per line) and 
include that file in the start_url attribute, with backquotes around 
the file name:

start_url: `/path/to/text/file`
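
For illustration only, assuming the goal is to index everything within 
two sites (the hostnames below are placeholders), the text file might 
contain:

```
http://www.example.com/
http://www.example.org/
```

and the matching config lines would then look something like this:

```
# The backquotes pull in the contents of the file as the value.
start_url:      `/path/to/text/file`

# This is the default, written out for clarity: any URL containing
# one of the start URLs as a substring is accepted.
limit_urls_to:  ${start_url}
```

Since each start URL here is a site root, every page on those sites 
matches the limits and nothing outside them does.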


--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

