A limit_urls_to value of /foo/bar/ would certainly restrict your
indexing to those parts of your servers. But there are two caveats
here:

First, how common is "/foo/bar/" in URLs on other servers? If a URL
from another server matches that path, it will be indexed too.

Second, unless you only want to index the index.html pages, don't
include index.html in the pattern. A URL must match one of the
limit_urls_to patterns *exactly* as written. So if you used the
pattern you mentioned, then

    http://www.foo.com/path/foo/bar/title.html

would not be indexed, because it doesn't match the pattern.
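
To make both caveats concrete, here is a rough Python sketch. It is not
ht://Dig code. It assumes limit_urls_to does a plain, case-sensitive
substring match against the full URL, which is my reading of the default
behavior; the is_allowed helper and the other.example.org host are made
up for illustration.

```python
def is_allowed(url, patterns):
    """Return True if the URL contains at least one pattern as a literal substring.

    Assumption: this mimics limit_urls_to as a simple substring test,
    with multiple patterns combined by logical OR.
    """
    return any(pattern in url for pattern in patterns)


# Pattern from the question (without the leading "*", which this sketch
# would treat as a literal character, not a wildcard).
narrow = ["/foo/bar/index.html"]

# Caveat 2: only the index.html page matches, title.html does not.
print(is_allowed("http://www.foo.com/path/foo/bar/index.html", narrow))
print(is_allowed("http://www.foo.com/path/foo/bar/title.html", narrow))

# Caveat 1: a page on a different (hypothetical) server matches as well.
print(is_allowed("http://other.example.org/foo/bar/index.html", narrow))

# Including the host and dropping index.html avoids both problems here.
with_host = ["http://www.foo.com/path/foo/bar/"]
print(is_allowed("http://www.foo.com/path/foo/bar/title.html", with_host))
print(is_allowed("http://other.example.org/foo/bar/index.html", with_host))
```

If that assumption holds, putting the server name in the pattern is the
simplest way to keep the spider off other servers.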
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
At 5:08 PM -0400 7/19/01, Downey, Michael {10-6~Indianapolis} wrote:
>Geoff,
>
>Thanks for your help. In light of the logical OR, would it seem that
>
> */foo/bar/index.html
>
>would work as well? My problem is I want to avoid spidering other
>servers that these index.html pages link to. Again, I appreciate
>your assistance. Thanks!
>
>Best Regards,
>Michael Downey