On Thu Oct  4 04:55:53 2001, Ian Abbott wrote:
> On 3 Oct 2001, at 16:01, CJ Kucera wrote:
> 
> > The closest I've come is (and there's lots of extraneous stuff in there):
> > 
> > > wget -r -l inf -k -p --wait=1 -H \
> > >   --domains=theonion.com,graphics.theonion.com,www.theonion.com,theonionavclub.com,www.theonionavclub.com \
> > >   http://www.theonion.com
> 
> The domains you specify with --domains also match all the
> subdomains, so if that's what you want, you can simplify the above
> to:
> 
> wget -r -l inf -k -p --wait=1 -H --domains=theonion.com,theonionavclub.com \
>   http://www.theonion.com

This actually doesn't do what I want, either.  Every page hosted
at "www.theonion.com" is retrieved completely, but for pages hosted
on any of the other hosts, the recursion doesn't continue.

For instance, on the front page, there's a link to "www.theonionavclub.com".
The index.html from theonionavclub.com is retrieved completely, but
none of the pages IT links to are retrieved.
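
Just to check that I'm reading the subdomain matching right, here's
roughly how I understand it, as a small shell sketch.  The function
name domain_matches is made up for illustration, and matching only on
a dot boundary is my assumption about the behaviour, not wget's actual
code:

```shell
# Hypothetical helper: succeed if HOST equals one of the listed
# domains or is a subdomain of one (assumed behaviour, not wget's code).
domain_matches() {
    host=$1
    shift
    for d in "$@"; do
        case "$host" in
            "$d" | *."$d") return 0 ;;   # exact host, or anything ending in ".$d"
        esac
    done
    return 1
}

# Under this reading, both of these hosts should be followed:
domain_matches graphics.theonion.com theonion.com theonionavclub.com && echo "followed"
domain_matches www.theonionavclub.com theonion.com theonionavclub.com && echo "followed"
```

If that's right, "www.theonionavclub.com" is covered by the shorter
--domains list, so the list itself shouldn't be what stops the
recursion.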

-CJ

WOW: Rapacious           | A priest advised Voltaire on his death bed to
apocalyptech.com/wow     |  renounce the devil.  Replied Voltaire, "This
[EMAIL PROTECTED]     |              is no time to make new enemies."
