On Thu Oct 4 04:55:53 2001, Ian Abbott wrote:
> On 3 Oct 2001, at 16:01, CJ Kucera wrote:
>
> > The closest I've come is (and there's lots of extraneous stuff in there):
> >
> >   wget -r -l inf -k -p --wait=1 -H \
> >     --domains=theonion.com,graphics.theonion.com,www.theonion.com,theonionavclub.com,www.theonionavclub.com \
> >     http://www.theonion.com
>
> The domains you specify with --domains also match all the
> subdomains, so if that's what you want, you can simplify the above
> to:
>
>   wget -r -l inf -k -p --wait=1 -H \
>     --domains=theonion.com,theonionavclub.com \
>     http://www.theonion.com

This actually doesn't do what I want, either. Every page hosted at
"www.theonion.com" is retrieved completely, but the recursion doesn't
continue from pages hosted on any of the other domains. For instance,
the front page has a link to "www.theonionavclub.com". The index.html
from theonionavclub.com is retrieved completely, but none of the pages
IT links to are retrieved.

-CJ

WOW: Rapacious          | A priest advised Voltaire on his death bed to
apocalyptech.com/wow    | renounce the devil.  Replied Voltaire, "This
[EMAIL PROTECTED]       | is no time to make new enemies."
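
For reference, the subdomain matching Ian describes for --domains can be
sketched as a small shell function. This is only an illustration of the
behaviour as described in this thread, not wget's actual code:
host_in_domains is a hypothetical helper, and the rule that a match must
fall on a dot boundary is an assumption.

```shell
#!/bin/sh
# Hypothetical helper (not part of wget): succeeds if HOST equals one of
# the comma-separated DOMAINS or is a subdomain of one of them.
# Assumption: a match must fall on a dot boundary.
host_in_domains() {
    host=$1
    list=$2
    old_ifs=$IFS
    IFS=,
    for d in $list; do
        case $host in
            "$d"|*."$d")
                IFS=$old_ifs
                return 0
                ;;
        esac
    done
    IFS=$old_ifs
    return 1
}

# Usage: check the hosts from this thread against the shortened list.
for h in www.theonion.com graphics.theonion.com www.theonionavclub.com; do
    if host_in_domains "$h" "theonion.com,theonionavclub.com"; then
        echo "$h: accepted"
    else
        echo "$h: rejected"
    fi
done
```

Under that assumption, all three hosts are accepted by the two-entry
list, which would make the longer --domains list redundant. It does not
explain why the recursion stops on the second domain.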
