Matthew Woehlke wrote:
> Micah Cowan wrote:
>> Brian Keck wrote:
>>> Sometimes -p doesn't work.  For instance:
>>>
>>>     wget -p http://en.wikipedia.org/wiki/Herbig-Haro_object
>>
>> In this case, it appears that you've bumped into the fact that wget, by
>> default, will refuse to cross hostname boundaries to download things,
>> unless you tell it otherwise. You want the -H option.
> 
> Hmm, an interesting observation from that... am I missing something, or
> is there not currently an easy way to tell wget to span hosts in the
> same domain, but not span domains? For example, spanning to
> upload.wikipedia.org makes sense when grabbing from en.wikipedia.org,
> but spanning to casa.colorado.edu, www.daviddarling.info or
> sparky.rice.edu (to steal the external references from the mentioned
> article) probably isn't desired.
> 
> Might be a useful wish for some point in the unspecified future.

-D wikipedia.org will do.

Note that we can't do this automatically (as: what's the domain?); even
the assumption that a domain is whatever name is at the second level
(such as right before "com") is not always correct: for instance, many
domains in the ".name" TLD were sold at the third level only.
micah.cowan.name was sold separately from sara.cowan.name, and while
those two both happen to belong to me, there are other foo.cowan.name's
that belong to others, so traversing to those hosts wouldn't be appropriate.

-- 
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer...
http://micah.cowan.name/

Attachment: signature.asc
Description: OpenPGP digital signature

Reply via email to