Hello,

I'm trying to use wget to do something that seems very simple, but I
haven't been able to find a solution anywhere and I'm hoping someone here
could point me in the right direction.

I want to mirror part of a website that contains two links pages, each of
which contains links to many root-level directories and also to the other
links page.  I want to download recursively all the links from one links
page, but not from the other: that is, I want to tell wget "download
links1 and follow all of its links, but do not download or follow links
from links2".

I've put a demo of this problem up at http://fangjaw.com/wgettest -- there
is a diagram there that might state the problem more clearly.

This functionality seems so basic that I assume I must be overlooking
something.  Clearly wget has been designed to give users control over
which files they download; but all I can find is that -X controls both
saving and link-following at the directory level, while -R controls saving
at the file level but still follows links from unsaved files.

Is there an obvious solution I'm missing?  Or a manual section I don't
have or something?

Thanks in advance,

Fang

(PS: wget I'm  using is 1.12.)


Reply via email to