Well... Shall I file a bug report for this issue? This seems to be core functionality for a program like wget, and frankly I'm a little surprised that such a fundamental bug would exist in such a mature utility.
So if I'm missing something, I'm happy to be corrected. But otherwise I guess I'll write it up over at http://savannah.gnu.org/bugs/?group=wget...?

David

> Hello,
>
> I'm trying to use wget to do something that seems very simple, but I
> haven't been able to find a solution anywhere and I'm hoping someone here
> could point me in the right direction.
>
> I want to mirror part of a website that contains two links pages, each of
> which contains links to many root-level directories and also to the other
> links page. I want to download recursively all the links from one links
> page, but not from the other: that is, I want to tell wget "download
> links1 and follow all of its links, but do not download or follow links
> from links2".
>
> I've put a demo of this problem up at http://fangjaw.com/wgettest -- there
> is a diagram there that might state the problem more clearly.
>
> This functionality seems so basic that I assume I must be overlooking
> something. Clearly wget has been designed to give users control over
> which files they download; but all I can find is that -X controls both
> saving and link-following at the directory level, while -R controls saving
> at the file level but still follows links from unsaved files.
>
> Is there an obvious solution I'm missing? Or a manual section I don't
> have or something?
>
> Thanks in advance,
>
> Fang
>
> (PS: wget I'm using is 1.12.)
