Hello, I'm trying to use wget to do something that seems very simple, but I haven't been able to find a solution anywhere and I'm hoping someone here could point me in the right direction.
I want to mirror part of a website that contains two links pages, each of which contains links to many root-level directories and also to the other links page. I want to download recursively all the links from one links page, but not from the other: that is, I want to tell wget "download links1 and follow all of its links, but do not download or follow links from links2". I've put a demo of this problem up at http://fangjaw.com/wgettest -- there is a diagram there that might state the problem more clearly. This functionality seems so basic that I assume I must be overlooking something. Clearly wget has been designed to give users control over which files they download; but all I can find is that -X controls both saving and link-following at the directory level, while -R controls saving at the file level but still follows links from unsaved files. Is there an obvious solution I'm missing? Or a manual section I don't have or something? Thanks in advance, Fang (PS: wget I'm using is 1.12.)
