Hi wgeteers! I am using wget to parse a local html file which has numerous links into the www. Now, I only want hosts that include certain strings like -H -Daudi,vw,online.de Two things I don't like in the way wget 1.8.1 works on windows:
The first page of even the rejected hosts gets saved. This messes up my directory structure as I force directories (which is my default and normally useful) I am aware that wget has switched to breadth first (as opposed to depth-first) retrieval. Now, with downloading from many (20+) different servers, this is a bit frustrating, as I will probably have the first completely downloaded site in a few days... Is there any other way to work around this besides installing wget 1.6 (or even 1.5?) Thanks Jens
