Hi,
I mirror many websites with wget, and I have one feature suggestion
that will minimize the unwanted mirroring of certain subdirectories.
For example, lets say I want to mirror everything on www.foo.com,
but they have a web-board forum at www.foo.com/forum. Also, their
cgi-bin directory is readable, and contains scripts that generate
indexes to image libraries and other such things. Lets say that
is located at www.foo.com/cgi-bin/. Allowing a wget user to specify
a list of URIs that contain directories that the user does not want
to mirror when performing a full wget -rm of a server via the
command-line or in a file some way, would be excellent. There will
need to be some decisions made as to what format the list of
directories that shouldn't be mirrored should be passed to wget, but
I think this feature would be a great addition to an already great
program.
Thank you,
-jbl