Hello!

I'm using wget for windows version 1.10.2.

I'm trying to download the contents of my photography site. For doing that I
created the following command:

wget --wait
2 --random-wait -r -l7 -H -p --convert-links --html-extension -Dpbase.com --
exclude-domains forum.pbase.com,search.pbase.com --no-parent -e robots=off
http://www.pbase.com/piotrstankiewicz

(I had to use -H option as the photos are placed at other servers that
www.pbase.com)

Unfortunately wget seems to ignore --no-parent option as it starts to
download also www.pbase.com/index.html
www.pbase.com/help.hmtl
documents and others placed in the main directory. I have impression it's
some kind of bug, although I'm not definitely wget expert. Could you try to
verify it please?

Additionnaly I tried to use the option -R to exclude those files. In such a
case wget downloads those files and deletes it after but it follows the
links from those files (which is unwated by me). I found the information
that it's by design. But what about introducing any other option precising
if the links from the unwated documents (specified with -R) should be
followed or no (in some cases it's not welcome).

With best regards

Piotr Stankiewicz


---------------------------------------------------

http://www.pbase.com/piotrstankiewicz

Reply via email to