Not sure if this is a bug or not. When using wget to download with
"page-requisites", "convert-links", and a fully qualified
"directory-prefix", some links are not converted as relative. For
example, if I execute the following:
./wget -E -H -k -K -p -nv -P /tmp/wget-test http://www.yahoo.com/
Some of the html "SRC" references have fully qualified directory paths
instead of relative paths. For example:
src="/tmp/wget-test/us.i1.yimg.com/us.yimg.com/i/ww/m6v3.gif"
However, if I run the following command:
./wget -E -H -k -K -p -nv -P . http://www.yahoo.com/
I get the following, correct relative reference:
src="../us.i1.yimg.com/us.yimg.com/i/ww/m6v3.gif"
The "wget" man page (convert-links section) suggests the references
should be relative:
"The fact that the former links
are converted to relative links ensures that you can
move the downloaded hierarchy to another directory."
I've tried this with wget 1.8.1 and 1.8.2. I would appreciate any
guidance regarding more proper settings I should be using or if this is
bug, I'd be happy to submit it to the appropriate place. Please cc: me
on replies as I'm not on the mailing list.
Thanks,
--Sunil