I´m trying to mirror a site using wget v1.11.4.
The parameters I´m passing it are simple:
wget -m -np -k -c (url)
I have tried adding --user-agent="Firefox/linux real user agent here"
along with --referer="[parent dir here]" but it doesn´t make any
difference. It outputs "Error 404 not found", even while I can see the
site just fine (and download files) using a web browser....
I´ve noticed that whatever web server it´s using, it outputs links
without paths, ie
<a
href="%D0%A0%D0%B0%D0%B1%D0%BE%D1%82%D0%B0%20%D1%81%20USB%20Flash%20%D0%BA%D0%B0%D0%BA%20%D1%81%20%D0%B6%D1%91%D1%81%D1%82%D0%BA%D0%B8%D0%BC%20%D0%B4%D0%B8%D1%81%D0%BA%D0%BE%D0%BC.rtf
<view-source:http://exit.ktnet.kg/Distr2/_Release/Flash%20controller/%D0%A0%D0%B0%D0%B1%D0%BE%D1%82%D0%B0%20%D1%81%20USB%20Flash%20%D0%BA%D0%B0%D0%BA%20%D1%81%20%D0%B6%D1%91%D1%81%D1%82%D0%BA%D0%B8%D0%BC%20%D0%B4%D0%B8%D1%81%D0%BA%D0%BE%D0%BC.rtf>">
FWIW, the full url is
wget -m -np -k -c --user-agent="Mozilla/5.0 (X11; Linux i686; rv:2.0.1)
Gecko/20100101 Firefox/4.0.1" --no-http-keep-alive
--referer="http://exit.ktnet.kg/Distr2/_Release/"
--keep-session-cookies
"http://exit.ktnet.kg/Distr2/_Release/Flash%20controller/"
Thoughts?
FC