-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 > On Sun, Jun 29, 2008 at 1:42 PM, Mishari Almishari <[EMAIL PROTECTED]> wrote: >> Hi, >> I want to download the website "www.2006election.net" >> >> For that, I used the command >> "wget -d -nd -p -E -H -k -K -S -R png,gif,jpg,bmp,ico --ignore-length >> --user-agent=Mozilla -e robots=off -P www.2006election.net -o >> www.2006election.net.out http://www.2006election.net" >> >> But the downloaded page "index.html" has no content (except body/head tags), >> eventhough i can see the content when i used internet exprolorer.
mm w wrote: > the default index is not named index, or there is a HTTP test > server/side regarding HTTP_USER_AGENT The first one could not possibly cause problems, since he's not requesting any URLs with "index.html" in them. The HTTP_USER_AGENT thing is the problem. Mishari tried to specifically handle this with the --user-agent line, but it apparently wasn't convincing enough. I got it to work with: --user-agent='Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1; .NET CLR 1.1.4322)' - -- Micah J. Cowan Programmer, musician, typesetting enthusiast, gamer, and GNU Wget Project Maintainer. http://micah.cowan.name/ -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFIaAOl7M8hyUobTrERAhldAJ9Ivi2zEQ5MZQ1fIdResHqPDhtnuACgj1Y+ kNGIgq2MS8tPXxkXoKpNVPw= =IhL+ -----END PGP SIGNATURE-----
