-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

> On Sun, Jun 29, 2008 at 1:42 PM, Mishari Almishari <[EMAIL PROTECTED]> wrote:
>> Hi,
>> I want to download the website "www.2006election.net"
>>
>> For that, I used the command
>> "wget -d -nd -p -E -H -k -K -S -R png,gif,jpg,bmp,ico  --ignore-length
>> --user-agent=Mozilla -e robots=off -P www.2006election.net -o
>> www.2006election.net.out  http://www.2006election.net";
>>
>> But the downloaded page "index.html" has no content (except body/head tags),
>> eventhough i can see the content when i used internet exprolorer.

mm w wrote:
> the default index is not named index, or there is a HTTP test
> server/side regarding HTTP_USER_AGENT

The first one could not possibly cause problems, since he's not
requesting any URLs with "index.html" in them.

The HTTP_USER_AGENT thing is the problem. Mishari tried to specifically
handle this with the --user-agent line, but it apparently wasn't
convincing enough. I got it to work with:

  --user-agent='Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1; .NET
CLR 1.1.4322)'

- --
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer,
and GNU Wget Project Maintainer.
http://micah.cowan.name/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFIaAOl7M8hyUobTrERAhldAJ9Ivi2zEQ5MZQ1fIdResHqPDhtnuACgj1Y+
kNGIgq2MS8tPXxkXoKpNVPw=
=IhL+
-----END PGP SIGNATURE-----

Reply via email to