-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Alan Thomas wrote:
>       I admittedly do not know much about web server responses, and I
> have a question about why wget did not retrieve a document. . . .
>  
>    I executed the following wget command:
>  
> wget --recursive --level=20 --append-output=wget_log.txt
> --accept=pdf,doc,ppt,xls,zip,tar,gz,mov,avi,mpeg,mpg,wmv --no-parent
> --no-directories --directory-prefix="TEST_AnyLogic_Docs"
> "http://www.xjtek.com"
>  
>         However, it did not get the PDF document found by clicking on
> this link: http://www.xjtek.com/anylogic/license_agreement.  This URL
> automatically results in a download of a PDF file.
>  
>         Why?  Is there a wget option that will include this file? 

I believe it's being rejected because its URL doesn't end in a suffix
from your --accept list: it's a PDF file, but the URL doesn't end in
.pdf. The server does send a Content-Disposition header that specifies
a filename, but the release version of Wget ignores that header.
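
To illustrate the rejection, here is a rough sketch of suffix-based
accept matching. It is a simplified model, not Wget's actual code, and
the function name matches_accept is made up for this example. It
assumes the list entries are plain suffixes with no wildcards.

```sh
# Simplified model of suffix-based accept matching (NOT Wget's real code).
# matches_accept is a hypothetical helper: it returns 0 if the name ends
# with any suffix in the comma-separated list, and 1 otherwise.
matches_accept() {
    fname=$1
    old_ifs=$IFS
    IFS=,
    # Unquoted on purpose so the list is split on commas.
    for suffix in $2; do
        case $fname in
            *"$suffix") IFS=$old_ifs; return 0 ;;
        esac
    done
    IFS=$old_ifs
    return 1
}

for name in license_agreement.pdf license_agreement; do
    if matches_accept "$name" "pdf,doc,ppt,xls,zip,tar,gz,mov,avi,mpeg,mpg,wmv"; then
        echo "$name: accepted"
    else
        echo "$name: rejected"
    fi
done
```

Under this model a name ending in .pdf is accepted, while the bare
"license_agreement" taken from the URL is rejected, even though the
file behind it is a PDF.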

If you use the current development version of Wget and specify
"-e content_disposition=on", it will download the file. If you're
willing to try that, see http://wget.addictivecode.org/RepositoryAccess
for how to get the current development version of Wget (use the 1.11
repository, not mainline) and for its special build requirements.
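
As a sketch, your original command with that setting added might look
like the following. It assumes the wget on your PATH is a build from
the 1.11 repository. The fetch_docs wrapper and the WGET variable are
only conveniences I made up here, so that you can point the command at
a freshly built binary without installing it.

```sh
# Assumes a Wget built from the 1.11 development repository.
# fetch_docs and WGET are hypothetical names used for this example:
# set WGET to the path of the new binary, or leave it unset to use
# whatever "wget" is found on PATH.
fetch_docs() {
    "${WGET:-wget}" -e content_disposition=on \
        --recursive --level=20 --append-output=wget_log.txt \
        --accept=pdf,doc,ppt,xls,zip,tar,gz,mov,avi,mpeg,mpg,wmv \
        --no-parent --no-directories \
        --directory-prefix="TEST_AnyLogic_Docs" \
        "http://www.xjtek.com"
}
```

Calling fetch_docs then runs the same recursive download as before,
with only the content_disposition setting added.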

- --
HTH,
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer...
http://micah.cowan.name/

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFHMRq97M8hyUobTrERCJ6WAJwK6uv/HlrLmTA7zK5DLZCnswkofQCfbMvJ
6yAiHoWEsXLohuYmQTGlPDo=
=DWHZ
-----END PGP SIGNATURE-----
