-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA256 Alan Thomas wrote: > I admittedly do not know much about web server responses, and I > have a question about why wget did not retrieve a document. . . . > > I executed the following wget command: > > wget --recursive --level=20 --append-output=wget_log.txt > --accept=pdf,doc,ppt,xls,zip,tar,gz,mov,avi,mpeg,mpg,wmv --no-parent > --no-directories --directory-prefix="TEST_AnyLogic_Docs" > "http://www.xjtek.com" > > However, it did not get the PDF document found by clicking on > this link: http://www.xjtek.com/anylogic/license_agreement. This URL > automatically results in a download of a PDF file. > > Why? Is there a wget option that will include this file?
I believe it's being rejected because it doesn't end in a suffix that's in your --accept list; it's a PDF file, but its URL doesn't end in .pdf. It does use Content-Disposition to specify a filename, but the release version of Wget doesn't acknowledge those. If you use the current development version of Wget, and specify "-e content_disposition=on", it will download. If you're willing to try that, you'll need to look at http://wget.addictivecode.org/RepositoryAccess for information on how to get the current development version of Wget (you should use the 1.11 repository, not mainline), and special building requirements. - -- HTHm Micah J. Cowan Programmer, musician, typesetting enthusiast, gamer... http://micah.cowan.name/ -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFHMRq97M8hyUobTrERCJ6WAJwK6uv/HlrLmTA7zK5DLZCnswkofQCfbMvJ 6yAiHoWEsXLohuYmQTGlPDo= =DWHZ -----END PGP SIGNATURE-----