Brad Wardman wrote: > Micah, > > I am sorry for not getting back to you earlier on this, but needed to > find URLs that is causing these problems with downloading to the wrong > directories. > > Two example URLs, first downloads a .doc and the latter downloads a > couple of pdfs. The first file is downloaded from a redirect, but other > redirects do not have the same behavior. I also noticed that the latter > URL calls the pdfs from an index.php parameter called id and requests > (e.g., id = 14%3AnameOfFile) the file with a numeric (e.g., 14,15,16) > followed by the hex value (%3A) for a colon, and then the name of the pdf. > > http://bit.ly/aoISTU > http://203.125.30.69/hsl/hslnieuwsbrief28/
The .doc file ends up inside the -P directory for me. The other one doesn't result in an pdfs for me. It does if I add --content-disposition, but it still lives in -P as it should (both in 1.12 and 1.11.4). > > Below is my command line execution: >>wget --referer="http://www.google.com" --user-agent="Mozilla/5.0....." > -t 1 -e robots=off -k -np -l 5 -P wgetFolder URL This isn't complete. It doesn't have recursive-descent options, so would never download more than one file as given. Please provide: - The version of Wget you're using - The operating system you're running it on - The contents of any ~/.wgetrc and/or /etc/wgetrc, as apparently there are more options being specified than you're actually giving at the command-line (almost certainly including --content-disposition/content_disposition, and perhaps recursive or mirror). - A relevant snippet of the log output when --debug is provided (probably for the http://bit.ly/aoISTU one, since that's only one file to download so will have a much shorter log). -- Micah J. Cowan http://micah.cowan.name/
