Hi, I was downloading all files from a site using the following command: wget -nd -v -r --accept-regex '.*mod.*resource/.*' --header 'Host: catedras.info.unlp.edu.ar' --header 'User-Agent: Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:23.0) Gecko/20100101 Firefox/23.0' --header 'Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8' --header 'Accept-Language: es-ar,es;q=0.7,en-us;q=0.3' --header 'DNT: 1' --header 'Content-Type: application/x-www-form-urlencoded' --header 'Cookie: __utma=135945449.1331125489.1377905747.1378736807.1378776921.6; __utmz=135945449.1377905747.1.1.utmcsr=(direct)|utmccn=(direct)|utmcmd=(none); __utmc=135945449; MoodleSession=bp13b0uafi72eu68v29hlrvih5; MOODLEID1_=%25D1%25E3%257E%25AE%250C%257D%2519%25A1' https://catedras.info.unlp.edu.ar/course/view.php?id=597
Wget downloaded correctly all files, but when a page used the 303 See Other directive to send the file, the file wasn't saved with the name of the mirrored page, but with the previous one. Don't know if this is a bug, if it's not it would be a good proposal to add an option like --save-with-redirect-name or something in order to avoid this bad functionality. Here is the output of a wrongly named file: --2013-09-13 12:38:03-- https://catedras.info.unlp.edu.ar/mod/resource/view.php?id=10729 Reutilizando la conexión con catedras.info.unlp.edu.ar:443. Petición HTTP enviada, esperando respuesta... 303 See Other Ubicación: https://catedras.info.unlp.edu.ar/pluginfile.php/35940/mod_resource/content/1/tp02-topologias-practica-RIP.zip?forcedownload=1 [siguiente] --2013-09-13 12:38:03-- https://catedras.info.unlp.edu.ar/pluginfile.php/35940/mod_resource/content/1/tp02-topologias-practica-RIP.zip?forcedownload=1 Reutilizando la conexión con catedras.info.unlp.edu.ar:443. Petición HTTP enviada, esperando respuesta... 200 OK Longitud: 6860 (6,7K) [application/zip] Grabando a: “view.php?id=10729” 100%[===================================================================================================================================================================================================>] 6.860 --.-K/s en 0,02s 2013-09-13 12:38:03 (367 KB/s) - “view.php?id=10729” guardado [6860/6860] As shown before, the desirabled name would be tp02-topologias-practica-RIP.zip rather than view.php?id=10729 Here is the output of a well named file, download without 303 redirect: --2013-09-13 12:38:03-- https://catedras.info.unlp.edu.ar/pluginfile.php/35798/mod_resource/content/2/2.-%20ruteo%20interno_Parte1.pdf Reutilizando la conexión con catedras.info.unlp.edu.ar:443. Petición HTTP enviada, esperando respuesta... 200 OK Longitud: 651897 (637K) [application/pdf] Grabando a: “2.- ruteo interno_Parte1.pdf” 100%[===================================================================================================================================================================================================>] 651.897 1,48MB/s en 0,4s 2013-09-13 12:38:04 (1,48 MB/s) - “2.- ruteo interno_Parte1.pdf” guardado [651897/651897]
