Re: [Bug-wget] wget downloads index.html unnecessarily and halts batch script (Windows)

Ángel González Tue, 22 Sep 2015 16:04:59 -0700

On 22/09/15 19:10, El Gato wrote:

Hi, everyone.
I am having trouble with wget64 on Windows. I am using a batch script to download files from a host:
@echo OFF
FOR /L %%i in (1, 1, 9999) DO (
    cls
    echo Downloading file %%i
    wget64.exe -A pdf,chm -e robots=off --progress=bar --show-progress -r -np -nd -nc -HDit-ebooks.info,filepi.com --content-disposition -awget.log it-ebooks.info/book/%%i/
)
|wget| will download |index.html| (which I feel is unnecessary), then it proceeds to the hosted file and downloads it if the file does not exist on the destination, but will fail to retrieve the |index.html| of the next book and start the next download.
Is it really necessary to download |index.html| and if that is the case, how can I tell |wget| to erase and download the new one every time?

It should be downloading index.html and then deleting it, since you are only accepting pdf and chm files (it downloads index.html to look for links to those files). And that's what it does here.

As a bit of unsolicited help, I would recommend printing the URLs (replace the body of the for loop with an echo) and feeding the list to wget with "wget -i -". This way wget will be able to reuse the open connection instead of running 10000 instances (and connecting to the server 10000 times).
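
For illustration, here is a minimal sketch of that idea in Python rather than batch, so the URL generation is easy to check on its own. It assumes the book pages follow the same it-ebooks.info/book/N/ pattern as the loop above, that wget64.exe is on the PATH, and that an explicit http:// prefix is acceptable (wget would assume http for a bare host name anyway). The names book_urls and run_wget are made up for this example; the wget options are the ones from the original command, minus the progress flags.

```python
import subprocess

# Assumption: same URL pattern as the batch loop, with an explicit scheme.
BASE_URL = "http://it-ebooks.info/book/{}/"


def book_urls(first=1, last=9999):
    """Return one URL per book id, from first to last inclusive."""
    return [BASE_URL.format(i) for i in range(first, last + 1)]


def run_wget(urls, wget="wget64.exe"):
    """Start a single wget process and pass it the URL list on stdin (-i -)."""
    cmd = [
        wget, "-i", "-",
        "-A", "pdf,chm", "-e", "robots=off",
        "-r", "-np", "-nd", "-nc",
        "-H", "-D", "it-ebooks.info,filepi.com",
        "--content-disposition", "-a", "wget.log",
    ]
    # One URL per line, which is the format wget expects for -i input.
    listing = "\n".join(urls) + "\n"
    return subprocess.run(cmd, input=listing, text=True).returncode


if __name__ == "__main__":
    raise SystemExit(run_wget(book_urls()))
```

With this layout there is one wget process for the whole run, so a failure on one book should not stop the loop the way a stalled per-book invocation can.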
