Greetings,

I stumbled across a bug yesterday that reproduces in both v1.8.2 and v1.10.2.

Apparently, a recursive get tries to re-open each file for reading after downloading it, in order to find and fetch subsequent files. The problem is that when wget is invoked with -O - to deliver everything to stdout, there is no file on disk to open, so you get the output below (note the "No such file or directory" error). In 1.10 the error message appears to have been removed, but wget still fails to fetch recursively.

I realize there wouldn't seem to be much reason to send more than one page to stdout, but I'm feeding it all into a statistical filter to classify website data, and the filter doesn't care where one page ends and the next begins. Do you know of any workaround, other than letting wget save the files and reading them back afterward (roughly the sketch below), which won't scale at thousands of pages per minute?
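
For concreteness, the save-then-replay approach I'd rather avoid looks something like this (just a rough sketch; /tmp/crawl and the classify command are placeholders for my actual setup):

$ wget -q -r -P /tmp/crawl http://www.zdziarski.com
$ find /tmp/crawl -type f -exec cat {} + | classify
$ rm -rf /tmp/crawl

The extra round trip through the filesystem is what kills it at volume.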

Thanks!

$ wget -O - -r http://www.zdziarski.com > out
--15:40:06--  http://www.zdziarski.com/
           => `-'
Resolving www.zdziarski.com... done.
Connecting to www.zdziarski.com[209.51.159.242]:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 24,275 [text/html]

100%[====================================>] 24,275 163.49K/s ETA 00:00

15:40:06 (163.49 KB/s) - `-' saved [24275/24275]

www.zdziarski.com/index.html: No such file or directory

FINISHED --15:40:06--
Downloaded: 24,275 bytes in 1 files

Jonathan