Re: Debian bug 55145 - wget gets confused by redirects

Hrvoje Niksic Mon, 08 Apr 2002 09:56:15 -0700

Guillaume Morin <[EMAIL PROTECTED]> writes:

> If wget fetches a url which redirects to another host, wget
> retrieves the file, and there's nothing that can be done to turn
> that off.
>
> So, if you do wget -r on a machine that happens to have a redirect to
> www.yahoo.com you'll wind up trying to pull down a big chunk of
> yahoo.


Hmm.  Are you sure?  Wget 1.8.1 is trying hard to restrict following
redirections by applying the same rules normally used for following
links.  Downloading a half of Yahoo! because someone redirects to
www.yahoo.com is not intended to happen.

I tried to reproduce it by creating a page that redirects to
www.yahoo.com, but Wget behaved correctly:

    $ wget -r -l0 http://muc.arsdigita.com:2005/test.tcl
    --19:13:53--  http://muc.arsdigita.com:2005/test.tcl
               => `muc.arsdigita.com:2005/test.tcl'
    Resolving muc.arsdigita.com... done.
    Connecting to muc.arsdigita.com[212.84.246.68]:2005... connected.
    HTTP request sent, awaiting response... 302 Found
    Location: http://www.yahoo.com [following]
    --19:13:53--  http://www.yahoo.com/
               => `www.yahoo.com/index.html'
    Resolving www.yahoo.com... done.
    Connecting to www.yahoo.com[64.58.76.223]:80... connected.
    HTTP request sent, awaiting response... 200 OK
    Length: unspecified [text/html]

        [   <=>                               ] 16,829        22.39K/s             

    19:13:55 (22.39 KB/s) - `www.yahoo.com/index.html' saved [16829]


    FINISHED --19:13:55--
    Downloaded: 16,829 bytes in 1 files

Guillaume, exactly how have you reproduced the problem?

Re: Debian bug 55145 - wget gets confused by redirects

Reply via email to