> 
> Do you think this might be an issue with framesets and ssl sites?  or an issue
> with framesets and cgi source files?

This is not a problem with frames - it IS a problem with SSL.
wget, while it appears to have SSL support, didn't quite
get it right. The internal schems being used don't treat
https: as an "http" protocol, and thus don't recurse down
into sub pages. (wget specifically avoids recursing into
unknown protocols,and https was treated as one of these).

A previous post on a patch for this exists.  The patch
is as follows:

--- src/recur.c Wed Dec 19 09:27:29 2001
+++ ../wget-1.8.1.esoft/src/recur.c     Sat Dec 29 16:17:40 2001
@@ -437,7 +437,7 @@
      the list.  */
 
   /* 1. Schemes other than HTTP are normally not recursed into. */
-  if (u->scheme != SCHEME_HTTP
+  if (u->scheme != SCHEME_HTTP && u->scheme!= SCHEME_HTTPS
       && !(u->scheme == SCHEME_FTP && opt.follow_ftp))
     {
       DEBUGP (("Not following non-HTTP schemes.\n"));
@@ -446,7 +446,7 @@
 
   /* 2. If it is an absolute link and they are not followed, throw it
      out.  */
-  if (u->scheme == SCHEME_HTTP)
+  if (u->scheme == SCHEME_HTTP || u->scheme == SCHEME_HTTPS)
     if (opt.relative_only && !upos->link_relative_p)
       {
        DEBUGP (("It doesn't really look like a relative link.\n"));
@@ -534,7 +534,7 @@
       }
 
   /* 8. */
-  if (opt.use_robots && u->scheme == SCHEME_HTTP)
+  if (opt.use_robots && (u->scheme == SCHEME_HTTP || u->scheme ==
SCHEME_HTTPS))
     {
       struct robot_specs *specs = res_get_specs (u->host, u->port);
       if (!specs)


OR, alternatively, simply edit recur.c according to the following
instructions:

  Line 440: change to
     if (u->scheme != SCHEME_HTTP && u->scheme!= SCHEME_HTTPS 

  Line 449: change to
    if (u->scheme == SCHEME_HTTP || u->scheme == SCHEME_HTTPS) 

  Line 537: change to
    if (opt.use_robots && (u->scheme == SCHEME_HTTP || u->scheme ==
SCHEME_HTTPS))
 
and that should work better.

Thomas

> 
> Thanks again, and will try --no-http-keep-alive at some point.
> Picot
> 
> Ian Abbott wrote:
> 
> > On 10 Jan 2002 at 12:39, Picot Chappell wrote:
> >
> > > Has anyone solved this issue?  I am downloading a single html page,
> > > without recursion, and not getting the 'one hop further' that should
> > > occur for framesets.
> > >
> > > I'm using wget 1.8.1, on Solaris 8.  According to the documentation,
> > > options -p and -k should work to download everything, and from previous
> > > postings I see mention that -p should go at least one more hop (also
> > > confirmed in the News items on GNU Wget news).
> >
> > Well it seems to work as advertised on my employer's
> > web-site
> > (www.mev.co.uk), at least on my machine. Can you
> > provide an example
> > which fails on your machine?
> >
> > > Below is the gist of my call:
> > >
> > > >  ./wget --ignore-length --html-extension --tries=3 --timeout=60
> > > >  --cookies=off --page-requisites --convert-links -- www.someframed.page
> >
> > That looks okay. I substituted in www.mev.co.uk and got
> > the index
> > frameset page, two frames and the images on those
> > frames as
> > expected.
> >
> > The '--ignore-length' switch slows things down rather a
> > lot though,
> > due to keep-alive connections. Adding '--no-http-keep-
> > alive' to the
> > above will speed it up.

-- 
------------------------------------------------------------
E-Soft Inc.                         http://www.e-softinc.com
Publishers of SecuritySpace     http://www.securityspace.com
Tel: 1-905-331-2260                      Fax: 1-905-331-2504   
Tollfree in North America: 1-800-799-4831

Reply via email to