Hi all,

I've been experimenting with Sitescooper and Plucker as an AvantGo 
replacement and have now got it working pretty well. I have, however, 
noticed an annoyance, which I think is a bug.

It occurs with Sitescooper 3.1.2 when scooping in multipage plucker format 
and can be reproduced using digiguide_tv_listings.site from the samples 
included in the distribution.

Basically, all of the '^^' links, including the one on the contents page 
itself, refer to '0.html', which doesn't exist.

I have traced this problem and implemented a fix as follows:

At Robot.pm line 1018 (in sub href_to_multipage_filename)...

  my $assigned = $self->{page_to_tmpfile}->{$url};
  if (defined $assigned) { return $assigned; }
+
+  # handle redirections - added by Andy Carlson
+  my $redir_from = $self->{redirections_backwards}{$url};
+  $redir_from ||= '';
+  my @these_urls = ($url, split (' ', $redir_from));
+
+  foreach $url (@these_urls) {
+       my $assigned = $self->{page_to_tmpfile}->{$url};
+       if (defined $assigned) { return $assigned; }
+  }

  # new-style: just use numeric filenames. saves space.

(Note: the above is not real 'diff' output because I'm not a real Unix 
person, but hopefully it's clear enough.)

It looks as if the 'real' URL to 'scooped' URL mapping uses the original 
'real' URL for the contents page without checking for redirects, which the 
patch above fixes. There may of course be other ways to fix this, given a 
better knowledge of the design.
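
For anyone who wants to try the lookup logic outside Robot.pm, here is a 
minimal standalone sketch of the same idea. Only the hash names 
page_to_tmpfile and redirections_backwards come from the patch; the sub 
name lookup_assigned_file, the example URLs, and the assumption that 
redirections_backwards maps a final URL to a space-separated list of the 
URLs that redirected to it are illustrative guesses, not confirmed 
Sitescooper internals.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the relevant Robot.pm state. The URLs and
# filenames are made up for illustration.
my $self = {
    # original (pre-redirect) URL => filename already assigned to it
    page_to_tmpfile        => { 'http://example.com/' => 'index.html' },
    # assumed layout: final URL => space-separated URLs that redirected to it
    redirections_backwards => {
        'http://example.com/home' => 'http://example.com/',
    },
};

# Return the filename already assigned to $url, or to any URL that
# redirected to it. Returns undef (in scalar context) if none is assigned.
sub lookup_assigned_file {
    my ($self, $url) = @_;
    my $redir_from = $self->{redirections_backwards}{$url} || '';
    foreach my $candidate ($url, split(' ', $redir_from)) {
        my $assigned = $self->{page_to_tmpfile}{$candidate};
        return $assigned if defined $assigned;
    }
    return;
}

# The redirected URL resolves to the file assigned to the original URL.
print lookup_assigned_file($self, 'http://example.com/home'), "\n";
```

Using a separate loop variable ($candidate) avoids reusing $url inside the 
loop, and including $url in the candidate list makes a separate direct 
lookup unnecessary.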

I haven't checked whether there are similar issues on 3-level sites, because 
I haven't got any real ones (Digiguide now seems to be just 2 levels; it 
looks like it has changed since the .site was written). Perhaps someone else 
can suggest a 3-level site which redirects its top-level page?

Regards, Andy
----------------------------------------------------------------
Andy Carlson




_______________________________________________
Sitescooper-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/sitescooper-talk
