Hi all,
I've been experimenting with Sitescooper and Plucker as an AvantGo
replacement and have now got it working pretty well. I have, however,
noticed an annoyance, which I think is a bug.
It occurs with Sitescooper 3.1.2 when scooping in multipage plucker format
and can be reproduced using digiguide_tv_listings.site from the samples
included in the distribution.
Basically, all of the '^^' links, including the one on the contents page
itself, refer to '0.html', which doesn't exist.
I have traced this problem and implemented a fix as follows:
At Robot.pm line 1018 (in sub href_to_multipage_filename)...
  my $assigned = $self->{page_to_tmpfile}->{$url};
  if (defined $assigned) { return $assigned; }
+
+ # handle redirections - added by Andy Carlson
+ my $redir_from = $self->{redirections_backwards}{$url};
+ $redir_from ||= '';
+ my @these_urls = ($url, split (' ', $redir_from));
+
+ foreach $url (@these_urls) {
+     my $assigned = $self->{page_to_tmpfile}->{$url};
+     if (defined $assigned) { return $assigned; }
+ }
  # new-style: just use numeric filenames. saves space.
(Note: the above is not real 'diff' output because I'm not a real Unix
person, but hopefully it's clear enough.)
It looks as if the 'real' URL to 'scooped' URL mapping uses the original
'real' URL for the contents page without checking for redirects, which the
patch above fixes. There may of course be other ways to fix this (with a
better knowledge of the design).
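
To illustrate the idea outside of Sitescooper, here is a minimal standalone
sketch of the lookup the patch performs. It is not the actual Robot.pm code:
the sub name lookup_assigned and the example.com URLs are made up, and the
assumption that redirections_backwards maps a final URL to a space-separated
list of the URLs that redirected to it is inferred only from the split in the
patch.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the relevant Robot.pm state; all URLs are invented.
my $self = {
    # Pages already assigned a temporary file, keyed by the URL as first requested.
    page_to_tmpfile => {
        'http://example.com/index.html' => '1.html',
    },
    # Assumed layout: final URL => space-separated URLs that redirected to it.
    redirections_backwards => {
        'http://example.com/home/' => 'http://example.com/index.html',
    },
};

# Return the file already assigned to $url, also trying every URL that
# redirected to it. Returns nothing (undef in scalar context) if none match.
sub lookup_assigned {
    my ($self, $url) = @_;
    my $redir_from = $self->{redirections_backwards}{$url} || '';
    foreach my $candidate ($url, split(' ', $redir_from)) {
        my $assigned = $self->{page_to_tmpfile}{$candidate};
        return $assigned if defined $assigned;
    }
    return;
}

# The post-redirect URL now resolves to the file stored under the original URL.
print lookup_assigned($self, 'http://example.com/home/'), "\n";   # prints "1.html"
```

Without the redirect step, the lookup for the post-redirect URL finds nothing
and the caller falls through to assigning a new filename, which would be
consistent with the dangling '0.html' links described above.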
I haven't checked whether there are similar issues on 3-level sites, because
I haven't got any real ones (Digiguide now seems to be just 2 levels; it looks
like it's changed since the .site was written). Perhaps someone else can
suggest a 3-level site which redirects its top-level page?
Regards, Andy
----------------------------------------------------------------
Andy Carlson
_______________________________________________
Sitescooper-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/sitescooper-talk