in writing a large crawler using LWP, i found this site generates
uninitialized warnings:

GET http://www.swimnews.com
Use of uninitialized value in substitution (s///) at 
/usr/local/lib/perl5/site_perl/5.6.1/Net/HTTP/Methods.pm line 425.
Use of uninitialized value in substitution (s///) at 
/usr/local/lib/perl5/site_perl/5.6.1/Net/HTTP/Methods.pm line 426.
Use of uninitialized value in hex at 
/usr/local/lib/perl5/site_perl/5.6.1/Net/HTTP/Methods.pm line 427.


i am also seeing a bunch of these warnings. i can't seem to reproduce
them with GET.

these are some of the sites that generate these warnings.

http://bizjournals.bcentral.com/journals/demographics/
http://www.channelweb.com/


Use of uninitialized value in substr at
/usr/local/lib/perl5/site_perl/5.6.1/i86pc-solaris/HTML/PullParser.pm
line 82.

Use of uninitialized value in length at
/usr/local/lib/perl5/site_perl/5.6.1/i86pc-solaris/HTML/PullParser.pm
line 85.


i also found an odd problem with date parsing in HTTP/Date.pm

@MoY = qw(Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec);
@MoY{@MoY} = (1..12);

    # Translate month name to number
    $mon = $MoY{$mon} ||
           $MoY{"\u\L$mon"} ||
           ($mon >= 1 && $mon <= 12 && int($mon)) ||
           return;

if $mon is not a 3 char month, but it is a string, it will be
numerically compared to 1 and 12 and generate a warning. i suggest
truncating $mon to 3 chars before the lookup or verifying it is all
digits before the 1-12 test. i can't find the site that triggered this
but it was probably a bad server with long month names.

all of those are not critical and i am just logging them for now. 

thanx,

uri


-- 
Uri Guttman  ------  [EMAIL PROTECTED]  -------- http://www.stemsystems.com
----- Stem and Perl Development, Systems Architecture, Design and Coding ----
Search or Offer Perl Jobs  ----------------------------  http://jobs.perl.org

Reply via email to