in writing a large crawler using LWP, i found that this site generates "uninitialized value" warnings:

    GET http://www.swimnews.com

    Use of uninitialized value in substitution (s///) at /usr/local/lib/perl5/site_perl/5.6.1/Net/HTTP/Methods.pm line 425.
    Use of uninitialized value in substitution (s///) at /usr/local/lib/perl5/site_perl/5.6.1/Net/HTTP/Methods.pm line 426.
    Use of uninitialized value in hex at /usr/local/lib/perl5/site_perl/5.6.1/Net/HTTP/Methods.pm line 427.

i am also seeing a bunch of the warnings shown below. i can't seem to reproduce them with GET. these are some of the sites that generate them:

    http://bizjournals.bcentral.com/journals/demographics/
    http://www.channelweb.com/

    Use of uninitialized value in substr at /usr/local/lib/perl5/site_perl/5.6.1/i86pc-solaris/HTML/PullParser.pm line 82.
    Use of uninitialized value in length at /usr/local/lib/perl5/site_perl/5.6.1/i86pc-solaris/HTML/PullParser.pm line 85.

i also found an odd problem with date parsing in HTTP/Date.pm:

    @MoY = qw(Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec);
    @MoY{@MoY} = (1..12);

    # Translate month name to number
    $mon = $MoY{$mon} ||
           $MoY{"\u\L$mon"} ||
           ($mon >= 1 && $mon <= 12 && int($mon)) ||
           return;

if $mon is a string that is not a 3 char month name, neither hash lookup matches, so it falls through to the numeric comparison against 1 and 12 and generates a warning. i suggest truncating $mon to 3 chars before the lookup, or verifying that it is all digits before the 1-12 test. i can't find the site that triggered this, but it was probably a bad server sending long month names.

none of these are critical and i am just logging them for now.

thanx,

uri

-- 
Uri Guttman  ------  [EMAIL PROTECTED]  --------  http://www.stemsystems.com
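
a minimal sketch of the two month-parsing suggestions above, combined in one routine. month_number() is a hypothetical helper written for illustration only; it is not part of HTTP::Date and is not the fix the module actually shipped. it assumes that any non-numeric token should be matched on its first three characters.

```perl
use strict;
use warnings;

# Lookup table equivalent to the one quoted from HTTP/Date.pm.
my @MoY = qw(Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec);
my %MoY;
@MoY{@MoY} = (1 .. 12);

# Hypothetical helper (not part of HTTP::Date): map a month token to 1..12,
# or return undef when the token is not recognizable.
sub month_number {
    my ($mon) = @_;
    return unless defined $mon && length $mon;

    # All-digit tokens: only here is a numeric comparison safe,
    # so no "isn't numeric" warning can be raised.
    if ($mon =~ /^\d{1,2}$/) {
        return ($mon >= 1 && $mon <= 12) ? int($mon) : undef;
    }

    # Names: normalize case and truncate to three characters,
    # so "September" and "SEPT" both look up as "Sep".
    my $key = ucfirst lc substr($mon, 0, 3);
    return $MoY{$key};
}
```

with this, month_number("September") and month_number("sep") both give 9, month_number("07") gives 7, and month_number("13") or month_number("Foo") give undef without any warning. one trade-off of truncating: a malformed token such as "Marchxyz" would be accepted as March, so a stricter check may be preferable.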
