On Mon, Mar 21, 2005 at 06:51:42PM +0100, Reinier Post wrote: > On Sun, Mar 20, 2005 at 01:51:25PM -0800, Bill Moseley wrote: > > On Sun, Mar 20, 2005 at 06:02:26PM +0300, [EMAIL PROTECTED] wrote: > > > Hello libwww, > > > > > > using it to parse html-forms etc... > > > noticed, that it recognizes strange comment > > > like <!--> as starting of the comment, > > > not like the whole empty comment, as IE. > > > > Doesn't seem like that's a valid comment. > > > > http://www.w3.org/TR/WD-html40-970917/intro/sgmltut.html#h-3.1.4 > > Well, the HTML:Parser perldoc says: > > HTML::Parser is not a generic SGML parser. We have tried to make it > able to deal with the HTML that is actually "out there", and it normally > parses as closely as possible to the way the popular web browsers do it > instead of strictly following one of the many HTML specifications from > W3C. Where there is disagreement, there is often an option that you can > enable to get the official behaviour.
Hard to imagine handling every possibility as an option. I would have thought an empty comment would be at a minimum: <!-- --> or maybe <!----> although I'm still trying to grasp the concept of an empty comment. -- Bill Moseley [EMAIL PROTECTED]