Not to criticize someone's hard work (OK, I guess I am...) but aren't we 
barking up the wrong tree here? Shouldn't it be possible to discover 
everything we need to know about HTML by parsing a DTD? That way as the 
language changes, software would automatically recognize new tags.  :)

(Of course, *which* DTD to parse could be problematic... we might need a 
"sloppy" DTD covering all the unofficial and obsolete tags that exist in the 
real world.)

The stuff with in_head in_body etc. could be generalized -- as mchase notes, 
td is "in_tr" not just "in_body".

Also re <TR> <TD> etc. there are a few (million) ill formed documents out 
there, so you need to know that a </TR> also ends a pending <TD> even if some 
bozo forgot to say </TD>




From:   mchase%ix.netcom.com@Internet on 99-12-28 08:21 AM
To:     libwww%perl.org@Internet, sburke%netadventure.net@Internet
cc:      (bcc: Marvin Simkin)
Subject:        Re: HTML::Known, containing HTML parameters

Would it be worthwhile to mention required parents?  For example, <tr>
doesn't mean much without an enclosing <td>.

Will this module also include information about boolean attributes?

I think I like HTML::Tags best so far.
--
Mac :})
----- Original Message -----
From: Sean M. Burke <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: 27 December, 1999 15:09
Subject: HTML::Known, containing HTML parameters


> A few days back we kicked around the idea of having one module
> containing all the facts about all the HTML tags.  Here's my first
> stab at such a thing, made by just combining things from Element,
> TreeBuilder, and LinkExtor.  Thoughts, anyone?
>
> (Suggestions for the name are especially welcome.  So far I've
> considered HTML::Tags, but that's bad because it's too close to
> HTML::Element, and HTML::DocType, which is not so good because it
> implicates this is a DTD, or the kind of information you'd find in a
> DTD.)





Reply via email to