Randal L. Schwartz wrote:
>[...] The idea would be to point it at a w3.org DTD
> and pretty print an HTML doc, or at least generate the missing tags so
> that XHTML conversion was easier.
I've nothing to add that would get you any closer to what you're
after, but these things of tangential relevence do come to mind:
1) XML DTDs seem a whole lot easier to parse and deal with than SGML
DTDs, and not just with content-models.
2) Once I have HTML::Known, I'll be able to have HTML::Element have a
pretty-print option to as_HTML that won't introduce whitespace where
it would be wrong to do so. (I.e., it'll use what's now TreeBuilder's
can_tighten to know where it can introduce newlines and whitespace for
prettyprinting.)
3) Raggett's program called "Tidy" is at
http://www.w3.org/People/Raggett/tidy/ and seems to work rather well
at HTML->XHTML. That's just one of the things it does. It's a fine
program, and I suggest everyone use it for, e.g., HTML/XML
prettyprinting and/or checking.
--
Sean M. Burke [EMAIL PROTECTED] http://www.netadventure.net/~sburke/