>>>>> "Sean" == Sean M Burke <[EMAIL PROTECTED]> writes:
Sean> 1) XML DTDs seem a whole lot easier to parse and deal with than SGML
Sean> DTDs, and not just with content-models.
Yes... that's probably why I've all but abandoned my current track.
Lexing and parsing and understanding XML from the DTD is a lot more
interesting and long-term than doing anything with HTML. I think my
technology framework is interesting, but has toy value until we get to
XHTML.
Sean> 2) Once I have HTML::Known, I'll be able to have HTML::Element have a
Sean> pretty-print option to as_HTML that won't introduce whitespace where
Sean> it would be wrong to do so. (I.e., it'll use what's now TreeBuilder's
Sean> can_tighten to know where it can introduce newlines and whitespace for
Sean> prettyprinting.)
Cool!
Sean> 3) Raggett's program called "Tidy" is at
Sean> http://www.w3.org/People/Raggett/tidy/ and seems to work rather well
Sean> at HTML->XHTML. That's just one of the things it does. It's a fine
Sean> program, and I suggest everyone use it for, e.g., HTML/XML
Sean> prettyprinting and/or checking.
Yes, and having discovered this midway through the three weeks I was
thinking about my project, my HTML prettyprinter became all but
pointless since I seemed to be reinventing too many wheels at once.
There's only so much work I'm willing to put in to create another
WebTechniques column. :) Besides, if it stretches more than 200 lines
of Perl, I can't write about it in 10K chars. :)
--
Randal L. Schwartz - Stonehenge Consulting Services, Inc. - +1 503 777 0095
<[EMAIL PROTECTED]> <URL:http://www.stonehenge.com/merlyn/>
Perl/Unix/security consulting, Technical writing, Comedy, etc. etc.
See PerlTraining.Stonehenge.com for onsite and open-enrollment Perl training!