>>>>> "Sean" == Sean M Burke <[EMAIL PROTECTED]> writes:

Sean> 1) XML DTDs seem a whole lot easier to parse and deal with than SGML
Sean> DTDs, and not just with content-models.

Yes... that's probably why I've all but abandoned my current track.
Lexing, parsing, and understanding XML from its DTD is a lot more
interesting, and has a longer future, than doing anything with HTML.  I
think my technology framework is interesting, but it has only toy value
until we get to XHTML.

Sean> 2) Once I have HTML::Known, I'll be able to have HTML::Element have a
Sean> pretty-print option to as_HTML that won't introduce whitespace where
Sean> it would be wrong to do so.  (I.e., it'll use what's now TreeBuilder's
Sean> can_tighten to know where it can introduce newlines and whitespace for
Sean> prettyprinting.)

Cool!

Sean> 3) Raggett's program called "Tidy" is at
Sean> http://www.w3.org/People/Raggett/tidy/ and seems to work rather well
Sean> at HTML->XHTML.  That's just one of the things it does.  It's a fine
Sean> program, and I suggest everyone use it for, e.g., HTML/XML
Sean> prettyprinting and/or checking.

Yes, and once I discovered this midway through the three weeks I was
thinking about my project, my HTML prettyprinter became all but
pointless, since I seemed to be reinventing too many wheels at once.

There's only so much work I'm willing to put in to create another
WebTechniques column. :)  Besides, if it stretches more than 200 lines
of Perl, I can't write about it in 10K chars. :)

-- 
Randal L. Schwartz - Stonehenge Consulting Services, Inc. - +1 503 777 0095
<[EMAIL PROTECTED]> <URL:http://www.stonehenge.com/merlyn/>
Perl/Unix/security consulting, Technical writing, Comedy, etc. etc.
See PerlTraining.Stonehenge.com for onsite and open-enrollment Perl training!
