On Mar 20, 2006, at 10:54 AM, Charles Yeomans wrote:
Closing paragraph tags would be a simple exercise. I would think that it would be easier to clean up the HTML than to write another parser.
It depends entirely on how consistent and finite the variations are in the HTML coming out of the external application Adam is referring to. It could be simple, but it could also be an endless chase of breakage. Only Adam can answer that. I sure wouldn't tackle it unless I had serious confidence that I knew all the possible HTML heading my way. Without that, you would be writing a generic HTML-to-XHTML engine (with an endless chase of breakage) just to be able to parse the resulting XHTML with the XML classes. And of course, to do that properly, you would have had to write an HTML parser in the first place. Sounds like fun as long as it's someone else doing it! ;-)
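
To make the tradeoff concrete, here is a rough sketch of the "just close the paragraph tags" cleanup. It is in Python rather than REALbasic, purely for illustration, and the names close_paragraphs and is_well_formed are made up for this example. It assumes the only problem in the input is unclosed <p> tags, which is exactly the assumption that may not hold.

```python
import re
import xml.etree.ElementTree as ET

# Matches opening or closing <p> tags only (not <pre>, <param>, ...).
# Everything else in the input is passed through untouched.
P_TAG = re.compile(r"<(/?)p\b[^>]*>", re.IGNORECASE)


def close_paragraphs(html):
    """Naively close any <p> that is still open when the next <p> starts
    or when the input ends. Hypothetical helper, not a real HTML fixer."""
    out = []
    pos = 0
    open_p = False
    for m in P_TAG.finditer(html):
        out.append(html[pos:m.start()])
        if m.group(1) == "/":
            # Keep a </p> only if a paragraph is actually open;
            # a stray </p> is dropped.
            if open_p:
                out.append(m.group(0))
                open_p = False
        else:
            # A new <p> implicitly ends the previous one.
            if open_p:
                out.append("</p>")
            out.append(m.group(0))
            open_p = True
        pos = m.end()
    out.append(html[pos:])
    if open_p:
        out.append("</p>")
    return "".join(out)


def is_well_formed(fragment):
    """Wrap the fragment in a dummy root and see whether an XML parser accepts it."""
    try:
        ET.fromstring("<root>" + fragment + "</root>")
        return True
    except ET.ParseError:
        return False


simple = close_paragraphs("<p>one<p>two")
messy = close_paragraphs("<p>intro<table><tr><td>cell<p>next</table>")
print(simple, is_well_formed(simple))
print(messy, is_well_formed(messy))
```

The first input comes out clean and parses. The second still fails to parse, because the paragraph fix knows nothing about the unclosed <tr> and <td>, and it ends up inserting a </p> inside the table cell. Every new variation like that means another special case, which is the chase I mean.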
-stephen
===========================
Stephen Tallent
Tallent Communications, Inc.
[EMAIL PROTECTED]
http://www.tallent.com
