[EMAIL PROTECTED] (Adam P. Harris) writes: > Yes, this is the crux, for me. I actually don't rule out using either > (a) a simple perl module wrapping around nsgmls in conjunction with a > DTD, or (b) writing a perl module to parse SGML down to simple data > structures (list of hashes comes to mind) on it's own in such as way > as it has 98% SGML (or XML) coverage. There may (should!) be a std > CPAN Perl module for this, but I haven't found it yet.
Have you looked at HTML::Parse (or is it HTML::Parser -- I always forget). It is not limited to HTML tags. Though it might not grok all of the advanced SGML stuff, such as omitted start tags or short tags or stuff like that. But these aren't in XML anyway, right? kai -- Really cancel? [OK] [Cancel] -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

