Hi,

What it means is that all tags have to have closing tags.

E.g. 

<Body>
        <H1> HI </H1>
</Body>

In other words, the HTML has to be well-formed XML.
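
To make the "every tag must be closed" rule concrete, here is a minimal sketch in plain C++ that checks only tag balance. It is not how Xerces works internally and it is not a real XML parser: the function name tagsAreBalanced is made up for illustration, and the check assumes no comments, CDATA sections, or '>' characters inside attribute values.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper (not part of Xerces): returns true if every start tag
// in `doc` is closed by a matching end tag in the right order, or is written
// as an empty-element tag such as <meta ... />.
// Simplifications: comments, CDATA and '>' inside attribute values are not handled.
bool tagsAreBalanced(const std::string& doc) {
    std::vector<std::string> open;  // stack of currently open element names
    std::size_t pos = 0;
    while ((pos = doc.find('<', pos)) != std::string::npos) {
        const std::size_t end = doc.find('>', pos);
        if (end == std::string::npos) return false;  // unterminated tag
        const std::string tag = doc.substr(pos + 1, end - pos - 1);
        pos = end + 1;
        if (tag.empty()) return false;
        if (tag[0] == '!' || tag[0] == '?') continue;  // DOCTYPE or processing instruction

        const bool closing = (tag[0] == '/');
        const bool selfClosing = (tag.back() == '/');

        // The element name runs up to the first whitespace or '/'.
        const std::size_t start = closing ? 1 : 0;
        const std::size_t nameEnd = tag.find_first_of(" \t\r\n/", start);
        const std::string name = (nameEnd == std::string::npos)
                                     ? tag.substr(start)
                                     : tag.substr(start, nameEnd - start);
        if (name.empty()) return false;

        if (closing) {
            // An end tag must match the most recently opened element.
            if (open.empty() || open.back() != name) return false;
            open.pop_back();
        } else if (!selfClosing) {
            open.push_back(name);
        }
    }
    return open.empty();  // anything still open was never closed
}
```

With this sketch, the <Body> example above passes, while a document containing an unclosed <meta ...> fails as soon as </head> is reached, because meta is still on the stack. Writing it as <meta ... /> makes the check pass, which is the XHTML style an XML parser expects.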

Thanks

Pete

-----Original Message-----
From: Fernando [mailto:[EMAIL PROTECTED]
Sent: 03 August 2003 15:54
To: [EMAIL PROTECTED]
Subject: Re: Xerces C / HTML validate via W3/DTD


Dave:

Thanks for your response. But, in the Xerces documentation FAQ there is a
question:

"Can I use Xerces C++ to parse HTML?" and the answer is "Yes.. only if the 
HTML follows the rules given in the XML specification..."

What does that mean? Can you give me a pointer here?


Regards.



At 12:08 AM 8/3/2003 -0700, you wrote:

>Xerces-C is an XML, not an SGML/HTML parser, so you cannot use it to parse
>HTML.
>
>Dave
>
>
>
>Hi:
>
>Has any of you managed to parse/validate an HTML file, pre-parsing the
>grammar with the W3 DTD?
>
>I'm doing:
>____________________________
>SAX2XMLReader* parser = XMLReaderFactory::createXMLReader();
>parser->setFeature(XMLUni::fgXercesUseCachedGrammarInParse, true);
>
>Grammar * gr=parser->loadGrammar(dtd_file_in_source,
>Grammar::DTDGrammarType, true);
>//here gr is not NULL!!
>
>parser->parse(some_html_in_source);
>____________________________
>
>
>But the parse stops at <meta....>. The meta tag doesn't have to be
>slash-terminated; that rule is in the DTD. ;-)
>So, is the DTD being parsed?
>
>Any pointers will be appreciated.
>Regards.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

