Hi,
 
I posted this before but didn't see any response.  Can someone who knows about this please help?
 
 
I have a question about how to selectively disable parsing of certain tags.  I have an HTML file that displays well in a browser but does not conform to the XHTML standard.  Can I still use xerces-c to parse it?  Say I have:
 
<html>

<head>
        <title>my page</title>
</head>

<body bgcolor="#FFFFFF">
<a name="top_of_page"></a>
<a href="http://www.someplace.com"><img
src="http://www.someplace.com/aaa/images/1.gif" hspace="0" vspace="0" width="10" height="1" border="0" alt="Do this" align="right"></a>
</body>
 
</html>
 
The IMG tag doesn't conform (it is never closed).  Can I preconfigure the parser to ignore certain tags, so that it only parses <a>, <html>, <head>, and <body> and treats the rest as content only?  I still want to parse this document, but I don't want its ill-formedness to stop the processing, and I am not interested in some tags that are misused most of the time.
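
To make concrete what I mean by "treat the rest as content only", here is a rough sketch of a pre-pass that could run over the text before it is handed to the parser.  The function name escapeUnlistedTags and the whitelist are my own invention for illustration, not anything xerces-c provides, and the scan is deliberately naive: it assumes no '>' appears inside an attribute value, and it does not touch stray '&' characters or repair unquoted attributes on the tags that are kept.

```cpp
#include <cctype>
#include <set>
#include <string>

// Hypothetical helper (NOT part of xerces-c): every tag whose name is not in
// `keep` has its '<' and '>' rewritten to "&lt;" and "&gt;", so an XML parser
// would see it as character data instead of markup.
std::string escapeUnlistedTags(const std::string& in,
                               const std::set<std::string>& keep)
{
    std::string out;
    std::size_t i = 0;
    while (i < in.size()) {
        if (in[i] != '<') {                  // ordinary text: copy through
            out += in[i++];
            continue;
        }
        std::size_t close = in.find('>', i);
        if (close == std::string::npos) {    // stray '<' with no '>': escape it
            out += "&lt;";
            ++i;
            continue;
        }
        // Extract the lowercased tag name, skipping the '/' of an end tag.
        std::size_t p = i + 1;
        if (p < close && in[p] == '/') ++p;
        std::string name;
        while (p < close && std::isalnum(static_cast<unsigned char>(in[p]))) {
            name += static_cast<char>(
                std::tolower(static_cast<unsigned char>(in[p])));
            ++p;
        }
        if (keep.count(name)) {
            out.append(in, i, close - i + 1);      // whitelisted: keep as markup
        } else {
            out += "&lt;";                         // anything else: make it text
            out.append(in, i + 1, close - i - 1);
            out += "&gt;";
        }
        i = close + 1;
    }
    return out;
}
```

Called with the whitelist {"a", "html", "head", "body"} on the page above, this would leave those four tags alone and turn the <title> and <img ...> tags into escaped text.  What I would like to know is whether xerces-c can be configured to behave this way itself, so that a pre-pass like this is not needed.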
 
 

Thanks,

Rene

 
