|
Hi,
I've posted this before but didn't see any
response. Can someone who knows about this pls
helpA?
I have a question
about how to selectively disable parsing of certain tags? The situation is
I have a html file that displays well in browser but not conform to the xHTML
standard, can I still use xerces-c to do some parsing, say i
have:
<html>
<head> <title>my page</title> </head> <body bgcolor="#FFFFFF"> <a
name="top_of_page"></a>
<a href="http://www.someplace.com"><img src="http://www.someplace.com/aaa/images/1.gif" hspace="0" vspace="0" width="10" height="1" border="0" alt="Do this" align="right"></a> </body>
</html>
The IMG tag doesn't
conform, can I do some preconfiguration to ignore certain tags so I only parse
<a>, <html>, <head>, <body> and treat the rest as
content only? I still want to parse this doc but don't want the
bad-formness of it stops the processing and I am not interested in some tags
that are mis-used most of the time.
Thanks, Rene |
- Re: pls help: how to treat certain tags as content only? Rene Chan
- Re: pls help: how to treat certain tags as content o... Shawn Wilson
- Re: pls help: how to treat certain tags as content o... Joseph_Kesselman
