I'm parsing a text file containing an XHTML document into a Document
object, finding a fragment via XPath, then doing a regular expression
search on the fragment as textual XML. I'm finding that the XML text
rendered from the XMLWriter via asXML() is not consistent with the input
document. 

For example, when a table cell tag that came in as <td> is rendered, it
comes out as <td rowspan="1" colspan="1">. Since the regular expressions
are developed while looking at the input XHTML document, having the
fragment rendered differently is inconvenient in the least and wrong in
the worst case.

Are these extra attributes defaults found in the document type? Is there
a way to get a faithful rendering of a document fragment wrt to the
input?

Thanks,
Paul Kaiser



-------------------------------------------------------
This SF.Net email is sponsored by: GNOME Foundation
Hackers Unite!  GUADEC: The world's #1 Open Source Desktop Event.
GNOME Users and Developers European Conference, 28-30th June in Norway
http://2004/guadec.org
_______________________________________________
dom4j-user mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/dom4j-user

Reply via email to