In my opinion the DOM should never include ignorable whitespace and Xerces should be fixed to ignore that. The DOM present the information model and these white spaces, whether they originated in your text editor or in the pretty printer, are not informative.
ProjectX and OpenXML do not include such whitespace in the DOM. arkin "Pardoe, Julian" wrote: > > Assaf Arkin [mailto:[EMAIL PROTECTED] wrote: > <<<The Xerces parser insists on adding whitespaces in the original document > into the DOM as text nodes. [...] If you extract all these whitespaces from > the original personal.xml [...]>>> > > Doesn't this suggest that DOM should have some standard facility for pruning > white-space text nodes from the tree? ...or for never adding them to the > tree in the first place. It's something a lot of people are going to want. > A facility for normalization of white space within text-nodes might also be > handy. > > -- jP -- > > This message is for the named person's use only. It may contain > confidential, proprietary or legally privileged information. No > confidentiality or privilege is waived or lost by any mistransmission. > If you receive this message in error, please immediately delete it and all > copies of it from your system, destroy any hard copies of it and notify the > sender. You must not, directly or indirectly, use, disclose, distribute, > print, or copy any part of this message if you are not the intended > recipient. CREDIT SUISSE GROUP, CREDIT SUISSE FIRST BOSTON, and each of > their subsidiaries each reserve the right to monitor all e-mail > communications through its networks. Any views expressed in this message > are those of the individual sender, except where the message states > otherwise and the sender is authorised to state them to be the views of > any such entity.