My understanding is that white space outside of tags is not significant and between tags IS significant.
So if you had a pretty-printed XML doc, the CRLF and tags would be ignorable, but the stuff between tags can't be thrown away: <x> <y> Can't toss the leading or trailing blanks </y> <!-- tabs and newlines can be ignored --> </x> If M$ is assuming that whitespace in element content is ignorable, I'd have to disagree. - MOD --- "Arnold, Curt" <[EMAIL PROTECTED]> wrote: > I assume that your source files don't have a > document type declaration. If you had one, then > Xerces would have been able to safely infer that > whitespace in element content was ignorable. > > However without a DTD, it is impossible to know for > certain whether the whitespace is part of mixed > content and is significant or part of element > content and insignificant. Microsoft's parsers > guess > that whitespace between start and end tags is > ignorable, but that isn't always that case (though > it is most of the time). However, the safer > assumption is that the whitespace is significant. > > If you don't want to provide a DTD but do want to > suppress the production of the whitespace nodes, > probably the best approach would be to provide a > filter so that the SAX events that produce the > whitespace nodes don't ever get to the DOM builder. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: > [EMAIL PROTECTED] > For additional commands, e-mail: > [EMAIL PROTECTED] > __________________________________________________ Do You Yahoo!? Yahoo! Auctions - buy the things you want at great prices http://auctions.yahoo.com/ --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
