My understanding is that white space outside of tags
is not significant and between tags IS significant.

So if you had a pretty-printed XML doc, the CRLF and
tags would be ignorable, but the stuff between tags
can't be thrown away:

<x>
   <y> Can't toss the leading or trailing blanks </y>
<!-- tabs and newlines can be ignored -->
</x>

If M$ is assuming that whitespace in element content
is ignorable, I'd have to disagree. - MOD


--- "Arnold, Curt" <[EMAIL PROTECTED]> wrote:
> I assume that your source files don't have a
> document type declaration.  If you had one, then
> Xerces would have been able to safely infer that
> whitespace in element content was ignorable.
> 
> However without a DTD, it is impossible to know for
> certain whether the whitespace is part of mixed
> content and is significant or part of element
> content and insignificant.  Microsoft's parsers
> guess
> that whitespace between start and end tags is
> ignorable, but that isn't always that case (though
> it is most of the time).  However, the safer
> assumption is that the whitespace is significant.
> 
> If you don't want to provide a DTD but do want to
> suppress the production of the whitespace nodes,
> probably the best approach would be to provide a
> filter so that the SAX events that produce the
> whitespace nodes don't ever get to the DOM builder.
> 
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> [EMAIL PROTECTED]
> For additional commands, e-mail:
> [EMAIL PROTECTED]
> 


__________________________________________________
Do You Yahoo!?
Yahoo! Auctions - buy the things you want at great prices
http://auctions.yahoo.com/

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to