> Yep, it seems like the parser will replace the with a different
> kind of spaces (0xA0 instead of 0x20). The viewer will not recognize
> them as spaces so they will not be removed.
Yes, 0xA0 is the Unicode character code for non-breaking space. Since
nbsp is not one of the characters defined as "white space" in HTML (or
XHTML), and thus is not compressed as normal spaces are, it's
traditionally been used in HTML documents to provide non-collapsed
spacing. However, the HTML 4.01 specification explicitly says,
``This specification does not indicate the behavior, rendering or
otherwise, of space characters other than those explicitly identified
here as white space characters. For this reason, authors should use
appropriate elements and styles to achieve visual formatting effects
that involve white space, rather than space characters.''
However, I'd think that the current viewer handling of it is fine, at
least until we support style sheets and XML DTDs.
Bill