Mark Davis <mark dot davis at jtcsv dot com> wrote:

> Little probability that right double quote would appear at the start
> of a document either. Doesn't mean that you are free to delete it
> (*and* say that you are not modifying the contents).

True, but right double quote:

(a) has a visible glyph with a well-defined human-readable meaning,
(b) isn't defined by Unicode as having a text-processing influence on
adjoining characters (leaving the question wide open of what to do when
there are fewer than two adjoining characters),
(c) doesn't have a second meaning as a signature that under certain
conditions can be stripped.

> I agree that when the UTC decides that a BOM is *only* to be used as a
> signature, and that it would be ok to delete it anywhere in a document
> (like a non-character), then we are in much better shape. This was, as
> a matter of fact proposed for 3.2, but not approved. If we did that
> for 4.0, then there would be much less reason to distinguish UTF-8
> 'withBOM' from UTF-8 'withoutBOM'.

Every one of us will be grateful when that day comes.

-Doug Ewell
 Fullerton, California


Reply via email to