Thanks Ken. I don't know how I missed the text on 326 when I scanned it
before I mailed.
tex

Kenneth Whistler wrote:
> 
> Tex asked:
> 
> > But does the standard address their removal by receivers (or
> > intermediaries) , and does removing them include removing the contained
> > annotation?
> 
> Yes and yes. p. 326:
> 
> "On input, a plain text receiver should either preserve all characters
>                                         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
> or remove the interlinear annotation characters as well as the annotating
>    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
> text..."
> ^^^^
> 
> >
> > I can imagine an application that doesn't support I.A. deciding the
> > annotation is out of band and can't be preserved in its plain text
> > output, and so justifiably strips it as well.
> > Does the standard say what to do with "for internal use" only
> > characters?
> 
> Yes. Unicode 3.1:
> 
> D7b: Noncharacter: a code point that is permanently reserved for
>      internal use, and that should never be interchanged.
> 
> C10: A process shall make no change in a valid coded character
>      representation other than the possible replacement of
>      character sequences by their canonical-equivalent sequences
>      or the deletion of noncharacter code points, if that process
>      purports not to modify the interpretation of that coded
>      character sequence.
> 
> The interlinear annotation characters fall in a gray zone, since
> they are not noncharacters, but by rights ought to have been.
> Since they are standard characters though, the standard has to
> provide some guidelines -- and it is simply safer, if you encounter
> and delete them, to also delete the annotation. You would be changing
> the interpretation of the text, but in a knowing, intended manner.
> 
> >
> > I would have thought the rule was to ignore and pass along.
> 
> In general, yes, as for everything else, including unassigned
> code points. If your role in life is as a database, for example,
> or some other kind of data source or data pipe, then minimal
> meddling with the bytes is safest. But other kinds of processes
> will do graduated manipulations, depending on what they are
> aiming for.
> 
> --Ken

-- 
-------------------------------------------------------------
Tex Texin   cell: +1 781 789 1898   mailto:[EMAIL PROTECTED]
Xen Master                          http://www.i18nGuy.com
                         
XenCraft                            http://www.XenCraft.com
Making e-Business Work Around the World
-------------------------------------------------------------

Reply via email to