Re: Encoding italic

James Kass via Unicode Fri, 08 Feb 2019 17:47:13 -0800


William,

Rather than having the user insert the VS14 after every character, theeditor might allow the user to select a span of text for italicization. Then it would be up to the editor/app to insert the VS14s where appropriate.


For Andrew’s example of “fête”, the user would either type the string:
“f” + “ê” + “t” + “e”
or the string:
“f” + “e” + <U+0300 COMBINING CIRCUMFLEX ACCENT> + “t” + “e”.

If the latter, the application would insert VS14 characters after the“f”, “e”, “t”, and “e”. The application would not insert a VS14 afterthe combining circumflex — because the specification does not allow VScharacters after combining marks, they may only be used on base characters.

In the first ‘spelling’, since the specifications forbid VS charactersafter any character which is not a base character (in other words, notafter any character which has a decomposition, such as “ê”) — theapplication would first need to convert the string to the second‘spelling’, and proceed as above. This is known as converting to NFD.

So in order for VS14 to be a viable approach, any application would ①need to convert any selected span to NFD, and ② only insert VS14 aftereach base character. And those are two operations which are quitepossible, although they do add slightly to the programmer’s burden. Idon’t think it’s a “deal-killer”.

Of course, the user might insert VS14s without application assistance. In which case hopefully the user knows the rules. The worst casescenario is where the user might insert a VS14 after a non-basecharacter, in which case it should simply be ignored by anyapplication. It should never “break” the display or the processing; itsimply makes the text for that document non-conformant. (Of courseputting a VS14 after “ê” should not result in an italicized “ê”.)


Cheers,

James

Re: Encoding italic

Reply via email to