Re: Least used parts of BMP.

Doug Ewell Sat, 05 Jun 2010 09:45:01 -0700

Philippe Verdy <verdy underscore p at wanadoo dot fr> wrote:

Of course, he will not have other UTF-8-like features, such asavoidance of ASCII values in the final trail byte, and "fast forwardparsing" by looking at the first byte.
The fast forward feature is certianly not decisive, but the randomacessibility (from any position and in any direction) is certainlymuch more decisive and is a real positive factor for UTF-8, ratherthan the format proposed above, which can only be read in the forwarddirection, even if it can be accessed randomly to find the *next*character. to find the *previous* one, you have to scan backward untilyou eat at least one byte used to encode the character before it(otherwise, you don't know if a 1xxxxxx byte is the first one in asequence, even if you can know if a byte is the last one.

Kannan is looking for a format for a protocol that he is developing.Maybe scanning backwards through a string is not a scenario that willever be encountered in this protocol. It's not for us to say.


--
Doug Ewell  |  Thornton, Colorado, USA  |  http://www.ewellic.org
RFC 5645, 4645, UTN #14  |  ietf-languages @ http://is.gd/2kf0s

Re: Least used parts of BMP.

Reply via email to