You haven't been following the thread, have you. When you "count code points" you can: either count the original code "points", which is the same as counting scalar values, /because that's what an encoding form encodes/; or count the code points corresponding to code units because, well, you can match them up. The latter interpretation seemed to derive from terminological imprecision at first, but my concern and suspicion turned out to be spot-on about what Twitter did historically.
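The two counting interpretations above can be contrasted concretely. A minimal sketch (not from the thread; the sample string is my own) showing that counting code points/scalar values and counting UTF-16 code units diverge for characters outside the BMP:

```python
# Python strings are sequences of code points (scalar values),
# so len() gives the code-point count directly.
s = "a\U0001F600"  # "a" plus an emoji outside the BMP

code_points = len(s)
# Each UTF-16 code unit is 2 bytes, so divide the encoded length by 2.
utf16_units = len(s.encode("utf-16-le")) // 2

print(code_points)  # 2
print(utf16_units)  # 3 -- the emoji takes a surrogate pair in UTF-16
```

A service counting UTF-16 code units (as Twitter reportedly once did) would charge the emoji as two "characters", while a code-point count charges it as one.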

On 9/16/2013 7:19 AM, Philippe Verdy wrote:
2013/9/16 Stephan Stiller <[email protected]>

> That's exactly what happens when people confuse "code point" with "scalar value" ;-) Hmm, whom might we blame? :-)

Actually you never count scalar values. You are confusing them with code units. Twitter was originally counting UTF-16 code units, but now counts code points.

Scalar values are unrelated; they are properties assigned to code points, so that all code points have a scalar value, but the reverse is true only within the valid range 0 to 0x1FFFFF. Scalar values are only used if you need to perform arithmetic to compute code points from others. This generally does not work well within the UCS, except in a few very small ranges (such as decimal digits). The scalar value is also needed to convert from one standard UTF to another.
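The decimal-digit case mentioned above is the classic example where code-point arithmetic does work, because each decimal-digit block is contiguous. A hedged sketch (my own illustration, not from the message):

```python
# ASCII digits occupy the contiguous range U+0030..U+0039, so the
# numeric value falls out of simple code-point subtraction.
def ascii_digit_value(ch: str) -> int:
    return ord(ch) - ord('0')

# The same trick works inside other contiguous digit blocks, e.g.
# Arabic-Indic digits U+0660..U+0669.
def arabic_indic_digit_value(ch: str) -> int:
    return ord(ch) - ord('\u0660')

print(ascii_digit_value('7'))          # 7
print(arabic_indic_digit_value('\u0665'))  # 5
```

Outside such small contiguous ranges, arithmetic on code points has no meaning, which is the point being made.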
