RE: Non-ascii string processing?

Jill Ramonsky Mon, 06 Oct 2003 12:21:02 -0700

Nor I. "Characters" are perhaps the most useless objects ever invented.

Now - a count of DEFAULT GRAPHEME CLUSTERs might be useful (for example, for display on a console which uses fixed-width fonts). Indeed, a whole class of DEFAULT GRAPHEME CLUSTER handling functions might come in very handy indeed. Bytes are useful. Default grapheme clusters are useful. But a "character"? What's the point?

But then, a default grapheme cluster might theoretically require up to 16 Unicode characters. (Maybe more, I don't know). Even bit-packed to 21 bits per character, that still gives us 336 bits. So I conclude that our string processing functions could go a lot faster if only we'd all use UTF-336. Er....?

Jill


> -----Original Message-----
> From: Marco Cimarosti [mailto:[EMAIL PROTECTED]
> Sent: Monday, October 06, 2003 11:10 AM
> To: 'Doug Ewell'; Unicode Mailing List
> Cc: Theodore H. Smith
> Subject: RE: Non-ascii string processing?
>
>
> What strlen() cannot do is count�ng the number of
> *characters* in a string.
> But who cares? I can imagine very few situations where someone such an
> information would be useful.
>
> _ Marco
>

RE: Non-ascii string processing?

Reply via email to