On Mon, Oct 06, 2003 at 01:52:26PM +0200,
 Marco Cimarosti <[EMAIL PROTECTED]> wrote 
 a message of 51 lines which said:

> a word like "�lite" is always counted as five characters, regardless
> that it might be encoded as six Unicode "characters".

I assume that everybody on this list knows that you count characters
only after a proper normalization... (like many operations on Unicode
texts).

> 3) That is a very silly count anyway. If you want to have an idea of the
> "size" of a document, lines or words are much more useful units.

Tell that to the editor (editors of paper publications still talk with
this unit "3 000 characters, no more, for tommorrow morning").
 
> OK. But the length in "characters" of a string is not "character semantics":
> it's plain nonsense, IMHO.

I disagree.

Reply via email to