Re: {Spam?} Re: [Help-smalltalk] [Q] Unicode String?

Paolo Bonzini Fri, 07 Jul 2006 02:17:18 -0700

Chun Sungjin wrote:

Hi,
The main problem is that, for example, if I create an instance of String like this:
a := 'Some MultiByte Encoded String'.

then

a size

does not answer the correct length of the string.

Well, strlen does not do that in C, either. You need mbrlen, and #size is more like strlen than mbrlen.
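
To make the C comparison concrete, here is a minimal sketch of counting characters by calling mbrlen in a loop. The helper name mb_char_count is made up for illustration. The result depends on the LC_CTYPE locale the program has selected with setlocale; in the default "C" locale only plain ASCII can be relied on.

```c
#include <stddef.h>
#include <string.h>
#include <wchar.h>

/* Hypothetical helper: count multibyte characters in s using mbrlen.
   Bytes are interpreted according to the current LC_CTYPE locale.
   Returns (size_t)-1 on an invalid or truncated sequence. */
size_t mb_char_count(const char *s)
{
    mbstate_t state;
    size_t left = strlen(s);   /* bytes remaining; this is all strlen knows */
    size_t count = 0;

    memset(&state, 0, sizeof state);   /* start in the initial shift state */
    while (left > 0) {
        size_t n = mbrlen(s, left, &state);
        if (n == (size_t)-1 || n == (size_t)-2)
            return (size_t)-1;         /* invalid or incomplete character */
        if (n == 0)
            break;                     /* null character: end of string */
        s += n;                        /* skip the bytes of this character */
        left -= n;
        count++;
    }
    return count;
}
```

For pure ASCII input the two counts agree; they only diverge once a character takes more than one byte in the selected locale.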

Also, the result heavily depends on the chosen character set. If we want to have #utf8Size, that's fine. But #size should be the number of *bytes*, not of characters.
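
As a rough illustration of the character set dependence, the C sketch below stores the same four-character word in Latin-1 and in UTF-8 and counts it both ways. The helper utf8_length and the two constants are hypothetical names, and the helper assumes its input is already valid UTF-8 (it does no validation).

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: count code points in a string assumed to be
   valid UTF-8. Every byte except a continuation byte (10xxxxxx)
   starts a new code point. */
size_t utf8_length(const char *s)
{
    size_t count = 0;
    for (; *s != '\0'; s++) {
        if (((unsigned char)*s & 0xC0) != 0x80)
            count++;
    }
    return count;
}

/* "café" in two encodings, written with explicit byte escapes. */
static const char cafe_latin1[] = "caf\xE9";      /* é is one byte   */
static const char cafe_utf8[]   = "caf\xC3\xA9";  /* é is two bytes  */
```

Here strlen(cafe_latin1) is 4 and strlen(cafe_utf8) is 5, while utf8_length(cafe_utf8) is 4. A byte count is well defined without knowing the encoding; a character count is not.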

I'm looking now at whether I can add an EncodedStream method that extracts Unicode characters. Then what you wanted would be something like


   (EncodedStream wordsOn: 'some string') contents size

for which, of course, we can add a utility method.

Paolo


