RE: My Querry

Addison Phillips [wM] Tue, 23 Nov 2004 10:01:24 -0800

If you are writing a C program, then the null character can be used to indicate 
the end of a string.


One of the nice things about UTF-8 is that the ASCII bytes from 0 to 7F hex 
(including the C0 control characters from \x00 through \x01f---including NULL) 
represent the ASCII characters from 0 to 7F hex. That is, amoung other things 
UTF-8 was designed specifically to be compatible with C language strings.

Addison

Addison P. Phillips
Director, Globalization Architecture
http://www.webMethods.com

Chair, W3C Internationalization Working Group
http://www.w3.org/International

Internationalization is an architecture. 
It is not a feature.

> -----Original Message-----
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] Behalf Of Harshal Trivedi
> Sent: 2004å11æ23æ 3:42
> To: [EMAIL PROTECTED]
> Subject: My Querry
> 
> 
> How can i make sure that UTF-8 format string has terminated while
> encoding it, as compared to C program string which ends with '\0'
> (NULL) character?
> 
> -> Is there any special symbol or procedure to determine end of UTF-8
> string OR just ASCII NULL '\0' is used as it is to indicate that.
> 
> -- 
> Harshal P. Trivedi
> Software Engineer

RE: My Querry

Reply via email to