If you are writing a C program, then the null character can be used to indicate the end of a string.
One of the nice things about UTF-8 is that the ASCII bytes from 0 to 7F hex (including the C0 control characters from \x00 through \x01f---including NULL) represent the ASCII characters from 0 to 7F hex. That is, amoung other things UTF-8 was designed specifically to be compatible with C language strings. Addison Addison P. Phillips Director, Globalization Architecture http://www.webMethods.com Chair, W3C Internationalization Working Group http://www.w3.org/International Internationalization is an architecture. It is not a feature. > -----Original Message----- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] Behalf Of Harshal Trivedi > Sent: 2004å11æ23æ 3:42 > To: [EMAIL PROTECTED] > Subject: My Querry > > > How can i make sure that UTF-8 format string has terminated while > encoding it, as compared to C program string which ends with '\0' > (NULL) character? > > -> Is there any special symbol or procedure to determine end of UTF-8 > string OR just ASCII NULL '\0' is used as it is to indicate that. > > -- > Harshal P. Trivedi > Software Engineer