Daniel B. wrote:
Marcel Ruff wrote:
...
As UTF-8 may not contain '\0' ...

Yes it can.

Are you thinking of Java's _modified_ version of UTF-8
(http://en.wikipedia.org/wiki/UTF-8#Java)?
Oi oi oi, this complicates things again.

1. Serializing UTF-8 in Java over a socket and reading it in C/C++ as UTF-8 could make problems?
  -> Is there a Java-UTF-8-standard conversion utility?

2. Using C UTF-8: When/how can it happen that a char* contains a '\0' which is a character instead of
   the end of a char* ?

thanks for some enlightment,

Marcel

Daniel


--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to