> I was of the opinion that UTF8 encoding encodes each character using 8
> bits or a byte.
My understanding of UTF8 was clearly wrong. Just did some reading
again, it encodes characters in bytes, and can use upto 4 bytes to
represent a character.

> if on the stream writer, I add something like:
> writer.write(new String(msg.getBytes(), "UTF8").getBytes()) instead of
> simply writer.write(msg.getBytes()), I see the characters as expected
> on the C++ client. However this I believe messes up with the protobuf
> headers, so on C++ I receive only a partial file upto the entry that
> contains one such character.

Still not sure on the above though.

You received this message because you are subscribed to the Google Groups 
"Protocol Buffers" group.
To post to this group, send email to protobuf@googlegroups.com.
To unsubscribe from this group, send email to 
For more options, visit this group at 

Reply via email to