> I was of the opinion that UTF8 encoding encodes each character using 8
> bits or a byte.
My understanding of UTF-8 was clearly wrong. I just did some more
reading: it is a variable-length encoding that uses one to four bytes
to represent each character (code point).
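
To make the variable length concrete, here is a small sketch that prints how many bytes a few sample characters take in UTF-8. The class and method names (Utf8Lengths, utf8Length) are just names I picked for illustration, and the sample characters are written as Unicode escapes so that the source file's own encoding does not matter.

```java
import java.nio.charset.StandardCharsets;

public class Utf8Lengths {
    // Number of bytes the given string occupies once encoded as UTF-8.
    static int utf8Length(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    public static void main(String[] args) {
        // ASCII letter, e-acute, euro sign, and an emoji outside the BMP
        // (the emoji is a surrogate pair in Java, but a single code point).
        String[] samples = { "A", "\u00e9", "\u20ac", "\uD83D\uDE00" };
        for (String s : samples) {
            System.out.printf("U+%04X -> %d byte(s)%n",
                    s.codePointAt(0), utf8Length(s));
        }
    }
}
```

Note that the charset is passed explicitly. A bare getBytes() uses the platform default charset, which may or may not be UTF-8 on a given machine.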

> if on the stream writer, I add something like:
> writer.write(new String(msg.getBytes(), "UTF8").getBytes()) instead of
> simply writer.write(msg.getBytes()), I see the characters as expected
> on the C++ client. However this I believe messes up with the protobuf
> headers, so on C++ I receive only a partial file upto the entry that
> contains one such character.

I am still not sure about the above, though.
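
One possible explanation, which I have not verified against the actual code: if the bytes being written are serialized protobuf output (an assumption on my part), then passing them through new String(..., "UTF8") and back is not a lossless operation. Bytes that do not form valid UTF-8 get replaced during decoding, so the re-encoded array can differ in both content and length, which would confuse a reader that expects the original framing. The sketch below uses a hypothetical three-byte payload and a helper name (roundTrip) of my own choosing.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class RoundTripDemo {
    // Decode the bytes as UTF-8 text, then encode the text back to UTF-8.
    // This is only an identity operation when the input is valid UTF-8.
    static byte[] roundTrip(byte[] raw) {
        String decoded = new String(raw, StandardCharsets.UTF_8);
        return decoded.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Hypothetical binary payload: 0x96 on its own is a UTF-8
        // continuation byte, so it is not valid text.
        byte[] payload = { 0x08, (byte) 0x96, 0x01 };
        byte[] mangled = roundTrip(payload);

        System.out.println("original: " + Arrays.toString(payload));
        System.out.println("mangled:  " + Arrays.toString(mangled));
        System.out.println("same bytes? " + Arrays.equals(payload, mangled));
    }
}
```

If that is what is happening, the usual approach is to write the serialized bytes to the stream untouched and to treat only the string fields inside the message as UTF-8 text.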
