> I was of the opinion that UTF8 encoding encodes each character using 8 > bits or a byte. My understanding of UTF8 was clearly wrong. Just did some reading again, it encodes characters in bytes, and can use upto 4 bytes to represent a character.
> if on the stream writer, I add something like: > writer.write(new String(msg.getBytes(), "UTF8").getBytes()) instead of > simply writer.write(msg.getBytes()), I see the characters as expected > on the C++ client. However this I believe messes up with the protobuf > headers, so on C++ I receive only a partial file upto the entry that > contains one such character. Still not sure on the above though. -- You received this message because you are subscribed to the Google Groups "Protocol Buffers" group. To post to this group, send email to protobuf@googlegroups.com. To unsubscribe from this group, send email to protobuf+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/protobuf?hl=en.