[ https://issues.apache.org/jira/browse/AVRO-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14715559#comment-14715559 ]
ASF GitHub Bot commented on AVRO-1593: -------------------------------------- Github user asfgit closed the pull request at: https://github.com/apache/avro/pull/32 > C++ json encoder assumes "C" locale and generates invalid UTF-8 sequence > ------------------------------------------------------------------------- > > Key: AVRO-1593 > URL: https://issues.apache.org/jira/browse/AVRO-1593 > Project: Avro > Issue Type: Bug > Components: c++ > Affects Versions: 1.7.7 > Environment: windows-1252 encoding > Reporter: Hatem Helal > Priority: Critical > Fix For: 1.7.8 > > > encoding a multibyte UTF-8 code point such as: > "\xEF\xBD\x81" > Incorrectly becomes: > "\xEF\xBD\U0081" > When encoded in the service running in the windows-1252 locale. This isnĀ¹t a > valid UTF-8 sequence so we end up with Mojibake when reading back the JSON > encoded string. -- This message was sent by Atlassian JIRA (v6.3.4#6332)