[ https://issues.apache.org/jira/browse/AVRO-3860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17835380#comment-17835380 ]
ASF subversion and git services commented on AVRO-3860:
-------------------------------------------------------
Commit 00afbaeda36db48cf48ec1c5faad1736add9e8f2 in avro's branch
refs/heads/avro-3631/fix-fixed-serialization from Thiruvalluvan M. G.
[ https://gitbox.apache.org/repos/asf?p=avro.git;h=00afbaeda ]
AVRO-3860: Fix for wrong encoding of Unicode values above 0xffff (#2831)
* Fix for wrong encoding of Unicode values above 0xffff
* More appropriate error message with wrong Unicode escapes
* Fixed a subtle bug in detecting surrogate codes of UTF-16 encoding
* Fixed a bug that allowed prohibited Unicode values
---------
Co-authored-by: Thiruvalluvan M G <[email protected]>
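
For readers unfamiliar with the UTF-16 details behind these fixes, here is a minimal sketch of surrogate detection and pair combination. The helper names (isHighSurrogate, isLowSurrogate, combineSurrogates, isValidScalar) are hypothetical and are not taken from the Avro C++ sources. It also assumes that "prohibited" values means lone surrogates and values above U+10FFFF, which the commit message does not spell out.

```cpp
#include <cstdint>
#include <stdexcept>

// Hypothetical helpers for illustration only; not the Avro C++ implementation.

// High (leading) surrogates occupy U+D800..U+DBFF.
inline bool isHighSurrogate(std::uint32_t cp) {
    return cp >= 0xD800 && cp <= 0xDBFF;
}

// Low (trailing) surrogates occupy U+DC00..U+DFFF.
inline bool isLowSurrogate(std::uint32_t cp) {
    return cp >= 0xDC00 && cp <= 0xDFFF;
}

// Combine a high/low surrogate pair (as written in JSON via two \uXXXX
// escapes) into a single code point in the range U+10000..U+10FFFF.
inline std::uint32_t combineSurrogates(std::uint32_t high, std::uint32_t low) {
    if (!isHighSurrogate(high) || !isLowSurrogate(low)) {
        throw std::invalid_argument("invalid UTF-16 surrogate pair");
    }
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00);
}

// Assumption: a value is acceptable only if it is a Unicode scalar value,
// i.e. at most U+10FFFF and not itself a surrogate.
inline bool isValidScalar(std::uint32_t cp) {
    return cp <= 0x10FFFF && !(cp >= 0xD800 && cp <= 0xDFFF);
}
```

With these definitions, the pair D800/DC00 combines to U+10000, the first code point above U+FFFF.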
> C++ json fails to handle unicode > U+ FFFF
> ------------------------------------------
>
> Key: AVRO-3860
> URL: https://issues.apache.org/jira/browse/AVRO-3860
> Project: Apache Avro
> Issue Type: Bug
> Components: c++
> Affects Versions: 1.11.2
> Reporter: Pietro Cerutti
> Assignee: Thiruvalluvan M. G.
> Priority: Major
> Labels: pull-request-available
> Time Spent: 0.5h
> Remaining Estimate: 0h
>
> As a follow-up to AVRO-1190, would it be possible to fix the handling of
> code points above U+FFFF?
> I think a reasonable test case would be to add this line
> [here:|https://github.com/apache/avro/blob/315f28d636c87eace9a6d6310de78710e1d1f85a/lang/c%2B%2B/test/JsonTests.cc#L70]
> {code:java}
> {R"("\U00010000")", EntityType::String, "\xF0\x90\x80\x80",
> R"("\U00010000")"},
> {code}
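
The expected bytes in the proposed test case, "\xF0\x90\x80\x80", are the four-byte UTF-8 encoding of U+10000. A minimal sketch of such an encoder is shown below to make that arithmetic checkable. The function name encodeUtf8 is hypothetical, and this is not the code from the Avro JSON implementation.

```cpp
#include <cstdint>
#include <stdexcept>
#include <string>

// Hypothetical helper for illustration only: encode one Unicode scalar
// value as UTF-8. Surrogates and values above U+10FFFF are rejected.
inline std::string encodeUtf8(std::uint32_t cp) {
    std::string out;
    if (cp < 0x80) {
        // 1 byte: 0xxxxxxx
        out += static_cast<char>(cp);
    } else if (cp < 0x800) {
        // 2 bytes: 110xxxxx 10xxxxxx
        out += static_cast<char>(0xC0 | (cp >> 6));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else if (cp < 0x10000) {
        if (cp >= 0xD800 && cp <= 0xDFFF) {
            throw std::invalid_argument("lone surrogate is not encodable");
        }
        // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        out += static_cast<char>(0xE0 | (cp >> 12));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else if (cp <= 0x10FFFF) {
        // 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
        out += static_cast<char>(0xF0 | (cp >> 18));
        out += static_cast<char>(0x80 | ((cp >> 12) & 0x3F));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else {
        throw std::invalid_argument("code point above U+10FFFF");
    }
    return out;
}
```

For U+10000 the four-byte branch is taken: the top three bits are zero (0xF0), the next six bits are 010000 (0x90), and the remaining twelve bits are zero (0x80 0x80), which matches the bytes in the test case.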