[
https://issues.apache.org/jira/browse/BEAM-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Heejong Lee updated BEAM-7008:
------------------------------
Component/s: (was: sdk-py-core)
> adding UTF8 String coder to Java SDK ModelCoders
> ------------------------------------------------
>
> Key: BEAM-7008
> URL: https://issues.apache.org/jira/browse/BEAM-7008
> Project: Beam
> Issue Type: Bug
> Components: sdk-java-core
> Reporter: Heejong Lee
> Assignee: Heejong Lee
> Priority: Major
> Time Spent: 1.5h
> Remaining Estimate: 0h
>
> It looks like UTF-8 String Coder in Java and Python SDKs uses different
> encoding schemes. StringUtf8Coder in Java SDK puts the varint length of the
> input string before actual data bytes however StrUtf8Coder in Python SDK
> directly encodes the input string to bytes value. We should unify the
> encoding schemes of UTF8 strings across the different SDKs and make it a
> standard coder.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)