[ 
https://issues.apache.org/jira/browse/FLINK-33197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17773536#comment-17773536
 ] 

Liu Chong commented on FLINK-33197:
-----------------------------------

we've got the code ready for adding ByteArraySchema to pyflink, would you like 
to give some comments if we should move forward and submit the PR? [~dianfu] 

> PyFlink support for ByteArraySchema
> -----------------------------------
>
>                 Key: FLINK-33197
>                 URL: https://issues.apache.org/jira/browse/FLINK-33197
>             Project: Flink
>          Issue Type: New Feature
>          Components: API / Python
>    Affects Versions: 1.17.0
>            Reporter: Liu Chong
>            Priority: Minor
>
> Currently in Python Flink API, when reading messages from a Kafka source, 
> only SimpleStringSchema is available.
> If the data is in arbitary binary format(e.g. marshalled Protocol Buffer msg) 
> it may not be decodable with the default 'utf-8' encoding. 
> There's currently a workaround which is to manually set the encoding to 
> 'ISO-8859-1' which supports all possible byte combinations. 
> However this is not an elegant solution.
> We should support ByteArraySchema which outputs a raw byte array for 
> subsequent unmarshalling.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to