Haizhou Zhao created FLINK-27255:
------------------------------------
Summary: Flink-avro does not support serialization and
deserialization of avro schema longer than 65535 characters
Key: FLINK-27255
URL: https://issues.apache.org/jira/browse/FLINK-27255
Project: Flink
Issue Type: Bug
Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
Affects Versions: 1.14.4
Reporter: Haizhou Zhao
The underlying serialization of avro schema uses string serialization method of
ObjectOutputStream.class, however, the default string serialization by
ObjectOutputStream.class does not support handling string of more than 66535
characters (64kb). As a result, constructing flink operators that input/output
Avro Generic Record with huge schema is not possible.
The purposed fix is two change the serialization and deserialization method of
these following classes so that huge string could also be handled.
[GenericRecordAvroTypeInfo|https://github.com/apache/flink/blob/master/flink-formats/flink-avro/src/main/java/org/apache/flink/formats/avro/typeutils/GenericRecordAvroTypeInfo.java#L107]
[SerializableAvroSchema|https://github.com/apache/flink/blob/master/flink-formats/flink-avro/src/main/java/org/apache/flink/formats/avro/typeutils/SerializableAvroSchema.java#L55]
--
This message was sent by Atlassian Jira
(v8.20.1#820001)