John created FLINK-12304:
----------------------------
Summary: AvroInputFormat should support schema evolution
Key: FLINK-12304
URL: https://issues.apache.org/jira/browse/FLINK-12304
Project: Flink
Issue Type: Bug
Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
Affects Versions: 1.8.0
Reporter: John
From the Avro spec:
_A reader of Avro data, whether from an RPC or a file, can always parse that
data because its schema is provided. But that schema may not be exactly the
schema that was expected. For example, if the data was written with a different
version of the software than it is read, then records may have had fields added
or removed._
The AvroInputFormat should allow the application to supply a reader's schema to
support cases where data was written with an old version of a schema and needs
to be read with a newer version. The reader's schema can have additional fields
with defaults so that records written with the old schema can be adapted to the
new one. The underlying Avro Java library already supports schema resolution, so
adding support in AvroInputFormat should be straightforward.
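
To illustrate the resolution rule described above, here is a minimal sketch in
plain Java. It is a toy model, not the Avro API and not a proposed
AvroInputFormat change: the class SchemaResolutionSketch, the method resolve and
the field names are all hypothetical, and a schema is reduced to a map from
field name to optional default. In the Avro Java library itself this work is
done by datum readers such as GenericDatumReader, which can be constructed with
both a writer's and a reader's schema.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

// Toy model of Avro-style record resolution (hypothetical, not the Avro API).
class SchemaResolutionSketch {

    // Adapts a record written with an older schema to the reader's schema.
    // readerFields maps each reader field name to its default, if it has one.
    static Map<String, Object> resolve(Map<String, Object> written,
                                       Map<String, Optional<Object>> readerFields) {
        Map<String, Object> result = new LinkedHashMap<>();
        for (Map.Entry<String, Optional<Object>> field : readerFields.entrySet()) {
            String name = field.getKey();
            if (written.containsKey(name)) {
                // Field exists in the written data: fields are matched by name.
                result.put(name, written.get(name));
            } else if (field.getValue().isPresent()) {
                // Field was added in the newer schema: fall back to its default.
                result.put(name, field.getValue().get());
            } else {
                // New field without a default cannot be resolved.
                throw new IllegalArgumentException(
                        "no default for missing field: " + name);
            }
        }
        // Fields present only in the written record are ignored.
        return result;
    }

    public static void main(String[] args) {
        // Record written with the old schema (has a field the reader dropped).
        Map<String, Object> written = new LinkedHashMap<>();
        written.put("name", "alice");
        written.put("legacyId", 7);

        // Reader's schema: "name" has no default, "country" is new with a default.
        Map<String, Optional<Object>> readerFields = new LinkedHashMap<>();
        readerFields.put("name", Optional.empty());
        readerFields.put("country", Optional.of("unknown"));

        System.out.println(resolve(written, readerFields));
        // prints {name=alice, country=unknown}
    }
}
```

The sketch covers only added and removed fields matched by name; the Avro
specification's resolution rules also cover cases such as type promotion and
field aliases, which are left out here.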
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)