Coder Evolution

Maximilian Michels Wed, 08 May 2019 11:45:40 -0700

Hi,

I'm looking into updating the Flink Runner to Flink version 1.8. Sinceversion 1.7 Flink has a new optional interface for Coder evolution*.

When a Flink pipeline is checkpointed, CoderSnapshots are written outalongside with the checkpointed data. When the pipeline is restored fromthat checkpoint, the CoderSnapshots are restored and used toreinstantiate the Coders.

Furthermore, there is a compatibility and migration check between theold and the new Coder. This allows to determine whether


 - The serializer did not change or is compatible (ok)
 - The serialization format of the coder changed (ok after migration)
 - The coder needs to be reconfigured and we know how to that based on
   the old version (ok after reconfiguration)
 - The coder is incompatible (error)

I was wondering about the Coder evolution story in Beam. The currentstate is that checkpointed Beam pipelines are only guaranteed to runwith the same Beam version and pipeline version. A newer version ofeither might break the checkpoint format without any way to migrate thestate.


Should we start thinking about supporting Coder evolution in Beam?

Thanks,
Max

* Coders are called TypeSerializers in Flink land. The interface isTypeSerializerSnapshot.

Coder Evolution

Reply via email to