Tzu-Li (Gordon) Tai created FLINK-9377:
------------------------------------------

             Summary: Remove writing serializers as part of the checkpoint meta information
                 Key: FLINK-9377
                 URL: https://issues.apache.org/jira/browse/FLINK-9377
             Project: Flink
          Issue Type: Sub-task
          Components: State Backends, Checkpointing
            Reporter: Tzu-Li (Gordon) Tai
            Assignee: Tzu-Li (Gordon) Tai
             Fix For: 1.6.0


When writing the meta information of a state in savepoints, we currently write 
both the state serializer and the state serializer's configuration snapshot.

Writing both is redundant, as most of the time they contain identical 
information.
Moreover, the fact that we use Java serialization to write the serializer, and 
rely on it being readable again on the restore run, already poses problems for 
serializers such as the {{AvroSerializer}} (see discussion in FLINK-9202).

The proposal here is to keep only the config snapshot as meta information, and 
use it as the single source of truth for the schema of serialized state.
The config snapshot should be treated as a factory (or provided to a factory) 
to re-create serializers capable of reading old, serialized state.
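
To illustrate the "config snapshot as a factory" idea, here is a minimal, 
self-contained sketch. It is not Flink's actual API: the {{Serializer}} and 
{{ConfigSnapshot}} interfaces, the {{restoreSerializer()}} method, and the 
charset-configured string serializer are all hypothetical names chosen for the 
example. The point is only that the snapshot is written as plain data (no Java 
serialization of the serializer itself), and that the restore side rebuilds a 
compatible serializer from that data alone.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

public class SnapshotAsFactorySketch {

    /** Hypothetical minimal serializer interface (not Flink's TypeSerializer). */
    interface Serializer<T> {
        void serialize(T value, DataOutputStream out) throws IOException;
        T deserialize(DataInputStream in) throws IOException;
        ConfigSnapshot<T> snapshotConfiguration();
    }

    /** Hypothetical config snapshot: written as plain data, doubles as a factory. */
    interface ConfigSnapshot<T> {
        void write(DataOutputStream out) throws IOException;
        Serializer<T> restoreSerializer();
    }

    /** Example serializer whose only configuration is a charset name. */
    static final class StringSerializer implements Serializer<String> {
        private final String charsetName;

        StringSerializer(String charsetName) {
            this.charsetName = charsetName;
        }

        @Override
        public void serialize(String value, DataOutputStream out) throws IOException {
            byte[] bytes = value.getBytes(charsetName);
            out.writeInt(bytes.length);
            out.write(bytes);
        }

        @Override
        public String deserialize(DataInputStream in) throws IOException {
            byte[] bytes = new byte[in.readInt()];
            in.readFully(bytes);
            return new String(bytes, charsetName);
        }

        @Override
        public ConfigSnapshot<String> snapshotConfiguration() {
            return new StringSerializerSnapshot(charsetName);
        }
    }

    /** Snapshot holding everything needed to re-create a StringSerializer. */
    static final class StringSerializerSnapshot implements ConfigSnapshot<String> {
        private final String charsetName;

        StringSerializerSnapshot(String charsetName) {
            this.charsetName = charsetName;
        }

        @Override
        public void write(DataOutputStream out) throws IOException {
            out.writeUTF(charsetName);
        }

        static StringSerializerSnapshot read(DataInputStream in) throws IOException {
            return new StringSerializerSnapshot(in.readUTF());
        }

        @Override
        public Serializer<String> restoreSerializer() {
            return new StringSerializer(charsetName);
        }
    }

    /** Writes snapshot + state, then restores using only the snapshot. */
    static String roundTrip(String value, String charsetName) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            DataOutputStream out = new DataOutputStream(buffer);
            Serializer<String> original = new StringSerializer(charsetName);
            original.snapshotConfiguration().write(out); // meta info: snapshot only
            original.serialize(value, out);              // the state itself
            out.flush();

            DataInputStream in =
                new DataInputStream(new ByteArrayInputStream(buffer.toByteArray()));
            // The restore run never sees the original serializer instance.
            Serializer<String> restored = StringSerializerSnapshot.read(in).restoreSerializer();
            return restored.deserialize(in);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(roundTrip("hello state", "UTF-8"));
    }
}
```

In this sketch the reader has to know which snapshot class to read; a real 
design would also need to record that (for example a class name or type tag) 
alongside the snapshot data, which is left out here.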



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
