[ 
https://issues.apache.org/jira/browse/FLINK-8531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stephan Ewen resolved FLINK-8531.
---------------------------------
    Resolution: Fixed

Fixed as of 4e481a72c1ed3cc5f177b511e5a72cd8726cf976

Consists of steps
  - 99495c91ecce7141ae8b2fbc96492681a9d130bd
  - 35c7d93ee85aa8689e804b713affa65b46af1acc
  - 9903c8c42793b922549835217c586c5928999ea5
  - 5cc50934bdcf80ae1fa69abe69e2f214852653f9
  - bb19e7f5278d43cd4fd265e3d2afa2fcc793ccf5
  - 1887187f6b5c210d2091c69ef14fa8b8a5cae82c
  - fc21423e1f8f1a1661badef20f9c6f368f6daf8b
  - e0b0f45bd9c8b06bd2cda56f6859d0d3944aa00e
  - 4e481a72c1ed3cc5f177b511e5a72cd8726cf976

> Support separation of "Exclusive", "Shared" and "Task owned" state
> ------------------------------------------------------------------
>
>                 Key: FLINK-8531
>                 URL: https://issues.apache.org/jira/browse/FLINK-8531
>             Project: Flink
>          Issue Type: Sub-task
>          Components: State Backends, Checkpointing
>            Reporter: Stephan Ewen
>            Assignee: Stephan Ewen
>            Priority: Major
>             Fix For: 1.5.0
>
>
> Currently, all state created at a certain checkpoint goes into the directory 
> {{chk-id}}.
> With incremental checkpointing, some state is shared across checkpoints and is 
> referenced by newer checkpoints. As a result, old {{chk-id}} directories stay 
> around, containing some shared chunks. That makes it hard for both users and 
> cleanup hooks to determine when a {{chk-x}} directory can be deleted.
> The same holds for state that can only ever be dropped by certain operators 
> on the TaskManager, never by the JobManager / CheckpointCoordinator. Examples 
> of such state are write-ahead logs, which need to be retained until the move 
> to the target system is complete, which may in some cases be later than when 
> the checkpoint that created them is disposed.
> I propose to introduce different scopes for tasks:
>   - **EXCLUSIVE** is for state that belongs to one checkpoint only
>   - **SHARED** is for state that is possibly part of multiple checkpoints
>   - **TASKOWNED** is for state that must never be dropped by the JobManager.
> For file based checkpoint targets, I propose that we have the following 
> directory layout:
> {code}
> /user-defined-checkpoint-dir
>     |
>     + --shared/
>     + --taskowned/
>     + --chk-00001/
>     + --chk-00002/
>     + --chk-00003/
>     ...
> {code}
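
The mapping from scope to directory proposed above could be sketched roughly as follows. This is only an illustration of the layout, not the code from the commits listed: the names StateScope, CheckpointLayoutSketch, directoryFor and deletableWithCheckpoint are hypothetical, and the five-digit zero padding of the chk directories is an assumption read off the example layout.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

/** Hypothetical scope names mirroring the three scopes proposed in the issue. */
enum StateScope { EXCLUSIVE, SHARED, TASKOWNED }

/** Illustrative helper that resolves a scope to a directory; not Flink's API. */
final class CheckpointLayoutSketch {

    private final Path baseDir;

    CheckpointLayoutSketch(Path baseDir) {
        this.baseDir = baseDir;
    }

    /** Returns the directory that state of the given scope would be written to. */
    Path directoryFor(StateScope scope, long checkpointId) {
        switch (scope) {
            case SHARED:
                // One directory for chunks referenced by several checkpoints.
                return baseDir.resolve("shared");
            case TASKOWNED:
                // One directory for state that only tasks may drop.
                return baseDir.resolve("taskowned");
            case EXCLUSIVE:
                // One directory per checkpoint; padding width is an assumption.
                return baseDir.resolve(String.format("chk-%05d", checkpointId));
            default:
                throw new IllegalArgumentException("Unknown scope: " + scope);
        }
    }

    /** Only exclusive state can be removed together with its checkpoint. */
    static boolean deletableWithCheckpoint(StateScope scope) {
        return scope == StateScope.EXCLUSIVE;
    }

    public static void main(String[] args) {
        CheckpointLayoutSketch layout =
                new CheckpointLayoutSketch(Paths.get("user-defined-checkpoint-dir"));
        for (StateScope scope : StateScope.values()) {
            System.out.println(scope + " -> " + layout.directoryFor(scope, 2)
                    + " (deletable with checkpoint: "
                    + deletableWithCheckpoint(scope) + ")");
        }
    }
}
```

With a split like this, a cleanup hook can delete a chk directory as soon as its checkpoint is disposed, because nothing that a newer checkpoint or a task still needs lives inside it.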



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
