[
https://issues.apache.org/jira/browse/FLINK-12699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yu Li reassigned FLINK-12699:
-----------------------------
Assignee: PengFei Li (was: Yu Li)
> Reduce CPU consumption when snapshot/restore the spilled key-group
> ------------------------------------------------------------------
>
> Key: FLINK-12699
> URL: https://issues.apache.org/jira/browse/FLINK-12699
> Project: Flink
> Issue Type: Sub-task
> Components: Runtime / State Backends
> Reporter: Yu Li
> Assignee: PengFei Li
> Priority: Major
>
> We need to prevent the unnecessary de/serialization when
> snapshotting/restoring the spilled state key-group. To achieve this, we need
> to:
> 1. Add meta information for {{HeapKeyedStatebackend}} checkpoint on DFS,
> separating the on-heap and on-disk part
> 2. Write the off-heap bytes directly to DFS when checkpointing and mark it as
> on-disk
> 3. Directly write the bytes onto disk when restoring the data back from DFS,
> if it's marked as on-disk
> Notice that we cannot directly use file copy since we use mmap meanwhile
> support copy-on-write.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)