[
https://issues.apache.org/jira/browse/FLINK-12699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17323261#comment-17323261
]
Flink Jira Bot commented on FLINK-12699:
----------------------------------------
This issue is assigned but has not received an update in 7 days so it has been
labeled "stale-assigned". If you are still working on the issue, please give an
update and remove the label. If you are no longer working on the issue, please
unassign so someone else may work on it. In 7 days the issue will be
automatically unassigned.
> Reduce CPU consumption when snapshot/restore the spilled key-group
> ------------------------------------------------------------------
>
> Key: FLINK-12699
> URL: https://issues.apache.org/jira/browse/FLINK-12699
> Project: Flink
> Issue Type: Sub-task
> Components: Runtime / State Backends
> Reporter: Yu Li
> Assignee: PengFei Li
> Priority: Major
> Labels: stale-assigned
>
> We need to avoid unnecessary de/serialization when
> snapshotting/restoring a spilled state key-group. To achieve this, we need
> to:
> 1. Add meta information to the {{HeapKeyedStateBackend}} checkpoint on DFS,
> separating the on-heap and on-disk parts
> 2. Write the off-heap bytes directly to DFS when checkpointing and mark them
> as on-disk
> 3. Write the bytes directly to disk when restoring the data from DFS, if
> they are marked as on-disk
> Note that we cannot simply copy the file, because we use mmap while also
> supporting copy-on-write.
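
The three steps in the description could be sketched roughly as follows. This
is only an illustration under stated assumptions, not Flink's actual checkpoint
format: the class name, the marker bytes, and the record layout (one marker
byte, a 4-byte length, then the raw payload) are all hypothetical.

```java
import java.nio.ByteBuffer;

/** Hypothetical sketch of a per-key-group record; not Flink's real format. */
final class KeyGroupSnapshotSketch {
    // Assumed marker values stored in the per-key-group meta information (step 1).
    static final byte ON_HEAP = 0;
    static final byte ON_DISK = 1;

    /**
     * Steps 1 and 2: prefix the payload with a location marker and its length.
     * For an ON_DISK key-group the payload is the already-serialized bytes,
     * so they are copied as-is with no de/serialization.
     */
    static byte[] writeKeyGroup(byte marker, byte[] payload) {
        ByteBuffer buf = ByteBuffer.allocate(1 + Integer.BYTES + payload.length);
        buf.put(marker).putInt(payload.length).put(payload);
        return buf.array();
    }

    /** Step 3 (check): tells the restore path whether the raw bytes can go straight to disk. */
    static boolean isOnDisk(byte[] record) {
        return record[0] == ON_DISK;
    }

    /** Step 3 (read): returns the raw payload without interpreting it. */
    static byte[] readPayload(byte[] record) {
        ByteBuffer buf = ByteBuffer.wrap(record);
        buf.get();                      // skip the marker byte
        byte[] payload = new byte[buf.getInt()];
        buf.get(payload);
        return payload;
    }
}
```

On restore, a caller would check isOnDisk(record) first and, if it returns
true, write readPayload(record) to the local spill file unchanged; only
records marked ON_HEAP would go through the usual deserialization path.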
--
This message was sent by Atlassian Jira
(v8.3.4#803005)