[
https://issues.apache.org/jira/browse/FLINK-12070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16855659#comment-16855659
]
Piotr Nowojski commented on FLINK-12070:
----------------------------------------
{quote}As for the reason why the machine froze up, I guess it is because that
flushing mmaped region to disk also need memory while no enough pages left.
{quote}
That would be my guess as well, however it might be though to confirm directly.
What [~StephanEwen] proposed as fixups
{quote}1. Directly write to a file and mmap the file, rather than writing to
mmapped region. That way the data should be eagerly persisted, i.e., there is
no I/O needed when memory paged are evicted.
2. Directly write to file and directly read from file.
{quote}
Should solve the issue, since if kernel runs out of memory in that case, it
should be able to immediately drop mmaped pages.
> Make blocking result partitions consumable multiple times
> ---------------------------------------------------------
>
> Key: FLINK-12070
> URL: https://issues.apache.org/jira/browse/FLINK-12070
> Project: Flink
> Issue Type: Improvement
> Components: Runtime / Network
> Affects Versions: 1.9.0
> Reporter: Till Rohrmann
> Assignee: Stephan Ewen
> Priority: Blocker
> Labels: pull-request-available
> Fix For: 1.9.0
>
> Attachments: image-2019-04-18-17-38-24-949.png
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> In order to avoid writing produced results multiple times for multiple
> consumers and in order to speed up batch recoveries, we should make the
> blocking result partitions to be consumable multiple times. At the moment a
> blocking result partition will be released once the consumers has processed
> all data. Instead the result partition should be released once the next
> blocking result has been produced and all consumers of a blocking result
> partition have terminated. Moreover, blocking results should not hold on slot
> resources like network buffers or memory as it is currently the case with
> {{SpillableSubpartitions}}.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)