Benjamin Mahler created MESOS-755:
-------------------------------------

             Summary: Using the soft limit with memory thresholds has different 
OOM semantics than simply setting the hard limit.
                 Key: MESOS-755
                 URL: https://issues.apache.org/jira/browse/MESOS-755
             Project: Mesos
          Issue Type: Bug
    Affects Versions: 0.15.0, 0.14.1
            Reporter: Benjamin Mahler


In order to fix MESOS-662, we enabled the OOM killer.

In addition, instead of setting the memory hard limit (memory.limit_in_bytes), 
we set the soft limit (memory.soft_limit_in_bytes) to the requested amount of 
memory and set the hard limit higher by a fixed amount. Once the soft limit is 
reached, this triggers a memory threshold notification at which point we 
capture the memory.stat information and treat the executor as having OOMed.

We've seen reports from users that this is not behaving the same as simply 
setting the hard limit. In particular, we've seen the file cache not being 
purged by the kernel upon hitting the soft limit (the kernel documentation only 
states that action is taken with the soft limit in the presence of system wide 
memory pressure). However, it was not clear over email and in the review  as to 
the extent of which the hard limit and soft limit are treated differently in 
terms of purging cached memory: https://reviews.apache.org/r/14043/



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to