[ 
https://issues.apache.org/jira/browse/MESOS-3423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Mahler reassigned MESOS-3423:
--------------------------------------

    Assignee: Benjamin Mahler  (was: Cong Wang)

I'll take this.

> Perf event isolator stops performing sampling if a single timeout occurs.
> -------------------------------------------------------------------------
>
>                 Key: MESOS-3423
>                 URL: https://issues.apache.org/jira/browse/MESOS-3423
>             Project: Mesos
>          Issue Type: Bug
>          Components: isolation, slave
>    Affects Versions: 0.24.0
>            Reporter: Vinod Kone
>            Assignee: Benjamin Mahler
>              Labels: twitter
>
> Currently the perf event isolator times out a sample after a fixed extra time 
> of 2 seconds on top of the sample time elapses:
> {code}
>     Duration timeout = flags.perf_duration + Seconds(2);
> {code}
> This should be based on the reap interval maximum.
> Also, the code stops sampling altogether when a single timeout occurs. We've 
> observed time outs during normal operation, so it would be better for the 
> isolator to continue performing perf sampling in the case of timeouts. It may 
> also make sense to continue sampling in the case of errors, since these may 
> be transient.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to