----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/56568/#review165219 -----------------------------------------------------------
Ship it! +1 for fewer executor deaths during agent recovery (the agent gives 2 seconds for the executor to respond to the reconnect message before killing the executor). - Joseph Wu On Feb. 10, 2017, 4:45 p.m., Anand Mazumdar wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/56568/ > ----------------------------------------------------------- > > (Updated Feb. 10, 2017, 4:45 p.m.) > > > Review request for mesos and Vinod Kone. > > > Bugs: MESOS-7057 > https://issues.apache.org/jira/browse/MESOS-7057 > > > Repository: mesos > > > Description > ------- > > A relink is needed in cases where a netfilter module like iptables > can terminate the connection without notifying the executor. This > results in the executor still trying to reuse the stale "half-open" > connection upon receiving the reconnect message from the executor > leading to the erroneous behavior. > > > Diffs > ----- > > src/exec/exec.cpp 95c2e19f06c1778bd837247fc96d46d32b53c00b > > Diff: https://reviews.apache.org/r/56568/diff/ > > > Testing > ------- > > Performed the steps to reproduce on MESOS-5332 and verified the problem is > fixed with the relink. > > > Thanks, > > Anand Mazumdar > >
