> On Oct. 15, 2014, 9:49 p.m., Tobias Weingartner wrote: > > docs/reconciliation.md, lines 60-61 > > <https://reviews.apache.org/r/26669/diff/1/?file=719858#file719858line60> > > > > Does this result in tons of TASK_LOST right after a fail-over?
There will only be TASK_LOST updates sent if the tasks are no longer known. If this occurs, it's because the framework thought the task was non-terminal. The likely sources of this after a failover are tasks that were dropped during the failover (which should be a fairly small amount). > On Oct. 15, 2014, 9:49 p.m., Tobias Weingartner wrote: > > docs/reconciliation.md, line 85 > > <https://reviews.apache.org/r/26669/diff/1/?file=719858#file719858line85> > > > > This may never finish... what is a framework to do if this does not > > finish? It is guaranteed to eventually complete, if it does not complete it is a bug in Mesos. In the case of a serious regression that causes this to never complete, backoff is advised to ensure that this does not overload the system. > On Oct. 15, 2014, 9:49 p.m., Tobias Weingartner wrote: > > docs/reconciliation.md, lines 96-99 > > <https://reviews.apache.org/r/26669/diff/1/?file=719858#file719858line96> > > > > one recon per master/cluster, or per framework? This document is aimed at framework developers, so per framework. Per master/cluster is impossible for a framework to achieve without some form of distributed consensus across frameworks. > On Oct. 15, 2014, 9:49 p.m., Tobias Weingartner wrote: > > docs/reconciliation.md, lines 107-108 > > <https://reviews.apache.org/r/26669/diff/1/?file=719858#file719858line107> > > > > Why do we need both of these? > > > > Does one not imply the other? Almost, the second point captures the fact that a ZK blip does not trigger a disconnection, but it triggers a re-registration. I could elaborate here, but this information is more relevant to a mesos developer or operational engineer, whereas this doc is aimed at framework developers. - Ben ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/26669/#review56818 ----------------------------------------------------------- On Oct. 14, 2014, 12:34 a.m., Ben Mahler wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/26669/ > ----------------------------------------------------------- > > (Updated Oct. 14, 2014, 12:34 a.m.) > > > Review request for mesos, Benjamin Hindman, Niklas Nielsen, and Vinod Kone. > > > Bugs: MESOS-681 > https://issues.apache.org/jira/browse/MESOS-681 > > > Repository: mesos-git > > > Description > ------- > > Please see here for rendered markdown, will be easier to review: > https://gist.github.com/bmahler/18409fc4f052df43f403 > > Please send your high level thoughts :) > > > Diffs > ----- > > docs/reconciliation.md PRE-CREATION > > Diff: https://reviews.apache.org/r/26669/diff/ > > > Testing > ------- > > > N/A > > > Thanks, > > Ben Mahler > >
