[
https://issues.apache.org/jira/browse/MESOS-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14543399#comment-14543399
]
Tom Arnfeld commented on MESOS-1257:
------------------------------------
Hey guys. I can see the most recent review is still waiting to be
rebased/updated – is this being worked on? I assume not as that was >6 months
go :-)
We've run into some very quirky external containerizer slave recovery bugs (or
a single bug) which causes the slave to lock up completely while performing
recovery (it just never registers with the master)... not sure if this patch
would contribute to fixing that.
> ExternalContainerizerTest.Launch is flaky
> -----------------------------------------
>
> Key: MESOS-1257
> URL: https://issues.apache.org/jira/browse/MESOS-1257
> Project: Mesos
> Issue Type: Bug
> Reporter: Jie Yu
> Assignee: Till Toenshoff
> Labels: flaky
>
> {noformat}
> [----------] 1 test from ExternalContainerizerTest
> [ RUN ] ExternalContainerizerTest.Launch
> 2014-04-28
> 17:04:14,345:44634(0x7f94deffd700):ZOO_ERROR@handle_socket_error_msg@1697:
> Socket [127.0.0.1:52535] zk retcode=-4, errno=111(Connection refused): server
> refused to accept the client
> 2014-04-28
> 17:04:17,681:44634(0x7f94deffd700):ZOO_ERROR@handle_socket_error_msg@1697:
> Socket [127.0.0.1:52535] zk retcode=-4, errno=111(Connection refused): server
> refused to accept the client
> 2014-04-28
> 17:04:21,018:44634(0x7f94deffd700):ZOO_ERROR@handle_socket_error_msg@1697:
> Socket [127.0.0.1:52535] zk retcode=-4, errno=111(Connection refused): server
> refused to accept the client
> ../../../mesos/src/tests/external_containerizer_test.cpp:202: Failure
> Failed to wait 10secs for status
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)