Benjamin Bannier created MESOS-6800:
---------------------------------------
Summary: IOSwitchBoardTest.KillSwitchboardContainerDestroyed is
flaky
Key: MESOS-6800
URL: https://issues.apache.org/jira/browse/MESOS-6800
Project: Mesos
Issue Type: Bug
Environment: Linux
Reporter: Benjamin Bannier
Saw the test {{IOSwitchBoardTest.KillSwitchboardContainerDestroyed}} fail in
internal CI on a number of Linux platforms,
{noformat}
[02:53:36] : [Step 11/11] [ RUN ]
IOSwitchboardTest.KillSwitchboardContainerDestroyed
[02:53:36] : [Step 11/11] I1215 02:53:36.159004 23129
containerizer.cpp:220] Using isolation: posix/cpu,filesystem/posix,network/cni
[02:53:36] : [Step 11/11] I1215 02:53:36.159701 23146
containerizer.cpp:594] Recovering containerizer
[02:53:36] : [Step 11/11] I1215 02:53:36.160013 23144 provisioner.cpp:253]
Provisioner recovery complete
[02:53:36] : [Step 11/11] I1215 02:53:36.160274 23146
containerizer.cpp:986] Starting container ee8415af-5253-4ba2-9a98-2072af434f0f
for executor 'executor' of framework
[02:53:36] : [Step 11/11] I1215 02:53:36.160823 23150 switchboard.cpp:430]
Allocated pseudo terminal '/dev/pts/0' for container
ee8415af-5253-4ba2-9a98-2072af434f0f
[02:53:36] : [Step 11/11] I1215 02:53:36.160953 23150 switchboard.cpp:567]
Launching 'mesos-io-switchboard' with flags '--heartbeat_interval="30secs"
--help="false"
--socket_address="/tmp/mesos-io-switchboard-a2bf4732-420c-4d91-b5b3-50f65c4db73c"
--stderr_from_fd="25" --stderr_to_fd="2" --stdin_to_fd="25"
--stdout_from_fd="25" --stdout_to_fd="1" --tty="true"
--wait_for_connection="false"' for container
ee8415af-5253-4ba2-9a98-2072af434f0f
[02:53:36] : [Step 11/11] I1215 02:53:36.163383 23150 switchboard.cpp:597]
Created I/O switchboard server (pid: 10711) listening on socket file
'/tmp/mesos-io-switchboard-a2bf4732-420c-4d91-b5b3-50f65c4db73c' for container
ee8415af-5253-4ba2-9a98-2072af434f0f
[02:53:36] : [Step 11/11] I1215 02:53:36.164247 23144
containerizer.cpp:1535] Launching 'mesos-containerizer' with flags
'--help="false" --launch_info="{"command":{"shell":true,"value":"sleep
1000"},"environment":{"variables":[{"name":"MESOS_SANDBOX","value":"\/mnt\/teamcity\/temp\/buildTmp\/IOSwitchboardTest_KillSwitchboardContainerDestroyed_b4902D"}]},"err":{"fd":26,"type":"FD"},"in":{"fd":26,"type":"FD"},"out":{"fd":26,"type":"FD"},"tty_slave_path":"\/dev\/pts\/0","working_directory":"\/mnt\/teamcity\/temp\/buildTmp\/IOSwitchboardTest_KillSwitchboardContainerDestroyed_b4902D"}"
--pipe_read="25" --pipe_write="27"
--runtime_directory="/mnt/teamcity/temp/buildTmp/IOSwitchboardTest_KillSwitchboardContainerDestroyed_OJVRdU/containers/ee8415af-5253-4ba2-9a98-2072af434f0f"
--unshare_namespace_mnt="false"'
[02:53:36] : [Step 11/11] I1215 02:53:36.165638 23144 launcher.cpp:133]
Forked child with pid '10712' for container
'ee8415af-5253-4ba2-9a98-2072af434f0f'
[02:53:36] : [Step 11/11] I1215 02:53:36.165937 23144
containerizer.cpp:1634] Checkpointing container's forked pid 10712 to
'/mnt/teamcity/temp/buildTmp/IOSwitchboardTest_KillSwitchboardContainerDestroyed_Yo7Yoa/meta/slaves/frameworks/executors/executor/runs/ee8415af-5253-4ba2-9a98-2072af434f0f/pids/forked.pid'
[02:53:36] : [Step 11/11] I1215 02:53:36.167196 23148 fetcher.cpp:349]
Starting to fetch URIs for container: ee8415af-5253-4ba2-9a98-2072af434f0f,
directory:
/mnt/teamcity/temp/buildTmp/IOSwitchboardTest_KillSwitchboardContainerDestroyed_b4902D
[02:53:36] : [Step 11/11] E1215 02:53:36.243254 23148 switchboard.cpp:880]
Unexpected termination of I/O switchboard server: 'IOSwitchboard' exited with
signal: Killed for container ee8415af-5253-4ba2-9a98-2072af434f0f
[02:53:36] : [Step 11/11] I1215 02:53:36.243259 23150
containerizer.cpp:2493] Container ee8415af-5253-4ba2-9a98-2072af434f0f has
reached its limit for resource {} and will be terminated
[02:53:36] : [Step 11/11] I1215 02:53:36.243288 23150
containerizer.cpp:2113] Destroying container
ee8415af-5253-4ba2-9a98-2072af434f0f in RUNNING state
[02:53:36] : [Step 11/11] I1215 02:53:36.243319 23150
containerizer.cpp:2476] Container ee8415af-5253-4ba2-9a98-2072af434f0f has
exited
[02:53:36] : [Step 11/11] I1215 02:53:36.243332 23150 launcher.cpp:149]
Asked to destroy container ee8415af-5253-4ba2-9a98-2072af434f0f
[02:53:36] : [Step 11/11] E1215 02:53:36.247699 23145 switchboard.cpp:801]
Failed to remove unix domain socket file
'/tmp/mesos-io-switchboard-a2bf4732-420c-4d91-b5b3-50f65c4db73c' for container
'ee8415af-5253-4ba2-9a98-2072af434f0f': No such file or directory
[02:53:36] : [Step 11/11] I1215 02:53:36.248097 23150 provisioner.cpp:324]
Ignoring destroy request for unknown container
ee8415af-5253-4ba2-9a98-2072af434f0f
[02:53:36] : [Step 11/11]
../../src/tests/containerizer/io_switchboard_tests.cpp:885: Failure
[02:53:36] : [Step 11/11] Expecting WIFSIGNALED(wait.get()->status()) but
WIFEXITED(wait.get()->status()) is true and WEXITSTATUS(wait.get()->status())
is 1
[02:53:36] : [Step 11/11] [ FAILED ]
IOSwitchboardTest.KillSwitchboardContainerDestroyed (100 ms)
{noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)