[
https://issues.apache.org/jira/browse/MESOS-6784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746259#comment-15746259
]
Jie Yu commented on MESOS-6784:
-------------------------------
Another data point with different log:
{noformat}
[20:16:40] : [Step 11/11] [ RUN ]
IOSwitchboardTest.KillSwitchboardContainerDestroyed
[20:16:40] : [Step 11/11] I1213 20:16:40.465116 26604
containerizer.cpp:220] Using isolation: posix/cpu,filesystem/posix,network/cni
[20:16:40] : [Step 11/11] I1213 20:16:40.465904 26624
containerizer.cpp:594] Recovering containerizer
[20:16:40] : [Step 11/11] I1213 20:16:40.466194 26623 provisioner.cpp:253]
Provisioner recovery complete
[20:16:40] : [Step 11/11] I1213 20:16:40.466544 26625
containerizer.cpp:986] Starting container 8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b
for executor 'executor' of framework
[20:16:40] : [Step 11/11] I1213 20:16:40.467177 26622 switchboard.cpp:430]
Allocated pseudo terminal '/dev/pts/0' for container
8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b
[20:16:40] : [Step 11/11] I1213 20:16:40.467337 26622 switchboard.cpp:567]
Launching 'mesos-io-switchboard' with flags '--heartbeat_interval="30secs"
--help="false"
--socket_address="/tmp/mesos-io-switchboard-2b923443-e3d2-4513-9442-31536611ea28"
--stderr_from_fd="13" --stderr_to_fd="2" --stdin_to_fd="13"
--stdout_from_fd="13" --stdout_to_fd="1" --tty="true"
--wait_for_connection="false"' for container
8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b
[20:16:40] : [Step 11/11] I1213 20:16:40.469470 26622 switchboard.cpp:597]
Created I/O switchboard server (pid: 11965) listening on socket file
'/tmp/mesos-io-switchboard-2b923443-e3d2-4513-9442-31536611ea28' for container
8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b
[20:16:40] : [Step 11/11] I1213 20:16:40.470201 26619
containerizer.cpp:1535] Launching 'mesos-containerizer' with flags
'--help="false" --launch_info="{"command":{"shell":true,"value":"sleep
1000"},"environment":{"variables":[{"name":"MESOS_SANDBOX","value":"\/mnt\/teamcity\/temp\/buildTmp\/IOSwitchboardTest_KillSwitchboardContainerDestroyed_gYP6xR"}]},"err":{"fd":14,"type":"FD"},"in":{"fd":14,"type":"FD"},"out":{"fd":14,"type":"FD"},"tty_slave_path":"\/dev\/pts\/0","working_directory":"\/mnt\/teamcity\/temp\/buildTmp\/IOSwitchboardTest_KillSwitchboardContainerDestroyed_gYP6xR"}"
--pipe_read="13" --pipe_write="15"
--runtime_directory="/mnt/teamcity/temp/buildTmp/IOSwitchboardTest_KillSwitchboardContainerDestroyed_3d8l0O/containers/8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b"
--unshare_namespace_mnt="false"'
[20:16:40] : [Step 11/11] I1213 20:16:40.471458 26619 launcher.cpp:133]
Forked child with pid '11966' for container
'8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b'
[20:16:40] : [Step 11/11] I1213 20:16:40.471776 26619
containerizer.cpp:1634] Checkpointing container's forked pid 11966 to
'/mnt/teamcity/temp/buildTmp/IOSwitchboardTest_KillSwitchboardContainerDestroyed_fdQTsM/meta/slaves/frameworks/executors/executor/runs/8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b/pids/forked.pid'
[20:16:40] : [Step 11/11] I1213 20:16:40.472676 26621 fetcher.cpp:349]
Starting to fetch URIs for container: 8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b,
directory:
/mnt/teamcity/temp/buildTmp/IOSwitchboardTest_KillSwitchboardContainerDestroyed_gYP6xR
[20:16:40] : [Step 11/11] E1213 20:16:40.547350 26625 switchboard.cpp:880]
Unexpected termination of I/O switchboard server: 'IOSwitchboard' exited with
signal: Killed for container 8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b
[20:16:40] : [Step 11/11] I1213 20:16:40.547364 26620
containerizer.cpp:2493] Container 8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b has
reached its limit for resource {} and will be terminated
[20:16:40] : [Step 11/11] I1213 20:16:40.547385 26620
containerizer.cpp:2113] Destroying container
8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b in RUNNING state
[20:16:40] : [Step 11/11] I1213 20:16:40.547490 26620 launcher.cpp:149]
Asked to destroy container 8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b
[20:16:40] : [Step 11/11] I1213 20:16:40.552752 26620
containerizer.cpp:2476] Container 8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b has
exited
[20:16:40] : [Step 11/11] E1213 20:16:40.553004 26624 switchboard.cpp:801]
Failed to remove unix domain socket file
'/tmp/mesos-io-switchboard-2b923443-e3d2-4513-9442-31536611ea28' for container
'8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b': No such file or directory
[20:16:40] : [Step 11/11] I1213 20:16:40.553323 26624 provisioner.cpp:324]
Ignoring destroy request for unknown container
8fc415c0-74cc-4c13-8bcf-1f3dfc7a993b
[20:16:40] : [Step 11/11]
../../src/tests/containerizer/io_switchboard_tests.cpp:668: Failure
[20:16:40] : [Step 11/11] Expecting WIFSIGNALED(wait.get()->status()) but
WIFEXITED(wait.get()->status()) is true and WEXITSTATUS(wait.get()->status())
is 1
[20:16:40] : [Step 11/11] [ FAILED ]
IOSwitchboardTest.KillSwitchboardContainerDestroyed (100 ms)
{noformat}
> IOSwitchboardTest.KillSwitchboardContainerDestroyed is flaky
> ------------------------------------------------------------
>
> Key: MESOS-6784
> URL: https://issues.apache.org/jira/browse/MESOS-6784
> Project: Mesos
> Issue Type: Bug
> Components: agent
> Reporter: Neil Conway
> Assignee: Kevin Klues
> Labels: mesosphere
>
> {noformat}
> [ RUN ] IOSwitchboardTest.KillSwitchboardContainerDestroyed
> I1212 13:57:02.641043 2211 containerizer.cpp:220] Using isolation:
> posix/cpu,filesystem/posix,network/cni
> W1212 13:57:02.641438 2211 backend.cpp:76] Failed to create 'overlay'
> backend: OverlayBackend requires root privileges, but is running as user nrc
> W1212 13:57:02.641559 2211 backend.cpp:76] Failed to create 'bind' backend:
> BindBackend requires root privileges
> I1212 13:57:02.642822 2268 containerizer.cpp:594] Recovering containerizer
> I1212 13:57:02.643975 2253 provisioner.cpp:253] Provisioner recovery complete
> I1212 13:57:02.644953 2255 containerizer.cpp:986] Starting container
> 09e87380-00ab-4987-83c9-fa1c5d86717f for executor 'executor' of framework
> I1212 13:57:02.647004 2245 switchboard.cpp:430] Allocated pseudo terminal
> '/dev/pts/54' for container 09e87380-00ab-4987-83c9-fa1c5d86717f
> I1212 13:57:02.652305 2245 switchboard.cpp:596] Created I/O switchboard
> server (pid: 2705) listening on socket file
> '/tmp/mesos-io-switchboard-b4af1c92-6633-44f3-9d35-e0e36edaf70a' for
> container 09e87380-00ab-4987-83c9-fa1c5d86717f
> I1212 13:57:02.655513 2267 launcher.cpp:133] Forked child with pid '2706'
> for container '09e87380-00ab-4987-83c9-fa1c5d86717f'
> I1212 13:57:02.655732 2267 containerizer.cpp:1621] Checkpointing container's
> forked pid 2706 to
> '/tmp/IOSwitchboardTest_KillSwitchboardContainerDestroyed_Me5CRx/meta/slaves/frameworks/executors/executor/runs/09e87380-00ab-4987-83c9-fa1c5d86717f/pids/forked.pid'
> I1212 13:57:02.726306 2265 containerizer.cpp:2463] Container
> 09e87380-00ab-4987-83c9-fa1c5d86717f has exited
> I1212 13:57:02.726352 2265 containerizer.cpp:2100] Destroying container
> 09e87380-00ab-4987-83c9-fa1c5d86717f in RUNNING state
> E1212 13:57:02.726495 2243 switchboard.cpp:861] Unexpected termination of
> I/O switchboard server: 'IOSwitchboard' exited with signal: Killed for
> container 09e87380-00ab-4987-83c9-fa1c5d86717f
> I1212 13:57:02.726563 2265 launcher.cpp:149] Asked to destroy container
> 09e87380-00ab-4987-83c9-fa1c5d86717f
> E1212 13:57:02.783607 2228 switchboard.cpp:799] Failed to remove unix domain
> socket file '/tmp/mesos-io-switchboard-b4af1c92-6633-44f3-9d35-e0e36edaf70a'
> for container '09e87380-00ab-4987-83c9-fa1c5d86717f': No such file or
> directory
> ../../mesos/src/tests/containerizer/io_switchboard_tests.cpp:661: Failure
> Value of: wait.get()->reasons().size() == 1
> Actual: false
> Expected: true
> *** Aborted at 1481579822 (unix time) try "date -d @1481579822" if you are
> using GNU date ***
> PC: @ 0x1bf16d0 testing::UnitTest::AddTestPartResult()
> *** SIGSEGV (@0x0) received by PID 2211 (TID 0x7faed7d078c0) from PID 0;
> stack trace: ***
> @ 0x7faecf855100 (unknown)
> @ 0x1bf16d0 testing::UnitTest::AddTestPartResult()
> @ 0x1be6247 testing::internal::AssertHelper::operator=()
> @ 0x19ed751
> mesos::internal::tests::IOSwitchboardTest_KillSwitchboardContainerDestroyed_Test::TestBody()
> @ 0x1c0ed8c
> testing::internal::HandleSehExceptionsInMethodIfSupported<>()
> @ 0x1c09e74
> testing::internal::HandleExceptionsInMethodIfSupported<>()
> @ 0x1beb505 testing::Test::Run()
> @ 0x1bebc88 testing::TestInfo::Run()
> @ 0x1bec2ce testing::TestCase::Run()
> @ 0x1bf2ba8 testing::internal::UnitTestImpl::RunAllTests()
> @ 0x1c0f9b1
> testing::internal::HandleSehExceptionsInMethodIfSupported<>()
> @ 0x1c0a9f2
> testing::internal::HandleExceptionsInMethodIfSupported<>()
> @ 0x1bf18ee testing::UnitTest::Run()
> @ 0x11bc9e3 RUN_ALL_TESTS()
> @ 0x11bc599 main
> @ 0x7faece663b15 __libc_start_main
> @ 0xa9c219 (unknown)
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)