[
https://issues.apache.org/jira/browse/MESOS-6576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
James Peach updated MESOS-6576:
-------------------------------
Attachment: KillTaskGroupOnTaskFailure.success.log
KillTaskGroupOnTaskFailure.failure.log
> DefaultExecutorTest.KillTaskGroupOnTaskFailure sometimes fails in CI
> --------------------------------------------------------------------
>
> Key: MESOS-6576
> URL: https://issues.apache.org/jira/browse/MESOS-6576
> Project: Mesos
> Issue Type: Bug
> Components: tests
> Reporter: James Peach
> Attachments: KillTaskGroupOnTaskFailure.failure.log,
> KillTaskGroupOnTaskFailure.success.log
>
>
> {{DefaultExecutorTest.KillTaskGroupOnTaskFailure}} sometimes fails in the ASF
> CI.
> Interesting pieces of the failing test run:
> {noformat}
> ...
> I1110 20:38:54.775871 29740 status_update_manager.cpp:323] Received status
> update TASK_KILLED (UUID: a4746389-8155-44e0-ada4-00b8d3e997c1) for task
> df99cc50-9b0f-4692-afc9-d587c3515a67 of framework
> 2df0125f-4865-4aba-b13d-02f338815729-0000
> I1110 20:38:54.776181 29730 slave.cpp:4075] Status update manager
> successfully handled status update TASK_KILLED (UUID:
> a4746389-8155-44e0-ada4-00b8d3e997c1) for task
> df99cc50-9b0f-4692-afc9-d587c3515a67 of framework
> 2df0125f-4865-4aba-b13d-02f338815729-0000
> I1110 20:38:55.456354 29738 hierarchical.cpp:1880] Filtered offer with
> cpus(*):1.7; mem(*):928; disk(*):928; ports(*):[31000-32000] on agent
> 2df0125f-4865-4aba-b13d-02f338815729-S0 for framework
> 2df0125f-4865-4aba-b13d-02f338815729-0000
> I1110 20:38:55.456434 29738 hierarchical.cpp:1694] No allocations performed
> I1110 20:38:55.456468 29738 hierarchical.cpp:1789] No inverse offers to send
> out!
> I1110 20:38:55.456545 29738 hierarchical.cpp:1286] Performed allocation for 1
> agents in 745185ns
> I1110 20:38:55.875964 29731 containerizer.cpp:2336] Container
> a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98 has exited
> I1110 20:38:55.876022 29731 containerizer.cpp:1973] Destroying container
> a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98 in RUNNING state
> I1110 20:38:55.876387 29731 launcher.cpp:143] Asked to destroy container
> a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98
> I1110 20:38:55.881464 29728 provisioner.cpp:324] Ignoring destroy request for
> unknown container a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98
> I1110 20:38:55.882894 29730 slave.cpp:4672] Executor 'default' of framework
> 2df0125f-4865-4aba-b13d-02f338815729-0000 exited with status 0
> I1110 20:38:55.883446 29741 master.cpp:5884] Executor 'default' of framework
> 2df0125f-4865-4aba-b13d-02f338815729-0000 on agent
> 2df0125f-4865-4aba-b13d-02f338815729-S0 at slave(18)@172.17.0.2:36164
> (ade222407ffe): exited with status 0
> I1110 20:38:55.883545 29741 master.cpp:7840] Removing executor 'default' with
> resources cpus(*):0.1; mem(*):32; disk(*):32 of framework
> 2df0125f-4865-4aba-b13d-02f338815729-0000 on agent
> 2df0125f-4865-4aba-b13d-02f338815729-S0 at slave(18)@172.17.0.2:36164
> (ade222407ffe)
> I1110 20:38:55.884820 29729 hierarchical.cpp:1018] Recovered cpus(*):0.1;
> mem(*):32; disk(*):32 (total: cpus(*):2; mem(*):1024; disk(*):1024;
> ports(*):[31000-32000], allocated: cpus(*):0.2; mem(*):64; disk(*):64) on
> agent 2df0125f-4865-4aba-b13d-02f338815729-S0 from framework
> 2df0125f-4865-4aba-b13d-02f338815729-0000
> I1110 20:38:55.885892 29737 scheduler.cpp:675] Enqueuing event FAILURE
> received from <a
> href='http://172.17.0.2:36164/master/api/v1/scheduler'>http://172.17.0.2:36164/master/api/v1/scheduler</a>
> GMOCK WARNING:
> Uninteresting mock function call - returning directly.
> Function call: failure(0x7ffdc4df11f0, @0x2b639800b6b0 48-byte object
> <90-82 AC-51 63-2B 00-00 00-00 00-00 00-00 00-00 07-00 00-00 00-00 00-00
> 70-0A 01-98 63-2B 00-00 20-C7 00-98 63-2B 00-00 00-00 00-00 63-2B 00-00>)
> ...
> I1110 20:39:04.566794 29732 master.cpp:7715] Updating the state of task
> e72d5139-0a11-48af-9d43-d4163c1404ee of framework
> 2df0125f-4865-4aba-b13d-02f338815729-0000 (latest state: TASK_FAILED, status
> update state: TASK_RUNNING)
> ...
> I1110 20:39:04.569413 29736 scheduler.cpp:675] Enqueuing event UPDATE
> received from <a
> href='http://172.17.0.2:36164/master/api/v1/scheduler'>http://172.17.0.2:36164/master/api/v1/scheduler</a>
> ../../src/tests/default_executor_tests.cpp:583: Failure
> Value of: taskStates
> Actual: { (df99cc50-9b0f-4692-afc9-d587c3515a67, TASK_KILLED),
> (e72d5139-0a11-48af-9d43-d4163c1404ee, TASK_FAILED) }
> Expected: expectedTaskStates
> Which is: { (df99cc50-9b0f-4692-afc9-d587c3515a67, TASK_RUNNING),
> (e72d5139-0a11-48af-9d43-d4163c1404ee, TASK_RUNNING) }
> ...
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)