Thomas Marshall created MESOS-513:
-------------------------------------
Summary: FaultToleranceTest.SchedulerFailoverFrameworkMessage test
is flaky
Key: MESOS-513
URL: https://issues.apache.org/jira/browse/MESOS-513
Project: Mesos
Issue Type: Bug
Reporter: Thomas Marshall
https://hadrian.millennium.berkeley.edu/jenkins/job/Mesos-minimal/370/console
[ RUN ] FaultToleranceTest.SchedulerFailoverFrameworkMessage
I0617 13:43:31.122650 14700 master.cpp:228] Master started on 127.0.1.1:55889
I0617 13:43:31.122716 14700 master.cpp:243] Master ID:
201306171343-16842879-55889-14678
W0617 13:43:31.122899 14697 master.cpp:83] No whitelist given. Advertising
offers for all slaves
I0617 13:43:31.123078 14700 master.cpp:526] Elected as master!
I0617 13:43:31.129281 14700 slave.cpp:219] Slave started on 72)@127.0.1.1:55889
I0617 13:43:31.129338 14700 slave.cpp:220] Slave resources: cpus=2; mem=1024;
ports=[31000-32000]; disk=1024
I0617 13:43:31.129907 14699 master.cpp:569] Registering framework
201306171343-16842879-55889-14678-0000 at scheduler(61)@127.0.1.1:55889
I0617 13:43:31.129971 14700 slave.cpp:540] New master detected at
[email protected]:55889
I0617 13:43:31.130146 14699 hierarchical_allocator_process.hpp:327] Added
framework 201306171343-16842879-55889-14678-0000
I0617 13:43:31.130167 14700 slave.cpp:555] Postponing registration until
recovery is complete
I0617 13:43:31.130192 14698 status_update_manager.cpp:155] New master detected
at [email protected]:55889
I0617 13:43:31.130249 14700 slave.cpp:401] Finished recovery
I0617 13:43:31.130493 14698 master.cpp:891] Attempting to register slave on
ubuntu at slave(72)@127.0.1.1:55889
I0617 13:43:31.130522 14698 master.cpp:1851] Adding slave
201306171343-16842879-55889-14678-0 at ubuntu with cpus=2; mem=1024;
ports=[31000-32000]; disk=1024
I0617 13:43:31.130635 14699 slave.cpp:600] Registered with master
[email protected]:55889; given slave ID 201306171343-16842879-55889-14678-0
I0617 13:43:31.130759 14697 hierarchical_allocator_process.hpp:449] Added slave
201306171343-16842879-55889-14678-0 (ubuntu) with cpus=2; mem=1024;
ports=[31000-32000]; disk=1024 (and cpus=2; mem=1024; ports=[31000-32000];
disk=1024 available)
I0617 13:43:31.131098 14697 master.cpp:1239] Sending 1 offers to framework
201306171343-16842879-55889-14678-0000
I0617 13:43:31.131892 14699 master.cpp:1472] Processing reply for offer
201306171343-16842879-55889-14678-0 on slave
201306171343-16842879-55889-14678-0 (ubuntu) for framework
201306171343-16842879-55889-14678-0000
I0617 13:43:31.132138 14699 master.hpp:291] Adding task 1 with resources
cpus=2; mem=1024; ports=[31000-32000]; disk=1024 on slave
201306171343-16842879-55889-14678-0
I0617 13:43:31.132213 14699 master.cpp:1591] Launching task 1 of framework
201306171343-16842879-55889-14678-0000 with resources cpus=2; mem=1024;
ports=[31000-32000]; disk=1024 on slave 201306171343-16842879-55889-14678-0
(ubuntu)
I0617 13:43:31.132488 14699 slave.cpp:740] Got assigned task 1 for framework
201306171343-16842879-55889-14678-0000
I0617 13:43:31.132761 14699 slave.cpp:838] Launching task 1 for framework
201306171343-16842879-55889-14678-0000
I0617 13:43:31.134079 14699 paths.hpp:303] Created executor directory
'/tmp/FaultToleranceTest_SchedulerFailoverFrameworkMessage_DVe9Uf/slaves/201306171343-16842879-55889-14678-0/frameworks/201306171343-16842879-55889-14678-0000/executors/default/runs/127bf532-ba74-46ef-8d5f-63636383e97e'
I0617 13:43:31.134562 14699 slave.cpp:949] Queuing task '1' for executor
default of framework '201306171343-16842879-55889-14678-0000
I0617 13:43:31.134639 14699 slave.cpp:522] Successfully attached file
'/tmp/FaultToleranceTest_SchedulerFailoverFrameworkMessage_DVe9Uf/slaves/201306171343-16842879-55889-14678-0/frameworks/201306171343-16842879-55889-14678-0000/executors/default/runs/127bf532-ba74-46ef-8d5f-63636383e97e'
I0617 13:43:31.134835 14699 slave.cpp:1396] Got registration for executor
'default' of framework 201306171343-16842879-55889-14678-0000
I0617 13:43:31.135053 14699 slave.cpp:1511] Flushing queued task 1 for executor
'default' of framework 201306171343-16842879-55889-14678-0000
I0617 13:43:31.136834 14697 slave.cpp:1693] Handling status update TASK_RUNNING
(UUID: 86548a6f-f7b9-4fcb-95cf-aa32b6a7e757) for task 1 of framework
201306171343-16842879-55889-14678-0000
I0617 13:43:31.137197 14699 status_update_manager.cpp:290] Received status
update TASK_RUNNING (UUID: 86548a6f-f7b9-4fcb-95cf-aa32b6a7e757) for task 1 of
framework 201306171343-16842879-55889-14678-0000 with checkpoint=false
I0617 13:43:31.137267 14699 status_update_manager.cpp:450] Creating
StatusUpdate stream for task 1 of framework
201306171343-16842879-55889-14678-0000
I0617 13:43:31.137382 14699 status_update_manager.cpp:336] Forwarding status
update TASK_RUNNING (UUID: 86548a6f-f7b9-4fcb-95cf-aa32b6a7e757) for task 1 of
framework 201306171343-16842879-55889-14678-0000 to [email protected]:55889
I0617 13:43:31.137531 14697 master.cpp:1022] Status update from
slave(72)@127.0.1.1:55889: task 1 of framework
201306171343-16842879-55889-14678-0000 is now in state TASK_RUNNING
I0617 13:43:31.137565 14699 slave.cpp:1810] Sending acknowledgement for status
update TASK_RUNNING (UUID: 86548a6f-f7b9-4fcb-95cf-aa32b6a7e757) for task 1 of
framework 201306171343-16842879-55889-14678-0000 to executor(26)@127.0.1.1:55889
I0617 13:43:31.138015 14700 status_update_manager.cpp:360] Received status
update acknowledgement 86548a6f-f7b9-4fcb-95cf-aa32b6a7e757 for task 1 of
framework 201306171343-16842879-55889-14678-0000
I0617 13:43:31.138618 14698 master.cpp:604] Re-registering framework
201306171343-16842879-55889-14678-0000 at scheduler(62)@127.0.1.1:55889
I0617 13:43:31.138685 14698 master.cpp:623] Framework
201306171343-16842879-55889-14678-0000 failed over
I0617 13:43:31.139152 14697 slave.cpp:1863] Sending message for framework
201306171343-16842879-55889-14678-0000 to scheduler(61)@127.0.1.1:55889
W0617 13:43:31.139173 14698 master.cpp:721] scheduler(61)@127.0.1.1:55889 tried
to deactivate framework; expecting scheduler(62)@127.0.1.1:55889
I0617 13:43:31.139255 14697 slave.cpp:1278] Updating framework
201306171343-16842879-55889-14678-0000 pid to scheduler(62)@127.0.1.1:55889
W0617 13:43:36.124213 14699 master.cpp:83] No whitelist given. Advertising
offers for all slaves
../../src/tests/fault_tolerance_tests.cpp:1140: Failure
Failed to wait 5secs for frameworkMessage
W0617 13:43:41.125799 14700 master.cpp:83] No whitelist given. Advertising
offers for all slaves
W0617 13:43:46.127622 14697 master.cpp:83] No whitelist given. Advertising
offers for all slaves
W0617 13:43:51.129123 14698 master.cpp:83] No whitelist given. Advertising
offers for all slaves
W0617 13:43:56.130734 14697 master.cpp:83] No whitelist given. Advertising
offers for all slaves
W0617 13:44:01.131566 14697 master.cpp:83] No whitelist given. Advertising
offers for all slaves
W0617 13:44:06.132598 14699 master.cpp:83] No whitelist given. Advertising
offers for all slaves
W0617 13:44:11.133581 14700 master.cpp:83] No whitelist given. Advertising
offers for all slaves
.....
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira