This might have been a misconfiguration. I'll report back if I see it again.
On Mon, Jul 8, 2013 at 1:55 PM, Benjamin Mahler <benjamin.mah...@gmail.com>wrote: > Are these the un-edited logs? I'm expecting to see some logs from the > process_isolator or cgroups_isolator in there. > > > On Fri, Jul 5, 2013 at 2:38 PM, Brenden Matthews < > brenden.matth...@airbedandbreakfast.com> wrote: > > > Hey guys, > > > > I'm currently having a problem where tasks will get stuck in the staging > > state, though according to the logs they should have been terminated. > They > > hang indefinitely, or until I restart the slave. Below is a screenshot + > > logs. Also interesting is the 'Failed to collect resource usage ...' > > messages. > > > > [image: Inline image 2] > > > > I0705 16:19:51.551512 9706 slave.cpp:739] Got assigned task > >> ct:1373041190990:0:add_latest_reservation_survey_events_partitio > >> n for framework chronos > >> I0705 16:19:51.552150 9706 slave.cpp:837] Launching task > >> ct:1373041190990:0:add_latest_reservation_survey_events_partition f > >> or framework chronos > >> I0705 16:19:51.553956 9706 paths.hpp:303] Created executor directory > >> '/tmp/mesos/slaves/201307030043-2037266954-5050-15277-1 > >> > >> > 517/frameworks/chronos/executors/ct:1373041190990:0:add_latest_reservation_survey_events_partition/runs/611ba128-557f-4b5e-8c > >> f2-4d1ce60d618f' > >> I0705 16:19:51.554576 9706 slave.cpp:948] Queuing task > >> 'ct:1373041190990:0:add_latest_reservation_survey_events_partition' f > >> or executor > >> ct:1373041190990:0:add_latest_reservation_survey_events_partition of > >> framework 'c > >> hronos > >> I0705 16:19:51.555027 9706 slave.cpp:511] Successfully attached file > >> > '/tmp/mesos/slaves/201307030043-2037266954-5050-15277-1517/frameworks/chronos/executors/ct:1373041190990:0:add_latest_reservation_survey_events_partition/runs/611ba128-557f-4b5e-8cf2-4d1ce60d618f' > >> I0705 16:19:54.048754 9724 slave.cpp:2530] Current usage 42.18%. Max > >> allowed age: 22.955009563956388hrs > >> W0705 16:19:54.108963 9724 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:19:59.110787 9729 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:04.112406 9704 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:09.114367 9705 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:14.116312 9706 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:19.118370 9699 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:24.120311 9701 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:29.122355 9700 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:34.123443 9722 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:39.125660 9718 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:44.127464 9724 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:49.129385 9725 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> I0705 16:20:51.555174 9703 slave.cpp:2482] Terminating executor > >> ct:1373041190990:0:add_latest_reservation_survey_events_partition of > >> framework chronos because it did not register within 1mins > >> I0705 16:20:54.050434 9717 slave.cpp:2530] Current usage 42.18%. Max > >> allowed age: 22.955009342481944hrs > >> W0705 16:20:54.130730 9699 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:20:59.132472 9702 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:21:04.134557 9713 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > >> W0705 16:21:09.135619 9701 monitor.cpp:186] Failed to collect resource > >> usage for executor 'executor_Task_Tracker_8023' of framework > >> '201307030043-2037266954-5050-15277-0006': Future discarded > > > > > > >