Hi Vinod, I found some WARNING entries in the slave log. I think they may be related to the "Failed to collect resource usage for executor" message.
The content is like this:

W0523 00:19:35.861829 14117 process_isolator.cpp:402] Failed to get status of descendant process 18874 of parent 20448: Failed to open '/proc/18874/stat'
W0523 00:49:54.263650 14118 process_isolator.cpp:402] Failed to get status of descendant process 24563 of parent 20990: Failed to open '/proc/24563/stat'
W0523 01:00:52.704118 14108 process_isolator.cpp:402] Failed to get status of descendant process 5462 of parent 20990: Failed to open '/proc/5462/stat'
W0523 04:25:26.100183 14116 monitor.cpp:167] Failed to collect resource usage for executor 'executor_Task_Tracker_93' of framework '201305221443-252063498-5050-2128-0000': 0
W0523 04:25:27.095105 14106 monitor.cpp:167] Failed to collect resource usage for executor 'executor_Task_Tracker_102' of framework '201305221443-252063498-5050-2128-0000': 0
W0523 04:25:31.101133 14106 monitor.cpp:167] Failed to collect resource usage for executor 'executor_Task_Tracker_93' of framework '201305221443-252063498-5050-2128-0000': 0
W0523 04:25:32.096012 14106 monitor.cpp:167] Failed to collect resource usage for executor 'executor_Task_Tracker_102' of framework '201305221443-252063498-5050-2128-0000': 0

Guodong

On Fri, May 24, 2013 at 1:48 AM, Vinod Kone <[email protected]> wrote:

> I unfortunately cannot access the web links you pasted. It would be much
> better if you can just paste the slave logs, so that I can diagnose.
>
> > Is this related to this param "--executor_shutdown_grace_period", I can see
> > the default value is 5 seconds, if the executor shutdown after 5 seconds,
> > what will happen then?
>
> The grace period is used when the slave tries to shutdown an executor. The
> slave typically shuts down an executor if the framework is shutting down or
> if the slave itself is shutting down. After sending a shutdown to the
> executor, the slave expects the executor process to terminate. If it
> doesn't terminate within "executor_shutdown_grace_period" duration, then it
> will issue a unix 'kill'.
>
> > Thanks.
> >
> > Guodong
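
The grace-period behavior described in the quoted reply (ask the executor to stop, wait, then force a kill) can be sketched roughly as below. This is only an illustration in Python, not the Mesos slave code: the function name shutdown_with_grace is made up for the example, and SIGTERM stands in for the shutdown message the slave sends to the executor.

```python
import subprocess


def shutdown_with_grace(proc, grace_seconds):
    """Ask a child process to stop; force-kill it if it outlives the grace period.

    Hypothetical helper for illustration only. `proc` is a subprocess.Popen
    object and `grace_seconds` plays the role of executor_shutdown_grace_period.
    """
    # Polite request first (SIGTERM on Unix). Assumption: this stands in for
    # the shutdown message sent to the executor.
    proc.terminate()
    try:
        # Give the process up to grace_seconds to exit on its own.
        proc.wait(timeout=grace_seconds)
        return "exited"
    except subprocess.TimeoutExpired:
        # Grace period elapsed and the process is still running: hard kill.
        proc.kill()
        proc.wait()  # reap the process so it does not linger as a zombie
        return "killed"
```

With a 5 second grace period, a process that exits promptly is never killed, while one that ignores the request is killed once the 5 seconds are up.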
