parthasarathy created MESOS-3701: ------------------------------------ Summary: Reciving Task_lost error while launching job Key: MESOS-3701 URL: https://issues.apache.org/jira/browse/MESOS-3701 Project: Mesos Issue Type: Task Components: hadoop Affects Versions: 0.23.0 Reporter: parthasarathy
I am getting TASK_LOST error while launching job from mesos for wordcount and I am using external hdfs in my environment. Kindly suggest to resolve the same. Mesos Version: 0.23.0 Hadoop: Hadoop 2.0.0-cdh4.7.0 Below is the error, I1009 12:06:43.723626 2395 slave.cpp:4193] Received oversubscribable resources from the resource estimator I1009 12:06:56.772047 2388 slave.cpp:1244] Got assigned task Task_Tracker_0 for framework 20151009-120054-2458766602-5050-2333-0000 I1009 12:06:56.773672 2388 slave.cpp:1355] Launching task Task_Tracker_0 for framework 20151009-120054-2458766602-5050-2333-0000 I1009 12:06:56.781332 2388 slave.cpp:4733] Launching executor executor_Task_Tracker_0 of framework 20151009-120054-2458766602-5050-2333-0000 with resources cpus(*):0.15; mem(*):368 in work directory '/tmp/mesos/slaves/20151007-153859-2458766602-5050-31659-S1/frameworks/20151009-120054-2458766602-5050-2333-0000/executors/executor_Task_Tracker_0/runs/2d0d9ad0-b42c-4d13-a0ab-75524d280c21' I1009 12:06:56.782209 2395 containerizer.cpp:534] Starting container '2d0d9ad0-b42c-4d13-a0ab-75524d280c21' for executor 'executor_Task_Tracker_0' of framework '20151009-120054-2458766602-5050-2333-0000' I1009 12:06:56.782562 2388 slave.cpp:1573] Queuing task 'Task_Tracker_0' for executor executor_Task_Tracker_0 of framework '20151009-120054-2458766602-5050-2333-0000 I1009 12:06:56.790356 2395 launcher.cpp:131] Forked child with pid '3026' for container '2d0d9ad0-b42c-4d13-a0ab-75524d280c21' I1009 12:06:58.724186 2393 slave.cpp:4179] Querying resource estimator for oversubscribable resources I1009 12:06:58.724879 2393 slave.cpp:4193] Received oversubscribable resources from the resource estimator I1009 12:07:01.332754 2388 slave.cpp:2333] Got registration for executor 'executor_Task_Tracker_0' of framework 20151009-120054-2458766602-5050-2333-0000 from executor(1)@10.205.141.146:52433 I1009 12:07:01.336251 2391 slave.cpp:1729] Sending queued task 'Task_Tracker_0' to executor 'executor_Task_Tracker_0' of framework 20151009-120054-2458766602-5050-2333-0000 I1009 12:07:01.931179 2389 containerizer.cpp:1188] Executor for container '2d0d9ad0-b42c-4d13-a0ab-75524d280c21' has exited I1009 12:07:01.932209 2389 containerizer.cpp:1001] Destroying container '2d0d9ad0-b42c-4d13-a0ab-75524d280c21' I1009 12:07:01.965467 2384 slave.cpp:3356] Executor 'executor_Task_Tracker_0' of framework 20151009-120054-2458766602-5050-2333-0000 exited with status 1 I1009 12:07:01.967715 2384 slave.cpp:2671] Handling status update TASK_LOST (UUID: c7c335eb-8125-4497-850d-569d5b7f4896) for task Task_Tracker_0 of framework 20151009-120054-2458766602-5050-2333-0000 from @0.0.0.0:0 W1009 12:07:01.968459 2390 containerizer.cpp:885] Ignoring update for unknown container: 2d0d9ad0-b42c-4d13-a0ab-75524d280c21 I1009 12:07:01.969363 2387 status_update_manager.cpp:322] Received status update TASK_LOST (UUID: c7c335eb-8125-4497-850d-569d5b7f4896) for task Task_Tracker_0 of framework 20151009-120054-2458766602-5050-2333-0000 I1009 12:07:01.970538 2384 slave.cpp:2926] Forwarding the update TASK_LOST (UUID: c7c335eb-8125-4497-850d-569d5b7f4896) for task Task_Tracker_0 of framework 20151009-120054-2458766602-5050-2333-0000 to master@10.205.141.146:5050 I1009 12:07:01.979173 2394 status_update_manager.cpp:394] Received status update acknowledgement (UUID: c7c335eb-8125-4497-850d-569d5b7f4896) for task Task_Tracker_0 of framework 20151009-120054-2458766602-5050-2333-0000 I1009 12:07:01.980028 2385 slave.cpp:3460] Cleaning up executor 'executor_Task_Tracker_0' of framework 20151009-120054-2458766602-5050-2333-0000 I1009 12:07:01.980412 2384 gc.cpp:56] Scheduling '/tmp/mesos/slaves/20151007-153859-2458766602-5050-31659-S1/frameworks/20151009-120054-2458766602-5050-2333-0000/executors/executor_Task_Tracker_0/runs/2d0d9ad0-b42c-4d13-a0ab-75524d280c21' for gc 6.99998865415407days in the future Regards, Partha -- This message was sent by Atlassian JIRA (v6.3.4#6332)