[
https://issues.apache.org/jira/browse/TEZ-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14092388#comment-14092388
]
Jeff Zhang commented on TEZ-1358:
---------------------------------
Verify the 2 cases ( bad environment settings, localization failures ), which
cause container launch failed. The error messages looks clear to me, no need to
fix it. [~hitesh], please help confirm it.
The following are the error messages:
*bad environment settings*
{code}
DAG diagnostics: [Vertex failed, vertexName=tokenizer,
vertexId=vertex_1407719224092_0019_1_00, diagnostics=[Task failed,
taskId=task_1407719224092_0019_1_00_000000, diagnostics=[TaskAttempt 0 failed,
info=[Container container_1407719224092_0019_01_000002 finished with
diagnostics set to [Container failed. Invalid environment variable name: "na=me"
]], TaskAttempt 1 failed, info=[Container
container_1407719224092_0019_01_000003 finished with diagnostics set to
[Container failed. Invalid environment variable name: "na=me"
]], TaskAttempt 2 failed, info=[Container
container_1407719224092_0019_01_000004 finished with diagnostics set to
[Container failed. Invalid environment variable name: "na=me"
]], TaskAttempt 3 failed, info=[Container
container_1407719224092_0019_01_000005 finished with diagnostics set to
[Container failed. Invalid environment variable name: "na=me"
]]], Vertex failed as one or more tasks failed. failedTasks:1], Vertex killed,
vertexName=summer, vertexId=vertex_1407719224092_0019_1_01, diagnostics=[Vertex
received Kill while in RUNNING state., Vertex killed as other vertex failed.
failedTasks:0], DAG failed due to vertex failure. failedVertices:1
killedVertices:1]
{code}
*localization failures*
{code}
DAG diagnostics: [Vertex failed, vertexName=tokenizer,
vertexId=vertex_1407719224092_0018_1_00, diagnostics=[Task failed,
taskId=task_1407719224092_0018_1_00_000000, diagnostics=[TaskAttempt 0 failed,
info=[Container container_1407719224092_0018_01_000002 finished with
diagnostics set to [Container failed. Resource
hdfs://0.0.0.0:9000/user/jzhang/tez-mapreduce-0.5.0-SNAPSHOT.jar changed on src
filesystem (expected 1407725193190, was 1407725193090
]], TaskAttempt 1 failed, info=[Container
container_1407719224092_0018_01_000003 finished with diagnostics set to
[Container failed. Resource
hdfs://0.0.0.0:9000/user/jzhang/tez-mapreduce-0.5.0-SNAPSHOT.jar changed on src
filesystem (expected 1407725193190, was 1407725193090
]], TaskAttempt 2 failed, info=[Container
container_1407719224092_0018_01_000004 finished with diagnostics set to
[Container failed. Resource
hdfs://0.0.0.0:9000/user/jzhang/tez-mapreduce-0.5.0-SNAPSHOT.jar changed on src
filesystem (expected 1407725193190, was 1407725193090
]], TaskAttempt 3 failed, info=[Container
container_1407719224092_0018_01_000005 finished with diagnostics set to
[Container failed. Resource
hdfs://0.0.0.0:9000/user/jzhang/tez-mapreduce-0.5.0-SNAPSHOT.jar changed on src
filesystem (expected 1407725193190, was 1407725193090
]]], Vertex failed as one or more tasks failed. failedTasks:1], Vertex killed,
vertexName=summer, vertexId=vertex_1407719224092_0018_1_01, diagnostics=[Vertex
received Kill while in RUNNING state., Vertex killed as other vertex failed.
failedTasks:0], DAG failed due to vertex failure. failedVertices:1
killedVertices:1]
{code}
> Display better diagnostics when tasks fail to launch
> -----------------------------------------------------
>
> Key: TEZ-1358
> URL: https://issues.apache.org/jira/browse/TEZ-1358
> Project: Apache Tez
> Issue Type: Sub-task
> Reporter: Hitesh Shah
> Assignee: Jeff Zhang
>
> Tasks could fail to launch due to various issues - bad environment settings,
> localization failures.
--
This message was sent by Atlassian JIRA
(v6.2#6252)