[ 
https://issues.apache.org/jira/browse/HIVE-15017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15589425#comment-15589425
 ] 

Sergey Shelukhin edited comment on HIVE-15017 at 10/19/16 6:09 PM:
-------------------------------------------------------------------

The app logs appear to be for the AMs.
Can you download the full application logs for Tez or MR apps? (via yarn logs 
... command).
If they don't have anything for the problematic container (e.g. 
containerId=container_1475850791417_0105_01_000002, 
nodeId=datanode06.bigdata.fr:60737), it might be possible to go the node and 
try to find container log directory to see its output


was (Author: sershe):
The app logs appear to be for the AMs.
Can you download the full application logs for Tez or MR apps?
If they don't have anything for the problematic container (e.g. 
containerId=container_1475850791417_0105_01_000002, 
nodeId=datanode06.bigdata.fr:60737), it might be possible to go the node and 
try to find container log directory to see its output

> Random job failures with MapReduce and Tez
> ------------------------------------------
>
>                 Key: HIVE-15017
>                 URL: https://issues.apache.org/jira/browse/HIVE-15017
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>    Affects Versions: 2.1.0
>         Environment: Hadoop 2.7.2, Hive 2.1.0
>            Reporter: Alexandre Linte
>            Priority: Critical
>         Attachments: hive_cli_mr.txt, hive_cli_tez.txt, 
> nodemanager_logs_mr_job.txt, yarn_syslog_mr_job.txt, yarn_syslog_tez_job.txt
>
>
> Since Hive 2.1.0, we are facing a blocking issue on our cluster. All the jobs 
> are failing randomly on mapreduce and tez as well. 
> In both case, we don't have any ERROR or WARN message in the logs. You can 
> find attached:
> - hive cli output errors 
> - yarn logs for a tez and mapreduce job
> - nodemanager logs (mr only, we have the same logs with tez)
> Note: This issue doesn't exist with Pig jobs (mr + tez), Spark jobs (mr), so 
> this cannot be an Hadoop / Yarn issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to