[ 
https://issues.apache.org/jira/browse/TEZ-2269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482719#comment-14482719
 ] 

Siddharth Seth commented on TEZ-2269:
-------------------------------------

Committing this. Thanks for the help debugging and verifying 
[~rajesh.balamohan], and [~zjffdu]. This is probably the same as 2267. Will 
close that one out if the tests aren't as flaky going forward.
Still not sure why the writeLock in it's original form was an issue though.

> DAGAppMaster becomes unresponsive
> ---------------------------------
>
>                 Key: TEZ-2269
>                 URL: https://issues.apache.org/jira/browse/TEZ-2269
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.7.0
>            Reporter: Rajesh Balamohan
>         Attachments: TEZ-2269.altlock.txt, TEZ-2269.altlock.txt, 
> TEZ-2269.test.patch, app_master_application_1428021179455_0001_jstack.txt, 
> client_jstack.txt
>
>
> Scenario:
> - Run TPCH query20 @ 1 TB scale
> - Tez master branch, Hive trunk
> - auto-reduce parallelism is not an issue (happens with/without auto-reduce 
> parallelism)
> 1 or 2 times in 10 runs, DAGAppMaster would freeze unexpectedly.  There is no 
> pattern observed on which vertex it happens. But when this happens, only 
> option is to kill the application.   I will attach the jstack soon, but that 
> doesn't seem to reveal much.
> Need to debug more; Creating this JIRA for tracking purposes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to