[jira] [Commented] (HIVE-9976) LLAP: Possible race condition in DynamicPartitionPruner for 200ms tasks

2015-03-16 Thread Gunther Hagleitner (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14364039#comment-14364039
 ] 

Gunther Hagleitner commented on HIVE-9976:
--

[~gopalv] said that it was run on Tez 0.7.

 LLAP: Possible race condition in DynamicPartitionPruner for 200ms tasks
 

 Key: HIVE-9976
 URL: https://issues.apache.org/jira/browse/HIVE-9976
 Project: Hive
  Issue Type: Sub-task
  Components: Tez
Affects Versions: llap
Reporter: Gopal V
Assignee: Gunther Hagleitner
 Attachments: llap_vertex_200ms.png


 There is a race condition in the DynamicPartitionPruner between 
 DynamicPartitionPruner::processVertex() and 
 DynamicPartitionPruner::addEvent() for tasks which respond with both the 
 result and success in a single heartbeat sequence.
 {code}
 2015-03-16 07:05:01,589 ERROR [InputInitializer [Map 1] #0] 
 tez.DynamicPartitionPruner: Expecting: 1, received: 0
 2015-03-16 07:05:01,590 ERROR [Dispatcher thread: Central] impl.VertexImpl: 
 Vertex Input: store_sales initializer failed, 
 vertex=vertex_1424502260528_1113_4_04 [Map 1]
 org.apache.tez.dag.app.dag.impl.AMUserCodeException: 
 org.apache.hadoop.hive.ql.metadata.HiveException: Incorrect event count in 
 dynamic parition pruning
 {code}
 !llap_vertex_200ms.png!
 All 4 upstream vertices of Map 1 need to finish within ~200ms to trigger 
 this, which seems to be consistently happening with LLAP.
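
 As an illustration of the kind of ordering problem described above, here is a minimal sketch of an event counter whose completion check does not depend on whether the events or the success notification arrive first. This is not Hive's DynamicPartitionPruner: the class EventCountTracker and its methods addEvent(), sourceFinished(), isComplete() and awaitComplete() are hypothetical names chosen for this example, and it assumes each source vertex reports how many events it sent when it finishes.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical illustration only; not the actual Hive or Tez code. */
final class EventCountTracker {
    private final Map<String, Integer> received = new HashMap<>();
    private final Map<String, Integer> expected = new HashMap<>();
    private final int sourceCount;

    EventCountTracker(int sourceCount) {
        this.sourceCount = sourceCount;
    }

    /** Called when a pruning event arrives from a source vertex. */
    synchronized void addEvent(String source) {
        received.merge(source, 1, Integer::sum);
        notifyAll();
    }

    /** Called when a source vertex reports success and its event count. */
    synchronized void sourceFinished(String source, int expectedEvents) {
        expected.put(source, expectedEvents);
        notifyAll();
    }

    /** True once every source has finished and none has fewer events than expected. */
    synchronized boolean isComplete() {
        if (expected.size() < sourceCount) {
            return false;
        }
        for (Map.Entry<String, Integer> e : expected.entrySet()) {
            int want = e.getValue();
            int got = received.getOrDefault(e.getKey(), 0);
            if (got < want) {
                return false;
            }
        }
        return true;
    }

    /** Waits until complete or the timeout elapses; returns whether it completed. */
    synchronized boolean awaitComplete(long timeoutMillis) throws InterruptedException {
        long deadline = System.currentTimeMillis() + timeoutMillis;
        while (!isComplete()) {
            long remaining = deadline - System.currentTimeMillis();
            if (remaining <= 0) {
                return false;
            }
            wait(remaining);
        }
        return true;
    }
}
```

 The design point of the sketch is that a count mismatch seen at the moment a success notification arrives is treated as "not finished yet" rather than as an error, so a task that delivers its result and its success in the same heartbeat cannot trip the check.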



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-9976) LLAP: Possible race condition in DynamicPartitionPruner for 200ms tasks

2015-03-16 Thread Siddharth Seth (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-9976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14363909#comment-14363909
 ] 

Siddharth Seth commented on HIVE-9976:
--

I'll take a look. I assume this was run with a Tez 0.7 snapshot?
