[jira] [Commented] (HIVE-9976) LLAP: Possible race condition in DynamicPartitionPruner for 200ms tasks
[ https://issues.apache.org/jira/browse/HIVE-9976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14364039#comment-14364039 ]

Gunther Hagleitner commented on HIVE-9976:
--

[~gopalv] said that it was on Tez 0.7.

LLAP: Possible race condition in DynamicPartitionPruner for 200ms tasks

Key: HIVE-9976
URL: https://issues.apache.org/jira/browse/HIVE-9976
Project: Hive
Issue Type: Sub-task
Components: Tez
Affects Versions: llap
Reporter: Gopal V
Assignee: Gunther Hagleitner
Attachments: llap_vertex_200ms.png

Race condition in the DynamicPartitionPruner between DynamicPartitionPruner::processVertex() and DynamicPartitionPruner::addEvent() for tasks which respond with both the result and success in a single heartbeat sequence.

{code}
2015-03-16 07:05:01,589 ERROR [InputInitializer [Map 1] #0] tez.DynamicPartitionPruner: Expecting: 1, received: 0
2015-03-16 07:05:01,590 ERROR [Dispatcher thread: Central] impl.VertexImpl: Vertex Input: store_sales initializer failed, vertex=vertex_1424502260528_1113_4_04 [Map 1]
org.apache.tez.dag.app.dag.impl.AMUserCodeException: org.apache.hadoop.hive.ql.metadata.HiveException: Incorrect event count in dynamic parition pruning
{code}

!llap_vertex_200ms.png!

All 4 upstream vertices of Map 1 need to finish within ~200ms to trigger this, which seems to be consistently happening with LLAP.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-9976) LLAP: Possible race condition in DynamicPartitionPruner for 200ms tasks
[ https://issues.apache.org/jira/browse/HIVE-9976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14363909#comment-14363909 ]

Siddharth Seth commented on HIVE-9976:
--

I'll take a look. Assuming this was run with a Tez 0.7 snapshot?
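For readers following the thread, the ordering problem in the issue description can be sketched roughly as below. This is a simplified illustration, not the Hive code. The classes `NaivePruner` and `TolerantPruner`, the vertex name `"upstream"`, and the method signatures are all assumptions made for the example. Only the names `processVertex` and `addEvent` and the "Expecting: 1, received: 0" message come from the report. The naive variant checks the event count as soon as the success notification is handled, so it fails if that notification is processed before the event. The second variant shows one possible way to make the check independent of arrival order. It is not necessarily the fix applied for this issue.

```java
import java.util.HashMap;
import java.util.Map;

final class PrunerRaceSketch {

    /** Hypothetical naive variant: validates the count when the vertex reports success. */
    static class NaivePruner {
        private final Map<String, Integer> received = new HashMap<>();

        synchronized void addEvent(String vertex) {
            received.merge(vertex, 1, Integer::sum);
        }

        synchronized void processVertex(String vertex, int expected) {
            int got = received.getOrDefault(vertex, 0);
            if (got != expected) {
                // Mirrors the shape of the logged error; fires if success is handled first.
                throw new IllegalStateException("Expecting: " + expected + ", received: " + got);
            }
        }
    }

    /** Hypothetical order-independent variant: records both facts, checks only when asked. */
    static class TolerantPruner {
        private final Map<String, Integer> received = new HashMap<>();
        private final Map<String, Integer> expected = new HashMap<>();

        synchronized void addEvent(String vertex) {
            received.merge(vertex, 1, Integer::sum);
        }

        synchronized void processVertex(String vertex, int expectedCount) {
            expected.put(vertex, expectedCount);
        }

        synchronized boolean isComplete(String vertex) {
            Integer want = expected.get(vertex);
            int got = received.getOrDefault(vertex, 0);
            return want != null && got == want;
        }
    }

    /** Replays the problematic order (success before event) against the naive variant. */
    static boolean naiveFailsWhenSuccessArrivesFirst() {
        NaivePruner pruner = new NaivePruner();
        try {
            pruner.processVertex("upstream", 1);
            return false;
        } catch (IllegalStateException e) {
            return "Expecting: 1, received: 0".equals(e.getMessage());
        }
    }

    public static void main(String[] args) {
        System.out.println("naive fails: " + naiveFailsWhenSuccessArrivesFirst());

        TolerantPruner pruner = new TolerantPruner();
        pruner.processVertex("upstream", 1);
        System.out.println("complete before event: " + pruner.isComplete("upstream"));
        pruner.addEvent("upstream");
        System.out.println("complete after event: " + pruner.isComplete("upstream"));
    }
}
```

The example replays the two calls in a fixed order on one thread so that the failure is deterministic. In the reported scenario the two calls would arrive from different threads within the same heartbeat window.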