[ 
https://issues.apache.org/jira/browse/HIVE-22687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17006116#comment-17006116
 ] 

Hive QA commented on HIVE-22687:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12989739/HIVE-22687.01.patch

{color:red}ERROR:{color} -1 due to no test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 17786 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.common.format.datetime.TestHiveSqlDateTimeFormatter.testParseTimestamp (batchId=311)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/20051/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/20051/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-20051/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 1 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12989739 - PreCommit-HIVE-Build

> Query hangs indefinitely if LLAP daemon registers after the query is submitted
> ------------------------------------------------------------------------------
>
>                 Key: HIVE-22687
>                 URL: https://issues.apache.org/jira/browse/HIVE-22687
>             Project: Hive
>          Issue Type: Bug
>          Components: llap
>    Affects Versions: 3.1.0
>            Reporter: Himanshu Mishra
>            Assignee: Himanshu Mishra
>            Priority: Major
>         Attachments: HIVE-22687.01.patch
>
>
> If a query is submitted while no LLAP daemon is running, it waits for 1 minute 
> and then times out with the error {{SERVICE_UNAVAILABLE}}.
> If a new LLAP daemon starts during that wait, the timeout is cancelled, 
> but the tasks are still never scheduled. As a result, the query hangs 
> indefinitely.
> This is caused by a race condition: the LLAP daemon first registers the LLAP 
> instance at {{.../workers/worker-0000}} and only afterwards registers 
> {{.../workers/slot-0000}}. In the gap between the two registrations, the 
> Tez AM is notified of the worker ZooKeeper node and, while processing it, 
> checks whether the slot ZooKeeper node is present; if it is not, the Tez AM 
> rejects the LLAP daemon. The error in the Tez AM is:
> {code:java}
> [INFO] [LlapScheduler] |impl.LlapZookeeperRegistryImpl|: Unknown slot for 8ebfdc45-0382-4757-9416-52898885af90
> {code}
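
The ordering problem described above can be replayed deterministically in a few lines. The following is only a minimal sketch under stated assumptions: {{SlotRaceSketch}}, {{register}}, {{onWorkerNodeAdded}} and {{simulate}} are hypothetical names, a plain in-memory set stands in for ZooKeeper, and the model assumes that nothing re-checks the slot after the first rejection. It is not the Hive or Tez code and it does not show what the attached patch does.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical, simplified model of the race; none of these names are Hive or Tez APIs.
final class SlotRaceSketch {
    // Placeholder paths copied from the description, including the elided prefix.
    static final String WORKER = ".../workers/worker-0000";
    static final String SLOT = ".../workers/slot-0000";

    // Stand-in for the ZooKeeper namespace: the set of znodes that currently exist.
    private final Set<String> znodes = new HashSet<>();
    private boolean daemonAccepted = false;

    // Daemon side: create one znode.
    void register(String path) {
        znodes.add(path);
    }

    // AM side: callback for "worker znode appeared". It checks the slot znode
    // once and rejects the daemon if the slot is not there yet.
    void onWorkerNodeAdded() {
        if (znodes.contains(SLOT)) {
            daemonAccepted = true;
        } else {
            System.out.println("Unknown slot for worker (daemon rejected)");
        }
    }

    // Replays one fixed interleaving and reports whether the daemon was accepted.
    static boolean simulate(boolean slotExistsBeforeNotification) {
        SlotRaceSketch s = new SlotRaceSketch();
        s.register(WORKER);
        if (slotExistsBeforeNotification) {
            s.register(SLOT);
            s.onWorkerNodeAdded();
        } else {
            s.onWorkerNodeAdded(); // notification lands in the gap
            s.register(SLOT);      // too late: this model never re-checks the slot
        }
        return s.daemonAccepted;
    }

    public static void main(String[] args) {
        System.out.println("slot registered in time: accepted=" + simulate(true));
        System.out.println("notification in the gap: accepted=" + simulate(false));
    }
}
```

The sketch uses a fixed interleaving instead of real threads so that the outcome is reproducible: the daemon is accepted only when the slot node already exists at the moment the worker notification is processed.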



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
