[ https://issues.apache.org/jira/browse/HIVE-15529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rajesh Balamohan updated HIVE-15529:
------------------------------------
Resolution: Fixed
Hadoop Flags: Reviewed
Fix Version/s: 2.2.0
Status: Resolved (was: Patch Available)
Thanks [~sershe]. Committed to master.
> LLAP: TaskSchedulerService can get stuck when scheduling tasks as disabled
> node is not re-enabled in NodeEnablerCallable
> ------------------------------------------------------------------------------------------------------------------------
>
> Key: HIVE-15529
> URL: https://issues.apache.org/jira/browse/HIVE-15529
> Project: Hive
> Issue Type: Bug
> Components: llap
> Reporter: Rajesh Balamohan
> Assignee: Rajesh Balamohan
> Priority: Critical
> Fix For: 2.2.0
>
> Attachments: HIVE-15529.1.patch
>
>
> An easy way to reproduce the issue:
> 1. Start the Hive CLI with "--hiveconf hive.execution.mode=llap".
> 2. Run a SQL script file (e.g. a script containing TPC-DS queries).
> 3. In the middle of the run, press Ctrl+C to interrupt the current
> job. This should not exit the Hive CLI yet.
> 4. After some time, launch the same SQL script in the same CLI. It gets
> stuck indefinitely (waiting for the splits to be computed).
> Even after the CLI is quit, the AM runs forever until explicitly killed.
> The issue seems to be in how {{LlapTaskSchedulerService::schedulePendingTasks}}
> handles its loop when it encounters {{DELAYED_RESOURCES}} during task
> scheduling.
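
The failure mode named in the issue title (a disabled node that is never re-enabled, so scheduling keeps reporting DELAYED_RESOURCES) can be illustrated with a toy model. This is only a sketch under stated assumptions: the class StuckSchedulerSketch, its Node type, and the methods trySchedule, schedulePending, and enablerCallback are hypothetical names made up for illustration, and the loop's "stop at the first DELAYED_RESOURCES" behavior is an assumption, not the actual Hive code or the committed patch.

```java
import java.util.ArrayDeque;
import java.util.Queue;

// Toy model only: none of these names come from Hive.
public class StuckSchedulerSketch {

    public enum ScheduleResult { SCHEDULED, DELAYED_RESOURCES }

    // Hypothetical stand-in for a node that can be disabled (e.g. after a failure).
    public static final class Node {
        public boolean enabled = true;
    }

    // A disabled node offers no resources, so the attempt is reported as delayed.
    public static ScheduleResult trySchedule(Node node) {
        return node.enabled ? ScheduleResult.SCHEDULED : ScheduleResult.DELAYED_RESOURCES;
    }

    // One scheduling pass: place pending tasks until resources run out.
    // Returns the number of tasks scheduled in this pass.
    public static int schedulePending(Queue<String> pending, Node node) {
        int scheduled = 0;
        while (!pending.isEmpty()) {
            if (trySchedule(node) == ScheduleResult.DELAYED_RESOURCES) {
                break; // assumed behavior: give up for now and rely on a later pass
            }
            pending.remove();
            scheduled++;
        }
        return scheduled;
    }

    // Stand-in for the enabler callback. The reEnable flag models whether the
    // callback actually flips the node back to enabled.
    public static void enablerCallback(Node node, boolean reEnable) {
        if (reEnable) {
            node.enabled = true;
        }
    }

    public static void main(String[] args) {
        Queue<String> pending = new ArrayDeque<>();
        pending.add("task-1");
        pending.add("task-2");

        Node node = new Node();
        node.enabled = false; // node was disabled, e.g. after the interrupted job

        System.out.println("pass 1 scheduled: " + schedulePending(pending, node));

        enablerCallback(node, false); // callback runs but never re-enables the node
        System.out.println("pass 2 scheduled: " + schedulePending(pending, node));

        enablerCallback(node, true); // callback re-enables the node
        System.out.println("pass 3 scheduled: " + schedulePending(pending, node));
    }
}
```

In this model, every pass schedules nothing for as long as the callback leaves the node disabled, which mirrors the "stuck indefinitely" symptom in the reproduction steps. Once the callback re-enables the node, the next pass drains the pending queue.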
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)