[ https://issues.apache.org/jira/browse/HIVE-11660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Siddharth Seth updated HIVE-11660:
----------------------------------
Attachment: HIVE-11660.1.txt
Attaching a patch to fix the tests. I ran 100 iterations of both tests on a
Linux box, where the failures are normally seen, and all of them passed.
There are some real bugs that were causing TestLlapTaskSchedulerService to fail.
The last allocateTaskRequest for a DAG could have ended up being ignored.
Also, the waitQueue in TaskScheduler can be improved; I filed a separate JIRA
for this.
[~sershe] - please review.
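
To illustrate how a final request can end up ignored, here is a minimal sketch of one common way to guard against it: the scheduling loop records a "pending" flag under the lock, so a request that arrives while the loop is busy is still picked up on the next pass. This is not the code in the attached patch and not the actual LLAP scheduler; the class PendingFlagSchedulerSketch and all of its fields and methods are hypothetical names made up for the example.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical sketch only; not the LLAP scheduler and not the HIVE-11660 patch.
class PendingFlagSchedulerSketch {
  private final ReentrantLock lock = new ReentrantLock();
  private final Condition workArrived = lock.newCondition();
  private final Condition progress = lock.newCondition();
  private final Queue<String> requests = new ArrayDeque<>();
  // Set whenever a request is queued. Because the loop checks this flag
  // before waiting, a signal sent while the loop is busy is not lost.
  private boolean pending = false;
  private boolean stopped = false;
  private int processed = 0;
  private final Thread loop = new Thread(this::runLoop, "scheduler-sketch");

  void start() {
    loop.setDaemon(true);
    loop.start();
  }

  // Stand-in for an allocate-task request coming from the application.
  void allocateTask(String request) {
    lock.lock();
    try {
      requests.add(request);
      pending = true;
      workArrived.signal();
    } finally {
      lock.unlock();
    }
  }

  private void runLoop() {
    while (true) {
      List<String> batch = new ArrayList<>();
      lock.lock();
      try {
        // Wait only if nothing was queued since the previous pass.
        while (!pending && !stopped) {
          workArrived.awaitUninterruptibly();
        }
        if (stopped && !pending) {
          return;
        }
        pending = false;
        batch.addAll(requests);
        requests.clear();
      } finally {
        lock.unlock();
      }
      // "Scheduling" happens outside the lock; here it only counts requests.
      lock.lock();
      try {
        processed += batch.size();
        progress.signalAll();
      } finally {
        lock.unlock();
      }
    }
  }

  // Returns true once at least `expected` requests were handled, false on timeout.
  boolean awaitProcessed(int expected, long timeoutMillis) {
    long nanos = TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
    lock.lock();
    try {
      while (processed < expected) {
        if (nanos <= 0) {
          return false;
        }
        nanos = progress.awaitNanos(nanos);
      }
      return true;
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return false;
    } finally {
      lock.unlock();
    }
  }

  void shutdown() {
    lock.lock();
    try {
      stopped = true;
      workArrived.signal();
    } finally {
      lock.unlock();
    }
  }

  public static void main(String[] args) {
    PendingFlagSchedulerSketch s = new PendingFlagSchedulerSketch();
    s.start();
    for (int i = 0; i < 100; i++) {
      s.allocateTask("task-" + i);
    }
    System.out.println("all handled: " + s.awaitProcessed(100, 5000));
    s.shutdown();
  }
}
```

If the loop instead waited unconditionally after each pass, a request queued during the pass would sit unhandled until some later request signalled again, which for the last request of a DAG would mean never.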
> LLAP: TestTaskExecutorService is flaky
> --------------------------------------
>
> Key: HIVE-11660
> URL: https://issues.apache.org/jira/browse/HIVE-11660
> Project: Hive
> Issue Type: Bug
> Reporter: Sergey Shelukhin
> Assignee: Siddharth Seth
> Attachments: HIVE-11660.1.txt
>
>
> {noformat}
> java.lang.Exception: test timed out after 10000 milliseconds
> at sun.misc.Unsafe.park(Native Method)
> at java.util.concurrent.locks.LockSupport.park(LockSupport.java:186)
> at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2043)
> at org.apache.hadoop.hive.llap.daemon.impl.TestTaskExecutorService$TaskExecutorServiceForTest$InternalCompletionListenerForTest.awaitCompletion(TestTaskExecutorService.java:244)
> at org.apache.hadoop.hive.llap.daemon.impl.TestTaskExecutorService$TaskExecutorServiceForTest$InternalCompletionListenerForTest.access$000(TestTaskExecutorService.java:208)
> at org.apache.hadoop.hive.llap.daemon.impl.TestTaskExecutorService.testWaitQueuePreemption(TestTaskExecutorService.java:168)
> {noformat}
> Cannot repro locally. See HIVE-11642