[ 
https://issues.apache.org/jira/browse/HIVE-15562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated HIVE-15562:
----------------------------------
    Attachment: HIVE-15562.01.patch

Patch makes a few changes. Some unrelated to the core issue.
For the core issue, the registration for state change notifications has moved 
into the main lock.

Additional changes.
- Avoid excessive debug logging in the ZK registry.
- Handle a potential leak in the AM
- Switch around the maybeRegisterForStateChangeNotifications to work correctly 
with the return value (was inverted)

[~sershe] - could you please take a look?

> LLAP TaskExecutorService race can lead to some fragments being permanently 
> lost
> -------------------------------------------------------------------------------
>
>                 Key: HIVE-15562
>                 URL: https://issues.apache.org/jira/browse/HIVE-15562
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>            Priority: Critical
>         Attachments: HIVE-15562.01.patch
>
>
> A fragment can be evicted before it's submission completed. A race can cause 
> a situation As a result, we end up trying to unregister the fragment for 
> stateChangeNotifications before it has been registered - leading to an 
> exception, which skips the actual fragment kill.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to