[
https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14177886#comment-14177886
]
Rohith commented on YARN-2579:
------------------------------
bq. Under what conditions, can resetDispatcher be called by two threads
simultaneously?
resetDispatcher is called only once in synchronized block(transitionToStandBy
or transitinedToActive).
Here the problem is ,
*Thread-1 :* just before stoppingActiveServices() from trainsitionToStandBy()
method if RMFatalEvent is thrown then RMFatalEventDispatcher wait for
trainsitionToStandBy() for obtaining lock.RMFatalEventDispatcher is BLOCKED on
trainsitionToStandBy().
*Thread-2 :* From the elector, trainsitionedTotandBy() stops dispatcher in
resetDispatcher() method. (Service)Dispatcher.stop() wait for draining out
RMFatalEventDispatcher event.But "AsyncDispatcher event handler" is WAITING on
dispatcher thread to finish.
> Both RM's state is Active , but 1 RM is not really active.
> ----------------------------------------------------------
>
> Key: YARN-2579
> URL: https://issues.apache.org/jira/browse/YARN-2579
> Project: Hadoop YARN
> Issue Type: Bug
> Components: resourcemanager
> Affects Versions: 2.5.1
> Reporter: Rohith
> Assignee: Rohith
> Attachments: YARN-2579.patch, YARN-2579.patch
>
>
> I encountered a situaltion where both RM's web page was able to access and
> its state displayed as Active. But One of the RM's ActiveServices were
> stopped.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)