[ 
https://issues.apache.org/jira/browse/YARN-353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karthik Kambatla updated YARN-353:
----------------------------------

    Attachment: yarn-353-12-wip.patch

While looking into the findbugs issue, noticed that there was a potential bug 
in ZKAction implementation. 

In YARN-353.11.patch and other previous patches, ZKAction has a local copy of 
zkClient. If this zkClient is null, it waits for createConnection() to connect. 
However, this local copy is not updated, making the wait-notify between 
ZKAction#runWithCheck and createConnection moot.

Posting a wip patch (haven't tested it on a cluster yet) that 
# gets rid of the local variables in ZKAction
# adds synchronization around the wait in ZKAction#runWithCheck()
# moves zkDoWithRetries to ZKAction#runWithRetries

With this patch, there was no need to exclude findbugs warnings. Verified 
TestRMRestart passes.
                
> Add Zookeeper-based store implementation for RMStateStore
> ---------------------------------------------------------
>
>                 Key: YARN-353
>                 URL: https://issues.apache.org/jira/browse/YARN-353
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>            Reporter: Hitesh Shah
>            Assignee: Bikas Saha
>         Attachments: YARN-353.10.patch, YARN-353.11.patch, 
> yarn-353-12-wip.patch, YARN-353.1.patch, YARN-353.2.patch, YARN-353.3.patch, 
> YARN-353.4.patch, YARN-353.5.patch, YARN-353.6.patch, YARN-353.7.patch, 
> YARN-353.8.patch, YARN-353.9.patch
>
>
> Add store that write RM state data to ZK

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to