[ 
https://issues.apache.org/jira/browse/HDFS-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13747694#comment-13747694
 ] 

Chris Nauroth commented on HDFS-5124:
-------------------------------------

bq. Do we even need a read lock on the namespace at all? Token verification has 
nothing to do with the namespace.

The only potential problem I see with removing the namesystem read lock is that 
failover acquires the write lock in {{HAState#setStateInternal}}.  If this NN 
is in the middle of transitioning from standby to active, then holding the read 
lock in {{DelegationTokenSecretManager#retrievePassword}} blocks clients until 
the transition to active has completed.  Without the read lock, clients will 
get an immediate {{StandbyException}} even though this NN is about to become 
active.  If this happens rapidly enough, then the client could exhaust its max 
retries and get an RPC error before the transition to active completes.

I kicked off a Jenkins run for the new patch:

https://builds.apache.org/job/PreCommit-HDFS-Build/4870/

                
> Namenode in secure cluster deadlocks
> ------------------------------------
>
>                 Key: HDFS-5124
>                 URL: https://issues.apache.org/jira/browse/HDFS-5124
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>    Affects Versions: 2.1.1-beta
>         Environment: Secure Hadoop 2 cluster
>            Reporter: Deepesh Khandelwal
>            Assignee: Jing Zhao
>            Priority: Blocker
>         Attachments: HADOOP-5124.patch, HDFS-5124.001.patch, 
> HDFS-5124.002.patch, nn_jstack.out
>
>
> Namenode deadlocks after a while in use.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to