[ 
https://issues.apache.org/jira/browse/HDFS-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13747765#comment-13747765
 ] 

Chris Nauroth commented on HDFS-5124:
-------------------------------------

bq. Once those 5 jam up waiting on the FSN lock, the listener thread is going 
to keep accepting sockets as fast as he can - at least until an OOM

This is the first time I noticed this logic.  Thanks for pointing it out.  
Shouldn't a properly tuned listen backlog prevent OOM though?  Or have you seen 
one of those cases where the OS doesn't really enforce the listen backlog you 
requested?

At this point, I'm really torn on whether or not to hold the namesystem lock.  
(Damned if we do, damned if we don't.)  Risk of OOM could tip the scale though.

                
> Namenode in secure cluster deadlocks
> ------------------------------------
>
>                 Key: HDFS-5124
>                 URL: https://issues.apache.org/jira/browse/HDFS-5124
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>    Affects Versions: 2.1.1-beta
>         Environment: Secure Hadoop 2 cluster
>            Reporter: Deepesh Khandelwal
>            Assignee: Daryn Sharp
>            Priority: Blocker
>         Attachments: HADOOP-5124.patch, HDFS-5124.001.patch, 
> HDFS-5124.002.patch, nn_jstack.out
>
>
> Namenode deadlocks after a while in use.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to