steven xu created HBASE-11833:
---------------------------------

             Summary: Hbase does not closing a closed socket resulting in 
thousand of CLOSE_WAIT sockets
                 Key: HBASE-11833
                 URL: https://issues.apache.org/jira/browse/HBASE-11833
             Project: HBase
          Issue Type: Bug
          Components: regionserver
    Affects Versions: 0.98.0
         Environment: RHEL 6.3 -HDP 2.1 -6 RegionServers/Datanode -18T per node 
-3108Regions
            Reporter: steven xu


HBase dose not close a dead connection with the datanode.
This resulting in over 30K+ CLOSE_WAIT and at some point HBase can not connect 
to the datanode because too many mapped sockets from one host to another on the 
same port:50010. 
After I restart all RSs,  the count of CLOSE_WAIT will increase always.
$ netstat -an|grep CLOSE_WAIT|wc -l
2545
# netstat -nap|grep CLOSE_WAIT|grep 6569|wc -l
2545
# ps -ef|grep 6569
hbase     6569  6556 21 Aug25 ?        09:52:33 /opt/jdk1.6.0_25/bin/java 
-Dproc_regionserver -XX:OnOutOfMemoryError=kill -9 %p -Xmx1000m 
-XX:+UseConcMarkSweepGC 

I aslo have reviewed these issues:
[HBASE-9393|https://issues.apache.org/jira/browse/HBASE-9393]
[HDFS-5671|https://issues.apache.org/jira/browse/HDFS-5671]
[HDFS-1836|https://issues.apache.org/jira/browse/HDFS-1836]
I found HBase 0.98/Hadoop 2.4.0 I uesed which source codes are different from 
these patches.
But I donot understand why HBase 0.98/Hadoop 2.4.0 also have this isssue. 
Please check. Thanks a lot.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to