steven xu created HBASE-11833:
---------------------------------
Summary: Hbase does not closing a closed socket resulting in
thousand of CLOSE_WAIT sockets
Key: HBASE-11833
URL: https://issues.apache.org/jira/browse/HBASE-11833
Project: HBase
Issue Type: Bug
Components: regionserver
Affects Versions: 0.98.0
Environment: RHEL 6.3 -HDP 2.1 -6 RegionServers/Datanode -18T per node
-3108Regions
Reporter: steven xu
HBase dose not close a dead connection with the datanode.
This resulting in over 30K+ CLOSE_WAIT and at some point HBase can not connect
to the datanode because too many mapped sockets from one host to another on the
same port:50010.
After I restart all RSs, the count of CLOSE_WAIT will increase always.
$ netstat -an|grep CLOSE_WAIT|wc -l
2545
# netstat -nap|grep CLOSE_WAIT|grep 6569|wc -l
2545
# ps -ef|grep 6569
hbase 6569 6556 21 Aug25 ? 09:52:33 /opt/jdk1.6.0_25/bin/java
-Dproc_regionserver -XX:OnOutOfMemoryError=kill -9 %p -Xmx1000m
-XX:+UseConcMarkSweepGC
I aslo have reviewed these issues:
[HBASE-9393|https://issues.apache.org/jira/browse/HBASE-9393]
[HDFS-5671|https://issues.apache.org/jira/browse/HDFS-5671]
[HDFS-1836|https://issues.apache.org/jira/browse/HDFS-1836]
I found HBase 0.98/Hadoop 2.4.0 I uesed which source codes are different from
these patches.
But I donot understand why HBase 0.98/Hadoop 2.4.0 also have this isssue.
Please check. Thanks a lot.
--
This message was sent by Atlassian JIRA
(v6.2#6252)