[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965374#action_12965374
 ] 

Vishal K commented on ZOOKEEPER-883:
------------------------------------

Hi,

If this problem is reproducible, can you try the patch attached to 
ZOOKEEPER-880? Judging from the logs, that patch will fix only part of the 
problem: it prevents the leak of SendWorker threads.

The logs show 320 RecvWorker threads blocked in a read() on a remote 
socket. Two of these should be legitimate threads to remote servers. The 
rest are very likely from Nagios.

I can think of two cases that could lead to this situation:
1. Nagios may not be closing its connections. If Nagios were closing them, 
the read() would have received an exception.
2. The monitoring frequency is set too high.
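
To illustrate case 1, here is a minimal Java sketch. It is not the ZooKeeper code itself; it only assumes that a worker reads from a blocking socket with DataInputStream.readInt(). The class name BlockedReadSketch, the method probe, and the 500 ms timeout are made up for the demo. The timeout exists only so the demo terminates; without one, the read on an idle connection would wait indefinitely.

```java
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.net.SocketTimeoutException;

// Hypothetical demo class, not part of ZooKeeper.
class BlockedReadSketch {

    /**
     * Accepts one loopback connection and attempts a single readInt() on it.
     * If closeClient is true, the client closes right after connecting;
     * otherwise it stays open and silent, like a monitor that never hangs up.
     */
    static String probe(boolean closeClient) {
        InetAddress loopback = InetAddress.getLoopbackAddress();
        try (ServerSocket server = new ServerSocket(0, 1, loopback);
             Socket client = new Socket(loopback, server.getLocalPort());
             Socket accepted = server.accept()) {
            // Assumption for the demo only: bound the wait to 500 ms.
            accepted.setSoTimeout(500);
            if (closeClient) {
                client.close();
            }
            DataInputStream in = new DataInputStream(accepted.getInputStream());
            try {
                in.readInt();
                return "data";
            } catch (EOFException e) {
                // Peer closed the connection: the reader is woken up.
                return "eof";
            } catch (SocketTimeoutException e) {
                // Peer is still connected but silent: the reader was stuck.
                return "blocked";
            }
        } catch (IOException e) {
            return "error: " + e;
        }
    }

    public static void main(String[] args) {
        System.out.println("peer closed:     " + probe(true));
        System.out.println("peer stays open: " + probe(false));
    }
}
```

On a typical loopback setup the first call should report eof and the second blocked. The second case is the one that would keep a reader thread and its file handle occupied for as long as the peer stays connected.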

-Vishal


> Idle cluster increasingly consumes CPU resources
> ------------------------------------------------
>
>                 Key: ZOOKEEPER-883
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-883
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 3.3.1
>            Reporter: Lars George
>         Attachments: Archive.zip
>
>
> Monitoring the ZooKeeper nodes by polling the various ports using Nagios' 
> open port checks seems to cause a substantial rise in the CPU used by the 
> ZooKeeper daemons. Over the course of a week, an idle cluster grew from a 
> baseline of 2% to >10% CPU usage. Attached are a stack dump and logs showing 
> the occupied threads. In the end, the daemon starts failing with "too many 
> open files" errors as all handles are used up.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.