ZanderXu opened a new pull request, #4691:
URL: https://github.com/apache/hadoop/pull/4691

   ### Description of PR
   Jira link: [HADOOP-18389](https://issues.apache.org/jira/browse/HADOOP-18389)
   In our prod environment, we encountered an accident that JN OOM because 
Server#Connection#responseQueue used 97% memory.
   
   After analyzed the memory of JN and found that there are 2w+ called stacked 
in one Server#Connection#responseQueue, because the network between NN and JN 
jitters with some tcp packet loss.
   
   We can refer to some screenshots in 
[HADOOP-18389](https://issues.apache.org/jira/browse/HADOOP-18389)
   
   In this case, I think Client.java should support limit the stacked calls of 
one connection to avoid the possible OOM in Server.  When the number of stacked 
calls is more than the limit size, we can just throw one IOException to the 
method caller.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to