[ 
https://issues.apache.org/jira/browse/HBASE-26812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17508784#comment-17508784
 ] 

chenglei edited comment on HBASE-26812 at 3/18/22, 1:42 PM:
------------------------------------------------------------

I have written an UT in the PR to reproduce this problem in HBase and write a 
{{ClientServiceBlockingInterfaceWrapper}}  to surround the get 
/scan/multi(which is for multiGet) method calls with nullifying and restoring 
{{RpcCal}}, and because {{ShortCircuitingClusterConnection}} is for 
{{HRegionServer}} and must access {{RpcServer}},  I also move it to 
hbase-server module.


was (Author: comnetwork):
I have written an UT in the PR to reproduce this problem in HBase and write a 
{{ClientServiceBlockingInterfaceWrapper}} to nullify and restore {{RpcCal}} to 
surround the get /scan/multi(which is for multiGet) method calls, and because 
{{ShortCircuitingClusterConnection}} is for HRegionServer and must access 
{{RpcServer}}, so I also move it to hbase-server module.

> ShortCircuitingClusterConnection fails to close RegionScanners when making 
> short-circuited calls
> ------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-26812
>                 URL: https://issues.apache.org/jira/browse/HBASE-26812
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 2.4.9
>            Reporter: Lars Hofhansl
>            Priority: Critical
>
> Just ran into this on the Phoenix side.
> We retrieve a Connection via 
> {{{}RegionCoprocessorEnvironment.createConnection... getTable(...){}}}. And 
> then call get on that table. The Get's key happens to be local. Now each call 
> to table.get() leaves an open StoreScanner around forever. (verified with a 
> memory profiler).
> There references are held via 
> RegionScannerImpl.storeHeap.scannersForDelayedClose. Eventially the 
> RegionServer goes into a GC of death and can only ended with kill -9.
> The reason appears to be that in this case there is no currentCall context. 
> Some time in 2.x the Rpc handler/call was made responsible for closing open 
> region scanners, but we forgot to handle {{ShortCircuitingClusterConnection}}
> It's not immediately clear how to fix this. But it does make 
> ShortCircuitingClusterConnection useless and dangerous. If you use it, you 
> *will* create a giant memory leak.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to