Client region location caches redundant HTableDescriptor's
----------------------------------------------------------
Key: HBASE-4055
URL: https://issues.apache.org/jira/browse/HBASE-4055
Project: HBase
Issue Type: Improvement
Affects Versions: 0.90.3
Reporter: Dave Latham
Fix For: 0.92.0
While examining the heap of a map task in a MapReduce job that writes directly
to HBase, I noticed that the HRegionLocation instances were taking up 90 MB
(out of a 700 MB heap for each map task) to cache the locations for 15K
regions. As the number of regions in the cluster continues to grow, this
continues to grow as well.
Of that, it appears that about 80 MB were going to 15K HTableDescriptor
instances. There are only 5 tables that it's writing to, so it seems to be
wasting a great deal of memory with a separate copy of the table descriptor for
each region.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira