[ https://issues.apache.org/jira/browse/HBASE-19290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
binlijin updated HBASE-19290:
-----------------------------
       Resolution: Fixed
    Fix Version/s: 1.5.0
                   3.0.0
                   2.0.0
           Status: Resolved  (was: Patch Available)

> Reduce zk request when doing split log
> --------------------------------------
>
>                 Key: HBASE-19290
>                 URL: https://issues.apache.org/jira/browse/HBASE-19290
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: binlijin
>            Assignee: binlijin
>             Fix For: 2.0.0, 3.0.0, 1.5.0
>
>         Attachments: HBASE-19290.branch-1.001.patch,
> HBASE-19290.branch-1.001.patch, HBASE-19290.master.001.patch,
> HBASE-19290.master.002.patch, HBASE-19290.master.003.patch,
> HBASE-19290.master.004.patch, HBASE-19290.master.005.patch,
> HBASE-19290.master.006.patch, HBASE-19290.master.006.patch
>
>
> We observed that on a cluster with 1000+ nodes, when hundreds of nodes abort
> and their logs have to be split, log splitting is very slow. We found that
> the regionservers and the master were waiting on ZooKeeper responses, so we
> need to reduce the number of ZooKeeper requests, and the resulting pressure
> on ZooKeeper, for big clusters.
> (1) Reduce requests to rsZNode. Every call to calculateAvailableSplitters
> gets rsZNode's children from ZooKeeper, which is heavy when the cluster is
> huge. This patch reduces these requests.
> (2) When a regionserver is already running its maximum number of split
> tasks, it may still try to grab tasks and issue ZooKeeper requests. It
> should sleep and wait until it can grab tasks again.
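
The two ideas in the description can be illustrated with a minimal sketch. This is not the code from the attached patches. The class SplitTaskThrottle, its method names, and the time-based cache are all hypothetical, chosen only to show the general technique: remember the number of region servers instead of listing rsZNode's children on every call, and block a worker that is already at its task limit instead of letting it keep polling ZooKeeper. The serverCountFetcher supplier stands in for the real ZooKeeper read.

```java
import java.util.function.IntSupplier;
import java.util.function.LongSupplier;

/** Hypothetical helper for illustration only; not part of HBase. */
final class SplitTaskThrottle {
    // Stands in for "list rsZNode's children and count them" (assumption).
    private final IntSupplier serverCountFetcher;
    // Injected clock in milliseconds, so the cache expiry can be tested.
    private final LongSupplier clock;
    private final long ttlMillis;
    private final int maxConcurrentTasks;

    private int cachedServerCount = -1;
    private long cachedAtMillis;
    private int tasksInProgress;

    SplitTaskThrottle(IntSupplier serverCountFetcher, LongSupplier clock,
                      long ttlMillis, int maxConcurrentTasks) {
        this.serverCountFetcher = serverCountFetcher;
        this.clock = clock;
        this.ttlMillis = ttlMillis;
        this.maxConcurrentTasks = maxConcurrentTasks;
    }

    /** Idea (1): return the cached count, refetching only after the TTL expires. */
    synchronized int serverCount() {
        long now = clock.getAsLong();
        if (cachedServerCount < 0 || now - cachedAtMillis >= ttlMillis) {
            cachedServerCount = serverCountFetcher.getAsInt();
            cachedAtMillis = now;
        }
        return cachedServerCount;
    }

    /** Non-blocking variant: take a slot if one is free. */
    synchronized boolean tryAcquireSlot() {
        if (tasksInProgress >= maxConcurrentTasks) {
            return false;
        }
        tasksInProgress++;
        return true;
    }

    /** Idea (2): wait, without issuing any requests, until a slot is free. */
    synchronized void awaitFreeSlot() throws InterruptedException {
        while (tasksInProgress >= maxConcurrentTasks) {
            wait();
        }
        tasksInProgress++;
    }

    /** Release a slot and wake up any worker blocked in awaitFreeSlot(). */
    synchronized void taskFinished() {
        if (tasksInProgress > 0) {
            tasksInProgress--;
        }
        notifyAll();
    }
}
```

In this sketch a worker loop would call awaitFreeSlot() before trying to grab a task and taskFinished() when a split completes, so a fully loaded regionserver stays idle rather than sending further requests. A time-based cache is only one option; refreshing the count from a watch on rsZNode would be another.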