Investigate .META. slowdowns when more than 1 store files
---------------------------------------------------------

                 Key: HBASE-2175
                 URL: https://issues.apache.org/jira/browse/HBASE-2175
             Project: Hadoop HBase
          Issue Type: Bug
            Reporter: Jean-Daniel Cryans
             Fix For: 0.21.0


I'm currently testing Hadoop 0.21 with HBase trunk + HBASE-2066 by importing 
our main data set. After some time, probably because of log rolls which force 
flushes and a cluster restart, the .META. region begins to accumulate store 
files. I'm refreshing the master web UI a lot to see our insert speed and saw 
that 1) it was getting slower to refresh and 2) the import speed went down at 
the same time.

Having already seen something like that previously with 0.20, I forced a major 
compaction on .META. and immediately the refresh speed got 10 times better and 
the import throughput went 2x (tasks went from 20 min to 10 min).

Why is scanning and doing random reads from the client that slow when .META. 
has more than 1 store file? If it's a more fondamental speed issue, could we at 
least force major compactions on .META. when it grows so that the rest of the 
cluster doesn't get super slow? By the way, that operation takes less than 1 
second since that region is so small.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to