On Tue, Sep 13, 2011 at 11:20 PM, Sujee Maniyam <[email protected]> wrote: > hehe J-D (hopefully first name!)
:) > I agree with your point that pre-splitting the table can make a big > difference. > > Do the later versions of 'PerformanceEvaluation' class has an option to > pre-split the table? I remember, when I ran this for the first time, > only one region server is busy until the table split. But second time > around, all the region-servers were hit with requests. No, this wasn't added. Nicolas had this idea tho: https://issues.apache.org/jira/browse/HBASE-4163 > I just peeked at the code for this class, and it does NOT truncate the > table. So subsequent runs benefit from split tables already. And it looks > like it is overriding the rows. > > ** So I will mention this, and say to ignore the first run and only measure > subsequent runs. what do you think? ** It will still be splitting a ton, even tho it's on multiple servers. J-D
