On Mon, Nov 14, 2011 at 11:37 AM, Sam Seigal <[email protected]> wrote: > If you are not too concerned with random access time, but want more > efficient scans, is increasing the block size then a good idea ? >
I'd say leave things as they are unless you have a problem. For your case, where random read latency is not so important and you are only scanning, upping the block size should not change your scan latencies and it will make the hfile indices smaller (if you double the blocksize to 128k, your indices should be halved -- you can see index sizes in your regionserver UI). St.Ack
