On Wed, May 25, 2011 at 1:33 PM, Sean Owen <[email protected]> wrote: > (I suggest we not use IntWritable or LongWritable, but favor VarIntWritable > and VarLongWritable, which are variable length encoding versions, where > possible. Saving a couple bytes per key adds up.) >
If you have millions to hundreds of millions of keys, how many of them are going to be low enough to fit in less than 4 bytes? As soon as you have more than 16 million, "most" numbers take up the full 4 bytes, right? -jake
