Lance Amundsen wrote:
There's lots of references on decreasing DFS block size to increase maps to record ratios. What is the easiest way to do this? Is it possible with the standard SequenceFile class?
You could specify the block size in the Configuration parameter to SequenceFile#createWriter() using the dfs.block.size parameter. But if you simply want to create sub-block-size splits, then increasing the number of map tasks should do that.
Doug