On Mon, May 14, 2012 at 3:03 PM, Shrijeet Paliwal <[email protected]> wrote: > These M files will have to contain globally sorted entries (first > entry in 0th file will be smallest key and last entry of M-1th file > will be the largest key), No?
Yes > Unless there is a way in bulk import to enforce total order even if > the output of MR is not that way. No. The MR job has to run w/ TOP. Sounds like a split file w/ enough regions in it is way to go; you'll not have to change code or write custom MR job. St.Ack
