Chen,

Files are stored based on the reducer partitions, not exactly per-key.
The result is that there are far lesser files than you imagine there
ought to be. The keys are kept sorted inside the partitioned files and
thus you do not lose out on your key groups either.

See Partitioner, which is responsible for doing the partitioning of
your map outputs:
(http://hadoop.apache.org/common/docs/r0.20.2/mapred_tutorial.html#Partitioner)

On Sun, Sep 25, 2011 at 10:30 PM, He Chen <airb...@gmail.com> wrote:
> Hi everyone
>
> According to my understanding of Hadoop, it save MapReduce  job's
> intermediate results into files in the mapper's hard drive. Each key will
> occupy a file. I am curious what will happen if mapper's hard drive does not
> have enough inodes to save the generated keys.  Because every file needs a
> inode.
>
> Best wishes!
>
> Chen
>



-- 
Harsh J

Reply via email to