Hi All,

Can anyone point me to documentation on how the on-disk merge in the reduce
phase works in detail in Hadoop 2.2.0?
There is an explanation on this page, but I am afraid it may be outdated, since
what I observe in my log files is a number of "OnDiskMerger - Thread to merge
on-disk map-outputs" entries at the end of the merge phase.

Thanks,
-