Hi folks,
I searched around JIRA and didn't find anything that resembled this. Is this something on the roadmap?

For normal aggregations, this is never an issue. But in some cases (typically joins), the map phase can emit a lot of data and take quite a bit of time doing it. Meanwhile, the reducers seem to sit around copying data slowly when they could be merging the map outputs that have already been copied over.

I'm curious whether I have an outlier application, or whether this is generally useful and doable.

Thx,
Joydeep
