Re: Hadoop shuffling traffic

Bing Jiang Thu, 25 Sep 2014 20:28:42 -0700

see mapreduce.job.reduce.slowstart.completedmaps
It gives hint of  when reduce tasks could kick off.


2014-09-26 8:36 GMT+08:00 Abdul Navaz <[email protected]>:

> Hello,
>
> I am having a Hadoop cluster with 1 name node and 3 data nodes. I running
> sample word count job on 1GB of file which is distributed among the HDFS.
>
> When I run the map reduce job, before even completing the mapping 100 %
> reduce starts.  Say for eg map 40% reduce 10% etc.
>
> I would like to know when the shuffling traffic starts ?
>
> ->  Is there any way to find out when exactly shuffling started ?  Does it
> generate any syslog in the logs .
> -> How to find the total amount of shuffling traffic?
>
>
>
> Thanks & Regards,
>
> Abdul Navaz
> Research Assistant
> University of Houston Main Campus, Houston TX
> Ph: 281-685-0388
>
>


-- 
Bing Jiang
Tel：(86)134-2619-1361
weibo: http://weibo.com/jiangbinglover
BLOG: www.binospace.com
BLOG: http://blog.sina.com.cn/jiangbinglover
Focus on distributed computing, HDFS/HBase

Re: Hadoop shuffling traffic

Reply via email to