Hello, I have a Hadoop cluster with 1 NameNode and 3 DataNodes. I am running the sample word count job on a 1 GB file that is distributed across HDFS.
When I run the MapReduce job, the reduce phase starts before the map phase has reached 100%, for example "map 40% reduce 10%". I would like to know when the shuffle traffic starts.

-> Is there any way to find out exactly when shuffling started? Does it generate any syslog entry in the logs?
-> How can I find the total amount of shuffle traffic?

Thanks & Regards,
Abdul Navaz
Research Assistant
University of Houston Main Campus, Houston TX
Ph: 281-685-0388
