Hi, The metrics provide information for the reduce (i.e. shuffleReaders) tasks about the time taken to fetch the shuffle outputs. Is there a way I can find out the the time taken by a map task (ie shuffleWriter) on a remote machine to read its input partition from disk?
I believe I should look in HadoopRDD.scala where there is the getRecordReader, and the headers show that it should be in org.apache.hadoop.mapred.RecordReader, but I can't find that file anywhere. Any help would be appreciated. thanks! Umar
