Howdy, I want to take a look at a MR job which seems to be slower than I had hoped. Mind you, this MR job is only running on a pseudo-distributed VM (cloudera cdh4).
I have modified my mapred-site.xml with the following (that last one is commented out because it crashes my MR job): <property> <name>mapred.task.profile</name> <value>true</value> </property> <property> <name>mapred.task.profile.maps</name> <value>0-2</value> </property> <property> <name>mapred.task.profile.reduces</name> <value>0-2</value> </property> <!--property> <name>mapred.task.profile.params</name> <value>agentlib:hprof=cpu=samples,heap=sites,depth=6,force=n,thread=y,verbose=n,file=%s</value> </property--> Are there any resources that explain how to interpret the results? Or maybe an open-source app that could help display the results in a more intuiative manner? Ideally, we'd want to know where we are spending most of our time. Cheers, David