So what other memory consumption tools do you suggest? I don't want to do it manually and dump statistics into file because IO will affect performance too.
Thanks, Mark On Tue, May 17, 2011 at 2:58 PM, Allen Wittenauer <a...@apache.org> wrote: > > On May 17, 2011, at 1:01 PM, Mark question wrote: > > > Hi > > > > I need to use hadoop-tool-kit for monitoring. So I followed > > http://code.google.com/p/hadoop-toolkit/source/checkout > > > > and applied the patch in my hadoop.20.2 directory as: patch -p0 < > patch.20.2 > > Looking at the code, be aware this is going to give incorrect > results/suggestions for certain stats it generates when multiple jobs are > running. > > It also seems to lack "the algorithm should be rewritten" and "the > data was loaded incorrectly" suggestions, which is usually the proper answer > for perf problems 80% of the time.