Hi,
Apologies if this is a repeat.  I'm not sure my earlier post went out, given 
the timing of my subscription being accepted.

I'm a new subscriber to this list with some questions about Hadoop JMX metrics.

We are running version 0.20.2.
hadoop-metrics.properties is configured to use GangliaContext31 for dfs, mapred, 
and rpc, with period=10.
hadoop-env.sh is set up for jmxremote, with the namenode and datanode accessible on 
ports 8004 and 8005.
I am able to access the MBeans and retrieve attributes and values, using both 
jconsole and my own Java client code.
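
For reference, a minimal sketch of this kind of JMX client (not the actual code in use). It assumes the standard RMI connector URL that jmxremote exposes. The default port 8004 is one of the two mentioned above, and the default "hadoop:*" object-name pattern is an assumption about the MBean domain, so pass a different pattern if your MBeans are registered elsewhere.

```java
import java.util.Map;
import java.util.TreeMap;
import javax.management.MBeanAttributeInfo;
import javax.management.MBeanServerConnection;
import javax.management.ObjectName;
import javax.management.remote.JMXConnector;
import javax.management.remote.JMXConnectorFactory;
import javax.management.remote.JMXServiceURL;

class JmxAttributeDump {

    // Standard RMI-based JMX URL for a JVM started with
    // -Dcom.sun.management.jmxremote.port=<port>.
    static String serviceUrl(String host, int port) {
        return "service:jmx:rmi:///jndi/rmi://" + host + ":" + port + "/jmxrmi";
    }

    // Reads every readable attribute of every MBean matching the pattern.
    // Keys have the form "<object name>#<attribute name>".
    static Map<String, Object> readAll(MBeanServerConnection conn, ObjectName pattern)
            throws Exception {
        Map<String, Object> out = new TreeMap<>();
        for (ObjectName name : conn.queryNames(pattern, null)) {
            for (MBeanAttributeInfo attr : conn.getMBeanInfo(name).getAttributes()) {
                if (!attr.isReadable()) {
                    continue;
                }
                String key = name + "#" + attr.getName();
                try {
                    out.put(key, conn.getAttribute(name, attr.getName()));
                } catch (Exception e) {
                    // Some attributes fail on read; record that instead of aborting.
                    out.put(key, "<unreadable: " + e + ">");
                }
            }
        }
        return out;
    }

    public static void main(String[] args) throws Exception {
        String host = args.length > 0 ? args[0] : "localhost";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 8004;
        // Assumed MBean domain; adjust to whatever jconsole shows.
        String pattern = args.length > 2 ? args[2] : "hadoop:*";

        JMXServiceURL url = new JMXServiceURL(serviceUrl(host, port));
        try (JMXConnector connector = JMXConnectorFactory.connect(url)) {
            readAll(connector.getMBeanServerConnection(), new ObjectName(pattern))
                    .forEach((k, v) -> System.out.println(k + " = " + v));
        }
    }
}
```

Run it as "java JmxAttributeDump <host> <port> <pattern>"; with no arguments it tries localhost:8004.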

These are my questions/comments:

1.      (Comment)  It seems that certain counters, such as the number of reads and 
writes, are interval values: they reset to zero after publication, in my case 
every 10 seconds.  For a JMX client querying these stats at arbitrary, or even 
regular, time intervals, this is not a good basis for deriving useful 
info from these stats.  Would the NullContextWithUpdateThread context behave 
differently, that is, not reset?  For a "dashboard"-type application, the metrics 
would be meaningful only if it receives the periodic publications.
2.      Our Hadoop runs on a 4-node cluster with 4 datanodes.  The NameNodeInfo 
MBean (class org.apache.hadoop.hdfs.server.namenode.FSNamesystem) reports "used 
space" and "last contact" for all 4 nodes.  How should I interpret the value for 
"last contact"?  For example, for a DeadNode, the last contact value is a large 
number.
3.      There is a datanode MBean whose key property begins with 
"FSDatasetState-DS-..." (class 
org.apache.hadoop.hdfs.server.datanode.FSDataset) that contains counts for 
remaining/capacity/used space, but it covers only one node.  Why is there not 
one for each datanode?
4.      Is the spec for the GangliaContext publication format available, or is 
it proprietary?
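
To illustrate the concern in item 1, here is a toy simulation, not Hadoop code. IntervalCounter is a hypothetical stand-in for a metric that is reset on every publication, with one event per second assumed. It compares the true event count with what a client would compute by summing the values it happens to read at its own polling interval.

```java
class IntervalCounterDemo {

    // Hypothetical model of a counter that is reset on each publication.
    static final class IntervalCounter {
        private long current;        // events since the last publication
        private long lastPublished;  // value visible to readers

        void inc() {
            current++;
        }

        void publish() {
            lastPublished = current;
            current = 0;
        }

        long read() {
            return lastPublished;
        }
    }

    // Returns {true event count, sum of the values seen by the poller}.
    static long[] simulate(int publishPeriod, int pollPeriod, int seconds) {
        IntervalCounter counter = new IntervalCounter();
        long trueTotal = 0;
        long polledSum = 0;
        for (int t = 1; t <= seconds; t++) {
            counter.inc();  // assumed load: one event per second
            trueTotal++;
            if (t % publishPeriod == 0) {
                counter.publish();
            }
            if (t % pollPeriod == 0) {
                polledSum += counter.read();
            }
        }
        return new long[] { trueTotal, polledSum };
    }

    public static void main(String[] args) {
        for (int poll : new int[] { 5, 10, 15 }) {
            long[] r = simulate(10, poll, 60);
            System.out.println("poll every " + poll + "s: true=" + r[0]
                    + " polled=" + r[1]);
        }
    }
}
```

In this model, with a 10-second publication period over 60 seconds, polling every 5 seconds counts intervals twice (110 against a true 60), polling every 15 seconds misses intervals (40), and only a poll aligned with the publication period recovers the true total (60).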

Maybe my issues are due to the version we're running?  I'm sure things will be 
different with metrics2.

Thanks for any information.


David Wong
HP


