Hey Bill,
Have you tried the Ganglia or JMX stats from your namenode?
I.e., look here:
http://rcf.unl.edu/ganglia/?m=load_one&r=hour&s=descending&c=red-workers&h=hadoop-name&sh=1&hc=4&z=small
The dfs.FSNamesystem.UnderReplicatedBlocks metric should keep track of
what you're looking for. You can query Ganglia or turn on JMX and use
one of the JMX/Nagios connectors.
Brian
On Feb 10, 2009, at 5:05 PM, Bill Au wrote:
I am in the process of setting up remote monitoring of my Hadoop
cluster. I
seems to me that the replication status can only be obtained from the
command line by the fsck command. Has anyone though about adding
replication status to the NameNode web UI in dfshealth.jsp? Or is
that
something that I really shouldn't worry about since Hadoop will fix
things
all by itself?
Bill