Hi,

I'm a beginner in Hadoop and have a few basic questions:
1) I am looking for APIs to retrieve the capacity of the cluster, so that I can 
write a script that decides when to add a new slave node to the cluster:

   a) The number of TaskTrackers and the maximum number of Mappers each 
      TaskTracker can spawn
   b) The CPU, RAM, and disk capacity of each TaskTracker
   c) How to decide when to add a new slave node to the cluster
2) What is the API to retrieve metrics such as current resource usage and the 
currently running/spawned Mappers/Reducers?

3) What is the purpose of Hadoop Common? Is it the API for interacting with Hadoop?


I referred to the following URLs:
Hadoop Common: 
http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/mapred/
Capacity Scheduler: 
http://hortonworks.com/blog/understanding-apache-hadoops-capacity-scheduler/

Regards,
Nagaraju B


