Hi, I'm beginner in Hadoop concepts. I have few basic questions: 1) looking for APIs to retrieve the capacity of the cluster. so that i can write a script to when to add a new slave node to the cluster
a) No.of Task trackers and capacity of each task tracker to spawn max No.of Mappers b) CPU,RAM and disk capacity of each tracker c) how to decide to add a new slave node to the cluster 2) what is the API to retrieve metrics like current usage of resources and currently running/spawned Mappers/Reducers 3) what is the purpose of Hadoop-common?Is it API to interact with hadoop I referred following URL: for Hadoop common : http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/mapred/ Capacity scheduler : http://hortonworks.com/blog/understanding-apache-hadoops-capacity-scheduler/ Regards, Nagaraju B DISCLAIMER ========== This e-mail may contain privileged and confidential information which is the property of Persistent Systems Ltd. It is intended only for the use of the individual or entity to which it is addressed. If you are not the intended recipient, you are not authorized to read, retain, copy, print, distribute or use this message. If you have received this communication in error, please notify the sender and delete all copies of this message. Persistent Systems Ltd. does not accept any liability for virus infected mails.