[EC2] On demand monitoring and auto scaling
-------------------------------------------

                 Key: HBASE-2300
                 URL: https://issues.apache.org/jira/browse/HBASE-2300
             Project: Hadoop HBase
          Issue Type: Sub-task
            Reporter: Andrew Purtell
            Priority: Minor


Modify the scripts to install monitoring and auto-scaling rules that launch new 
slaves on demand in response to CPU, disk, or I/O (over)use.

http://docs.amazonwebservices.com/AmazonCloudWatch/latest/DeveloperGuide/ 
http://docs.amazonwebservices.com/AutoScaling/latest/DeveloperGuide/ 

CloudWatch metrics are provided as statistics ['Minimum', 'Maximum', 'Sum', 
'Average', 'Samples'] in units of ['Seconds', 'Percent', 'Bytes', 'Bits', 
'Count', 'Bytes/Second', 'Bits/Second', 'Count/Second']. Some useful metrics:
- NetworkIn
- NetworkOut
- CPUUtilization 
- DiskWriteBytes
- DiskWriteOps
- DiskReadBytes
- DiskReadOps

We can be agnostic about statistics and scaling triggers and just pass them 
through to EC2.
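
As a rough illustration of the pass-through idea, the scripts could accept a
free-form trigger specification and hand it on without interpreting it. The
sketch below is only an assumption about how that might look: the function
name parse_trigger_spec and the key names (metric, statistic, unit, upper,
lower) are hypothetical and are not taken from the existing scripts or from
the AWS APIs. It makes no AWS calls; it only turns a "key=value" string into a
dictionary that a later step could forward.

```python
def parse_trigger_spec(spec):
    """Split 'key=value,key=value' into a dict without interpreting values.

    Hypothetical helper: no statistic, unit, or metric name is validated
    here, so whatever the user supplies is passed through unchanged.
    """
    params = {}
    for item in spec.split(","):
        item = item.strip()
        if not item:
            continue  # tolerate empty input and trailing commas
        key, sep, value = item.partition("=")
        if not sep:
            raise ValueError("expected key=value, got %r" % item)
        params[key.strip()] = value.strip()
    return params


# Example specification; the key names are illustrative only.
spec = "metric=CPUUtilization,statistic=Average,unit=Percent,upper=80,lower=20"
params = parse_trigger_spec(spec)
```

Values stay as strings on purpose, so the scripts never need to know which
statistics or units the service accepts.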

This would be especially easy if we ported our scripts to Python, as Hadoop 
has done on trunk, and used boto: http://code.google.com/p/boto/. 

