[EC2] On demand monitoring and auto scaling
-------------------------------------------
Key: HBASE-2300
URL: https://issues.apache.org/jira/browse/HBASE-2300
Project: Hadoop HBase
Issue Type: Sub-task
Reporter: Andrew Purtell
Priority: Minor
Modify the EC2 scripts to install monitoring and auto-scaling rules that launch
new slaves on demand when CPU, disk, or I/O is overused.
http://docs.amazonwebservices.com/AmazonCloudWatch/latest/DeveloperGuide/
http://docs.amazonwebservices.com/AutoScaling/latest/DeveloperGuide/
CloudWatch metrics are provided as statistics ['Minimum', 'Maximum', 'Sum',
'Average', 'Samples'] in units of ['Seconds', 'Percent', 'Bytes', 'Bits',
'Count', 'Bytes/Second', 'Bits/Second', 'Count/Second']. Some useful metrics:
- NetworkIn
- NetworkOut
- CPUUtilization
- DiskWriteBytes
- DiskWriteOps
- DiskReadBytes
- DiskReadOps
We can be agnostic about statistics and scaling triggers: the scripts do not
need to interpret them, and can just pass the user's choices through to EC2.
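
A minimal sketch of what "pass through" could look like, assuming the user
supplies a trigger as a single string. The spec format
"Metric:Statistic:Unit:Upper:Lower", the function name parse_trigger, and the
keys of the returned dict are all hypothetical; they are not CloudWatch or
Auto Scaling parameter names. Only the statistic and unit names come from the
lists above.

```python
# Statistic and unit names as listed in this issue. Everything else in this
# sketch (spec format, function name, dict keys) is a hypothetical layout.
STATISTICS = ("Minimum", "Maximum", "Sum", "Average", "Samples")
UNITS = ("Seconds", "Percent", "Bytes", "Bits", "Count",
         "Bytes/Second", "Bits/Second", "Count/Second")


def parse_trigger(spec):
    """Parse 'Metric:Statistic:Unit:Upper:Lower' into a plain dict.

    The metric name is deliberately not validated, so any metric the
    monitoring service reports can be passed through unchanged.
    """
    parts = spec.split(":")
    if len(parts) != 5:
        raise ValueError(
            "expected Metric:Statistic:Unit:Upper:Lower, got %r" % spec)
    metric, statistic, unit, upper, lower = parts
    if statistic not in STATISTICS:
        raise ValueError("unknown statistic: %r" % statistic)
    if unit not in UNITS:
        raise ValueError("unknown unit: %r" % unit)
    # Thresholds are the only values the scripts interpret at all.
    upper, lower = float(upper), float(lower)
    if lower >= upper:
        raise ValueError("lower threshold must be below upper threshold")
    return {
        "metric": metric,
        "statistic": statistic,
        "unit": unit,
        "upper_threshold": upper,
        "lower_threshold": lower,
    }
```

For example, parse_trigger("CPUUtilization:Average:Percent:80:20") yields a
dict that a launch script could hand to whatever API call creates the trigger,
without the script needing to know what the metric means.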
This would be especially easy if we port our scripts to Python, as Hadoop has
done on trunk, and use boto: http://code.google.com/p/boto/.