VCL-169

Aaron Peeler Mon, 06 Jul 2009 12:13:12 -0700

I'm going to start working on VCL-169 Event Driven power down...


This is a first step of a larger power management feature.

In this step, I suggest extending the health_check.pl script to accept options for different data center events that would require the hardware to be shut down. The events are usually related to heat issues that are detected within the blade chassis or by other external thermal sensors.


The two primary events are

1) Shut down idle blades (phase 1)

I'm thinking the process is to pull all blades that are idle under the controlling management node, relocate any upcoming reservations that might reside on those blades, then proceed to shut down the blades.
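
A minimal sketch of the phase 1 flow described above, written in Python purely for illustration (health_check.pl itself is Perl). Every name here is hypothetical: the Blade record, the "spares" list of blades outside the affected area, and the least-loaded placement rule are assumptions, not existing VCL code. Setting the state to "off" stands in for whatever power-off call the provisioning module actually provides.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Blade:
    name: str
    state: str  # "idle", "inuse", or "off"
    upcoming: List[str] = field(default_factory=list)  # ids of future reservations


def shutdown_idle_blades(
    blades: List[Blade], spares: List[Blade]
) -> Tuple[List[str], List[str]]:
    """Power down idle blades after moving their upcoming reservations.

    Returns (names of blades powered off, reservation ids that could not be moved).
    """
    powered_off: List[str] = []
    unplaced: List[str] = []
    for blade in blades:
        if blade.state != "idle":
            continue  # in-use blades are left alone in phase 1
        for res_id in blade.upcoming:
            if spares:
                # Naive placement: hand the reservation to the least-loaded spare.
                target = min(spares, key=lambda s: len(s.upcoming))
                target.upcoming.append(res_id)
            else:
                unplaced.append(res_id)
        blade.upcoming = []
        blade.state = "off"  # stand-in for the real power-off call
        powered_off.append(blade.name)
    return powered_off, unplaced
```

Returning the unplaced reservation ids separately leaves it to the caller to decide whether to notify those users or retry later.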


2) Shut down blades currently in use (phase 2 - phase 1 did not do enough)

This second part would be triggered if and only if event 1 is not effective. It notifies the user running on the VCL resource about the unexpected data center problem and then starts a countdown to when the node will be shut down. Depending on the reservation type (long-term vs. short, or some other method), we'll need to address either reclaiming the blade or just shutting it down and retaining the reservation data by extending the end time. Then, once things are back to normal, vcld on startup will detect these previous reservations and start them back up, then notify the end user that the resource is available again.
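
The phase 2 decision could be sketched roughly as follows, again in Python for illustration only. The Reservation record, the "suspended" flag, the grace period, and the outage estimate are all hypothetical. In particular, the rule that long-term reservations are retained (end time extended) while short ones are reclaimed is an assumption; the proposal leaves that mapping open.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Tuple


@dataclass
class Reservation:
    user: str
    blade: str
    end: datetime
    long_term: bool
    suspended: bool = False


def plan_inuse_shutdown(
    res: Reservation, now: datetime, grace: timedelta, outage_estimate: timedelta
) -> Tuple[datetime, str, str]:
    """Decide what happens to one in-use reservation; returns (shutdown time, action, notice)."""
    shutdown_at = now + grace  # end of the countdown shown to the user
    notice = (
        f"{res.user}: data center problem detected, "
        f"{res.blade} will be shut down at {shutdown_at:%Y-%m-%d %H:%M}"
    )
    if res.long_term:
        # Assumed policy: keep the reservation data and push the end time out.
        res.end += outage_estimate
        res.suspended = True
        return shutdown_at, "suspend", notice
    # Assumed policy: short reservations simply give the blade back.
    return shutdown_at, "reclaim", notice


def resume_after_outage(reservations: List[Reservation]) -> List[str]:
    """On startup, restart suspended reservations and return the users to notify."""
    to_notify: List[str] = []
    for res in reservations:
        if res.suspended:
            res.suspended = False  # stand-in for actually reloading the blade
            to_notify.append(res.user)
    return to_notify
```

Keeping the suspended state on the reservation record itself is one way for vcld to find these reservations again at startup without a separate bookkeeping table.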

If there are any thoughts or other suggestions, please feel free to comment.


Aaron
