Box has added IPMI functionality right into Ganglia. Might be one way to start.
At the same time, I should really start working on the Drupal RPM so that we can create a package for the web portal :) Cheers, Bernard > -----Original Message----- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On Behalf Of > Steven A. DuChene > Sent: Tuesday, November 22, 2005 10:02 > To: OSCAR-devel List > Subject: Re: [Oscar-devel] Fwd: [OSCAR feeback] > > I know that it is possible to add some look&feel > customizations to ganglia > as this is what ROCKS does when they integrate it into their > disto. How about > some minor customization that takes the user to a reboot > node(s) page or > extra functionality portal page from the ganglia monitor pages? > -- > Steven A. DuChene > > -----Original Message----- > From: Jeff Squyres <[EMAIL PROTECTED]> > Sent: Nov 22, 2005 12:55 PM > To: OSCAR-devel List <[email protected]> > Subject: Re: [Oscar-devel] Fwd: [OSCAR feeback] > > On Nov 22, 2005, at 9:45 AM, Lombard, David N wrote: > > >>> 1. First is due to the nature of the disk boot model. There's no > > low > >>> level way to know why a node doesn't boot. If one doesn't come up > >>> then putting a monitor on it is required. I don't know what you > > would > >>> do about it. It's pretty low level. Finding a > third-party solution > >>> and bundling it in would be nice if possible. > > > > Some sort of remote console access is needed. Could be as > simple as a > > serial line with BIOS console redirect, one of those KVM-over-IP > > thingies, or SOL (Serial Over LAN, an IPMI 2.0 feature). > At the end of > > the day, hardware must provide a part of the answer. > > I agree. But are there 10% solutions that OSCAR can offer? I think > that's the real question here. Is there any kind of feedback -- > however rudimentary -- that the node can provide to give some kind of > indication about why it failed? Perhaps even the following messages: > > 1. I failed, but if you reboot me, it might work > 2. I failed, but if you reboot me, it should work > 3. I failed, and have no idea why. You need to attach a monitor and > find out. > > Something simple like that -- even if the predominant answer we can > give is #3, that could be helpful. > > >>> 2. The problem that can be resolved more easily is the > post-install > >>> configuration. Oscar is really good at the initial > install but try > >>> doing something like building the latest kernel from source and > >>> distributing it to the nodes. It's not easy. We have a lot of > > things > >>> that need to be installed from source. What I do is chroot to the > >>> image, make the changes and do a cpushimage. It works > but it's not > >>> oscar friendly. It would be nice if I could do something more in > > the > >>> GUI. I understand that updating the image is going to be text and > >>> manual but I don't believe the gui lets me push the updated image. > >>> Keep in mind that I don't want to reformat the disk every time. I > > may > >>> just want to push a single updated binary out. > > > > I don't know that this is "not oscar friendly" as we > provide the tools, > > just not the gui. > > Yes, I think that's what he meant. Very definitely a user > perspective > here; he doesn't know/care how OSCAR is implemented internally > (although he does use c3 and the other tools that OSCAR provides). > > > But, should be eminently doable from that spiffy new > > portal--well, it is directly doable from the current > portal's C3 tools > > page, but a more purpose-built variant to push/get an image would be > > useful. We need the localboot/install magic for PXEBOOT manageable > > there, too. > > I think that's the goal here -- it would be nice if some kind of gui > (even a web-based thingy) could do some of these common tasks > easily/trivially. Perhaps a shiny button "re-push image X out to the > relevant nodes." > > >>> 3. Also, ganglia is a nice monitor but it would be nice to be able > > to > >>> do things like reboot a node from the gui. I know it can > be done at > >>> the command line. > > > > I don't see this as a ganglia feature--as a portal feature, > this would > > be fine as would the above item. > > Agreed. I think this stems from the user perspective of "I > can see all > this stuff in Ganglia, but I can't *do* anything to/with it -- it's > just reporting. But it seems like a natural place to let me *do* > things as well." > > -- > {+} Jeff Squyres > {+} The Open MPI Project > {+} http://www.open-mpi.org/ > > > > ------------------------------------------------------- > This SF.Net email is sponsored by the JBoss Inc. Get Certified Today > Register for a JBoss Training Course. Free Certification Exam > for All Training Attendees Through End of 2005. For more info visit: > http://ads.osdn.com/?ad_id=7628&alloc_id=16845&op=click > _______________________________________________ > Oscar-devel mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/oscar-devel > > > > ------------------------------------------------------- > This SF.Net email is sponsored by the JBoss Inc. Get Certified Today > Register for a JBoss Training Course. Free Certification Exam > for All Training Attendees Through End of 2005. For more info visit: > http://ads.osdn.com/?ad_id=7628&alloc_id=16845&op=click > _______________________________________________ > Oscar-devel mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/oscar-devel > ------------------------------------------------------- This SF.Net email is sponsored by the JBoss Inc. Get Certified Today Register for a JBoss Training Course. Free Certification Exam for All Training Attendees Through End of 2005. For more info visit: http://ads.osdn.com/?ad_idv28&alloc_id845&op=click _______________________________________________ Oscar-devel mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/oscar-devel
