We are currently working on a nasty problem with our OSCAR cluster. I am running OSCAR 2.1 on RedHat 7.2 with a modified kernel to accommodate my e1000 gigabit cards and Mylex hardware RAID.
First, the problem. We recently received some new nodes for our Beowulf cluster and brought the cluster down to install them. We installed an additional network card in the switch and manually moved the old nodes to new positions in the rack. When we rebooted the cluster (master and old nodes), none of the old nodes booted. They all stop at a screen that says LIL-

I have removed the additional card from the switch and tried rebooting everything. No joy. I am able to ping the master node from the switch and vice versa.

Could you give me some ideas of where to look? Or do I need to completely reinstall? I'd rather avoid the reinstall because I do not know what caused the failure, and so would be susceptible to a recurrence.

Christopher D. Oubre
email: [EMAIL PROTECTED]
research: http://cmt.rice.edu/~coubre
Web: http://www.angelfire.com/la2/oubre
Hangout: http://pub44.ezboard.com/bsouthterrebonne
Phone: (713)348-3541 Fax: (713)348-4150
Rice University
Department of Physics, M.S. 61
6100 Main St.
Houston, Tx 77251-1892, USA
