Hi all,

Yesterday we had to reboot our cluster, and today one of the machines refuses to
boot. When GRUB kicks in, the screen just shows "GRUB " (without the quotes),
and stays there.

I have no idea what is happening (I don't have sufficient knowledge of GRUB),
but it is very strange. The golden client gets replicated to (as far as I can
tell) 15 identical machines. The last time I rebooted the cluster before this,
two machines (nodes 11 and 12) showed a similar problem. I then rebooted the
whole cluster again, and node 5, which had booted correctly on the first reboot,
joined the other two. Today it is node 13 that refuses to get past the GRUB
message. The other machines boot perfectly. One of the machines (I think it was
node 5) showed a different behaviour: instead of just showing GRUB and staying
there, it wrote GRUB continuously to the screen, filling it completely.

Any ideas what could be wrong? Last time I simply reinstalled the problematic
machines and all was well, but that's not a possibility for the future: I have
PVFS2 installed in the cluster, and losing the data in one of the nodes means
losing the data of the whole cluster...

Thanks in advance for any help with this.
Cheers,
Angel de Vicente
-- 
----------------------------------
http://www.iac.es/galeria/angelv/

PostDoc Software Support
Instituto de Astrofisica de Canarias
-- 
SIE Webpage
http://marta/SINFIN/index.html


_______________________________________________
Sisuite-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/sisuite-users
