I just got my cluster "working" with OSCAR 3.0 and RH 9.0 using 3com 3c2000T cards and 
a Cisco GigE switch.  When I installed OSCAR I had forgotten to take the machine name 
out of the alias list for 127.0.0.1 and put it on its own line where it belongs.  I 
fixed the hosts files on the head and all the nodes (not sure how to fix on the 
image).  Pushing files around works fine.

I am now having problems with running programs with lam-mpi on all my processors.  I 
ran NetPipe between several nodes without problem (it simple, only ever has two 
processes), but running the hpl benchmark on more than the head node and maybe one or 
two other processors locks up right away.

My question is mainly what effect having the hosts file slightly wrong would have on 
OSCAR's install process (it seemed to work ok, so it clearly wasn't fatal).  I changed 
a number of things at the same time so I am not sure which one of them is causing the 
lockup, so I thought I would check and see if any of you folks could think of obvious 
configuration problems it would cause.

Thanks again for all your help.



-------------------------------------------------------
SF.Net email is sponsored by Shop4tech.com-Lowest price on Blank Media
100pk Sonic DVD-R 4x for only $29 -100pk Sonic DVD+R for only $33
Save 50% off Retail on Ink & Toner - Free Shipping and Free Gift.
http://www.shop4tech.com/z/Inkjet_Cartridges/9_108_r285
_______________________________________________
Oscar-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to