I just got my cluster "working" with OSCAR 3.0 and RH 9.0, using 3Com 3c2000T cards and a Cisco GigE switch. When I installed OSCAR, I had forgotten to take the machine name out of the alias list for 127.0.0.1 and put it on its own line, where it belongs. I have since fixed the hosts files on the head and all the nodes (I am not sure how to fix it in the image). Pushing files around works fine.
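For anyone who wants to check their own nodes for the same hosts-file mistake, here is a minimal sketch in Python. The function name, the hostname "oscarhead", and the address 10.0.0.1 are made up for illustration; they are not taken from my cluster or from OSCAR.

```python
def hostname_on_loopback(hosts_text, hostname):
    """Return True if `hostname` is listed as a name for 127.0.0.1.

    `hosts_text` is the contents of a hosts file as a string.
    Both the full name and its short form (before the first dot)
    are compared, since either may appear in the alias list.
    """
    short = hostname.split(".", 1)[0]
    for raw in hosts_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        fields = line.split()
        addr, names = fields[0], fields[1:]
        if addr != "127.0.0.1":
            continue
        for name in names:
            if name == hostname or name.split(".", 1)[0] == short:
                return True
    return False


# Hypothetical "wrong" layout: machine name in the loopback alias list.
BAD_HOSTS = "127.0.0.1 oscarhead localhost.localdomain localhost\n"

# Hypothetical "fixed" layout: machine name on its own line.
GOOD_HOSTS = (
    "127.0.0.1 localhost.localdomain localhost\n"
    "10.0.0.1 oscarhead\n"
)
```

On a real node, the same function could be given the contents of /etc/hosts and the value of `socket.gethostname()`.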
I am now having problems running programs with LAM/MPI on all my processors. I ran NetPIPE between several nodes without problems (it is simple and only ever has two processes), but running the HPL benchmark on more than the head node and maybe one or two other processors locks up right away. My main question is what effect a slightly wrong hosts file would have had on OSCAR's install process (the install seemed to work OK, so it clearly wasn't fatal). I changed a number of things at the same time, so I am not sure which of them is causing the lockup; I thought I would check whether any of you could think of obvious configuration problems it would cause.
Thanks again for all your help.
