On Thu, 28 Apr 2005, Bernard Li wrote: > Hey Jeremy: > Issues with interfaces are usually to do with the /etc/hosts settings > of your headnode - so you might want to check that too.
on the master node /etc/hosts looks like this: 10.2.6.199 oscar-control.blah.com oscar-control oscar_server nfs_oscar pbs_oscar 172.21.184.192 oscar-control.blah.com oscar-control # These entries are managed by SIS, please don't modify them. 10.2.6.1 node1.blah.com node1 10.2.6.2 node2.blah.com node2 10.2.6.3 node3.blah.com node3 10.2.6.4 node4.blah.com node4 10.2.6.5 node5.blah.com node5 10.2.6.6 node6.blah.com node6 10.2.6.7 node7.blah.com node7 10.2.6.8 node8.blah.com node8 10.2.6.9 node9.blah.com node9 10.2.6.10 node10.blah.com node10 On a node its exactly the same. > But definitely check the logs, those should tell you the problem > (possibly also the Torque/PBS logs). Cheers, oscarinstall.log reveals nothing about the failure. THis is the output from the test and the oscarinstall.log test: Performing root tests... Torque node check [PASSED] Torque service check:pbs_server [PASSED] Maui service check:maui [PASSED] /home mounts [PASSED] Preparing user tests... Performing user tests... SSH ping test [PASSED] SSH server->node [PASSED] SSH node->server [PASSED] Torque default queue definition [PASSED] Torque Shell Test [PASSED] PVM (via Torque) [FAILED] Checking for 10 free nodes: [FAILED] Not enough free nodes. Tests incomplete. Checking for 10 free nodes: [FAILED] Not enough free nodes. Tests incomplete. Ganglia test [FAILED] There were issues running some user test scripts. Please check your logs located in /home/oscartst. Run APItests... Running Installation tests for pvm [PASS] 2005-04-29T09:54:53Z pvmd-path-ls.apt [PASS] 2005-04-29T09:54:53Z modulecmd-path-ls.apt [PASS] 2005-04-29T09:54:53Z pvm-module-list.apt [PASS] 2005-04-29T09:54:53Z pvm-module-show-pvm_rsh.apt [PASS] 2005-04-29T09:54:53Z pvm-module-show-pvm_arch.apt [PASS] 2005-04-29T09:54:54Z pvm-module-show-pvm_root.apt ERROR 4 REPORTED ABOVE. oscarinstall.log ============================================================================= == Running step 8 of the OSCAR wizard: Test cluster setup ============================================================================= --> Step 8: Running tests: cd /opt/oscar/testing && xterm -sl 500 -e ./test_cluster --wait --> Step 8: Not waiting for completion The *.err files in oscartst are zero length. Where would I find more log files? Thanks for your help. COuld it be that I'm missing a package or something for pvm. -jeremy > > Bernard > > ________________________________ > > From: Jeremy Hansen [mailto:[EMAIL PROTECTED] > Sent: Thu 28/04/2005 7:03 PM > To: Bernard Li > Cc: [email protected] > Subject: RE: [Oscar-users] Not enough free nodes. Tests incomplete. > > > > On Thu, 28 Apr 2005, Bernard Li wrote: > > > Did you run 'Complete Cluster Setup' before you run the tests? > > Yes, and this completed fine. > > > Have you checked the error messages (located in /home/oscartst)? > > Not yet. > > I remember this being a problem in the past and it has something to do > with the definition of the master node for maui or torque. Does this ring > any bells? I will report the error log tomorrow. > > > Do you have network connectivity to your compute nodes? > > Yes. All tests pass except for the ones I highlighted. > > I've noticed some strangeness with OSCAR in the previous version when I > set up the cluster with the "public" interface up and assigned. I didn't > see exactly the same behavior this time, but this problem is reminiscent of > something I've seen before. > > Thanks > -jeremy > > > Cheers, > > > > Bernard > > > > > -----Original Message----- > > > From: [EMAIL PROTECTED] > > > [mailto:[EMAIL PROTECTED] On Behalf Of > > > Jeremy Hansen > > > Sent: Thursday, April 28, 2005 16:34 > > > To: [email protected] > > > Subject: [Oscar-users] Not enough free nodes. Tests incomplete. > > > > > > > > > PVM (via Torque) > > > [FAILED] > > > Checking for 10 free nodes: > > > [FAILED] > > > Not enough free nodes. Tests incomplete. > > > LAM/MPI (via Torque) > > > [FAILED] > > > Ganglia test > > > [FAILED] > > > > > > Any clues on this? > > > > > > Thanks > > > -jeremy > > > > > > > > > > > > ------------------------------------------------------- > > > SF.Net email is sponsored by: Tell us your software development plans! > > > Take this survey and enter to win a one-year sub to > > > SourceForge.net Plus IDC's 2005 look-ahead and a copy of this > > > survey Click here to start! > > > http://www.idcswdc.com/cgi-bin/survey?id=105hix > > > _______________________________________________ > > > Oscar-users mailing list > > > [email protected] > > > https://lists.sourceforge.net/lists/listinfo/oscar-users > > > > > > > > > ------------------------------------------------------- This SF.Net email is sponsored by: NEC IT Guy Games. Get your fingers limbered up and give it your best shot. 4 great events, 4 opportunities to win big! Highest score wins.NEC IT Guy Games. Play to win an NEC 61 plasma display. Visit http://www.necitguy.com/?r=20 _______________________________________________ Oscar-users mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/oscar-users
