Hi Dave:

1) Have you tried re-running the tests?
2) Are you using PBS (which you d/led from OPD) or Torque (which came
with the tarball)?

Cheers,

Bernard 

> -----Original Message-----
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] On Behalf Of 
> Dave Thom
> Sent: Thursday, February 24, 2005 7:21
> To: [email protected]
> Subject: Re: [Oscar-users] No Free Nodes when testing
> 
> Hi All,
> 
> Bernard, thanks for the reply.
> I took the easy option and started over from scratch, and now 
> most things work, although I still get the following on 
> running the tests via the wizard
> 
> Performing root tests...
> PBS node check                                                
>  [PASSED]
> PBS service check:pbs_server                                  
>  [PASSED]
> Maui service check:maui                                       
>  [PASSED]
> /home mounts                                                  
>  [PASSED]
> 
> Preparing user tests...
> Performing user tests...
> SSH ping test                                                 
>  [PASSED]
> SSH server->node                                              
>  [PASSED]
> SSH node->server                                              
>  [PASSED]
> PVM (via PBS)                                                 
>  [PASSED]
> PBS default queue definition                                  
>  [PASSED]
> PBS Shell Test                                                
>  [PASSED]
> Ganglia test                                                  
>  [PASSED]
> LAM/MPI (via PBS)                                             
>  [FAILED]
> Checking for 5 free nodes:                                    
>  [FAILED]
> Not enough free nodes. Tests incomplete.
> 
> checking http://localhost/ganglia (screenshot attached) shows 
> all the hosts as up tail -f on the pbs log while running the 
> tests doesn't look like anything is wrong
> 
> 
> 
> 
> On Wednesday 23 Feb 2005 18:00, Bernard Li wrote:
> > Hi Dave:
> >
> > gmond and gmetad are part of Ganglia - can you check to see if your 
> > Ganglia graphs are coming up correctly (http://localhost/ganglia).  
> > Also worth a shot is taking a look at gmond.conf and gmetad.conf 
> > configuration files to see if anything is out of the ordinary.
> >
> > Having said that, this should not be related to the PBS/Torque 
> > problems you are encountering - have you tried re-running the tests?
> >
> > Cheers,
> >
> > Bernard
> >
> > > -----Original Message-----
> > > From: [EMAIL PROTECTED]
> > > [mailto:[EMAIL PROTECTED] On 
> Behalf Of Dave 
> > > Thom
> > > Sent: Wednesday, February 23, 2005 3:09
> > > To: [email protected]
> > > Subject: [Oscar-users] No Free Nodes when testing
> > >
> > > Hi all,
> > > I've installed OSCAR on 5 Dell GX150's (one server 4 clients) the 
> > > server has two NICs and the clients are all on a private subnet.
> > > there were a few issues with networking that I have sorted and 
> > > everything is almost working - the last issue comes when 
> running the 
> > > "test cluster setup"
> > > Server node is Fedora Core 2 workstation install and unpatched
> > >
> > >
> > > Performing root tests...
> > > Maui service check:maui
> > >  [PASSED]
> > > PBS node check
> > >  [PASSED]
> > > PBS service check:pbs_server
> > >  [PASSED]
> > > /home mounts
> > >  [PASSED]
> > >
> > > Preparing user tests...
> > > Performing user tests...
> > > SSH ping test
> > >  [PASSED]
> > > SSH server->node
> > >  [PASSED]
> > > SSH node->server
> > >  [PASSED]
> > > Checking for 4 free nodes:
> > >  [FAILED]
> > > Not enough free nodes. Tests incomplete.
> > > Ganglia test
> > >  [PASSED]
> > > Checking for 4 free nodes:
> > >  [FAILED]
> > > Not enough free nodes. Tests incomplete.
> > > Checking for 4 free nodes:
> > >  [FAILED]
> > > Not enough free nodes. Tests incomplete.
> > > PBS default queue definition
> > >  [PASSED]
> > > Checking for 4 free nodes:
> > >  [FAILED]
> > > Not enough free nodes. Tests incomplete.
> > > There were issues running some user test scripts.  Please 
> check your 
> > > logs
> > >
> > > on tailing /var/log/messages on the server I see continual
> > >
> > > Feb 23 10:47:12 cluster0 /usr/sbin/gmond[2246]: 
> server_thread() Host 
> > > xxx.xxx.xxx.xxx tried to connect and was refused Feb 23 10:47:12 
> > > cluster0 /usr/sbin/gmetad[1842]: Process XML (MATHS Cluster):
> > > XML_ParseBuffer() error at line 1: no element found
> > >
> > > Are the two related?, any hints on getting gmond running 
> properly, 
> > > or fixing the no free node error?
> > >
> > > many thanks for your time
> > >
> > > --
> > > Dave Thom
> > > IT Support
> > > Mathematics and Statistics, GU
> > > e: [EMAIL PROTECTED]
> > > t: 0141 330 3521
> > > f: 0141 330 4111
> >
> > -------------------------------------------------------
> > SF email is sponsored by - The IT Product Guide Read honest 
> & candid 
> > reviews on hundreds of IT Products from real users.
> > Discover which products truly live up to the hype. Start 
> reading now.
> > http://ads.osdn.com/?ad_ide95&alloc_id396&op=Click
> > _______________________________________________
> > Oscar-users mailing list
> > [email protected]
> > https://lists.sourceforge.net/lists/listinfo/oscar-users
> 
> --
> Dave Thom
> IT Support
> Mathematics and Statistics, GU
> e: [EMAIL PROTECTED]
> t: 0141 330 3521
> f: 0141 330 4111
> 


-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_ide95&alloc_id396&op=click
_______________________________________________
Oscar-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to