Hi Michael,

Wow, thanks for the quick response. =) Yeah, please don't break your production cluster just to check this out. I will look at the installer again to see if I see that check box you're talking about. It actually sounds really familiar. I installed an OSCAR cluster about 3 years ago and I think I remember seeing that then but I can't seem to find it now. I did try defining the master nodes private interface as node0.cluster using the install_server GUI with horrific results. Yeah, I won't try that again. It completely broke my access to the oda database so I couldn't back out of the installation or fix it. I spent quite a bit of time trying differen things like granting access to the [EMAIL PROTECTED] user but it still wouldn't work so I finally just threw my hands up and rebuilt the system. =P

As for just geting mpich to work on its own, it appears that it is configured correctly after the OSCAR installation so that isn't really a huge problem. Well, except for the fact that it is only seeing one processor for use. If you have any idea why that is happening please let me know. I would really like to have all the management stuff working though which is why I'm still pushing to try and get the xpbs stuff setup correctly. If you have any other ideas for me, that would be great. Thanks Michael.

-Jenny  =)
At 12:33 PM -0400 9/3/04, Michael Edwards wrote:
I am fairly sure there is a radio box somewhere in the install process
that says "Use head node to compute".  I will try and see if I can
find it if I can find a computer I don't mind breaking...  Don't want
to mess with my production cluster, at least one of the installer
steps does things even if you hit cancel :)

If you just need to use lam (or I assume mpich, never tried installing
that one), it is quite easy to install stand alone and configure to
run on one computer.  OSCAR is mainly handy for installing the
scheduler and resource managers.  Then you could go back and start
over with OSCAR once you get some compute nodes.  Just a thought for a
quick fix.

Unless there is something else in OSCAR you would need for
development, I can't think of anything off the top of my head though.
I could walk you through using lam on just one node if you need a
hand, I have done it before while testing things.

On Fri, 3 Sep 2004 09:05:44 -0700, Jenny Aquilino <[EMAIL PROTECTED]> wrote:
 Hi,

 I was hoping that someone out there might be able to help me out with
 this question.  I am currently working on a rather strange
 cluster...it's only one node.  I know that sounds strange but there
 is a reason for it.  The user only has one system to start out with
 but would like to add more nodes very soon.  The problem is that he
 can't wait for the other nodes to come on-line before he starts
 developing on it so I need to make this one node cluster functional
 for him.

 I saw the FAQ on the OSCAR sourceforge website that says to modify
 the /var/spool/pbs/server_priv/nodes file and then restart the
 pbs_mom, pbs_server in order to make the master a node so I did that.
 When I run "pbsnodes -a" it does reflect my one node.  However, when
 I run xpbs to see how many processors are allocated to my server, it
 reflects 0.  It also shows the workq as not having access to any
 processors.  I added a print statement to one of the mpich example
 programs to print out the number of cpus recognized by the
 "MPI_Comm_size" command and it returned 1 when in fact my system has
 2 processors.  Does anyone know why this might be?  Is one of the
 processors on the master node always reserved?  More generally, is
 there a set of steps anyone could recommend to be able to get the
 master node setup like a client as far as the monitoring and
 scheduling tools go?  I did go into qmgr and by hand can enter the
 information but it seems that when other nodes get built, that
> information is automatically populated into the tables and I would
 really like it if I could get the master to populate this information
 on itself when it is setup to also be a compute node.

 I hope this e-mail makes sense.  I'm a little crazy after several
 nights of tossing and turning trying to figure out why things aren't
 working the way I would expect them to.  Thanks in advance for any
 help you can offer.  =)

 -Jenny Aquilino  =)

 -------------------------------------------------------
 This SF.Net email is sponsored by BEA Weblogic Workshop
 FREE Java Enterprise J2EE developer tools!
 Get your free copy of BEA WebLogic Workshop 8.1 today.
 http://ads.osdn.com/?ad_id=5047&alloc_id=10808&op=click
 _______________________________________________
 Oscar-users mailing list
 [EMAIL PROTECTED]
 https://lists.sourceforge.net/lists/listinfo/oscar-users




------------------------------------------------------- This SF.Net email is sponsored by BEA Weblogic Workshop FREE Java Enterprise J2EE developer tools! Get your free copy of BEA WebLogic Workshop 8.1 today. http://ads.osdn.com/?ad_id=5047&alloc_id=10808&op=click _______________________________________________ Oscar-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to