Hi Michael,
Wow, thanks for the quick response. =) Yeah, please don't break your production cluster just to check this out. I will look at the installer again to see if I see that check box you're talking about. It actually sounds really familiar. I installed an OSCAR cluster about 3 years ago and I think I remember seeing that then but I can't seem to find it now. I did try defining the master nodes private interface as node0.cluster using the install_server GUI with horrific results. Yeah, I won't try that again. It completely broke my access to the oda database so I couldn't back out of the installation or fix it. I spent quite a bit of time trying differen things like granting access to the [EMAIL PROTECTED] user but it still wouldn't work so I finally just threw my hands up and rebuilt the system. =P
As for just geting mpich to work on its own, it appears that it is configured correctly after the OSCAR installation so that isn't really a huge problem. Well, except for the fact that it is only seeing one processor for use. If you have any idea why that is happening please let me know. I would really like to have all the management stuff working though which is why I'm still pushing to try and get the xpbs stuff setup correctly. If you have any other ideas for me, that would be great. Thanks Michael.
-Jenny =) At 12:33 PM -0400 9/3/04, Michael Edwards wrote:
I am fairly sure there is a radio box somewhere in the install process that says "Use head node to compute". I will try and see if I can find it if I can find a computer I don't mind breaking... Don't want to mess with my production cluster, at least one of the installer steps does things even if you hit cancel :)
If you just need to use lam (or I assume mpich, never tried installing that one), it is quite easy to install stand alone and configure to run on one computer. OSCAR is mainly handy for installing the scheduler and resource managers. Then you could go back and start over with OSCAR once you get some compute nodes. Just a thought for a quick fix.
Unless there is something else in OSCAR you would need for development, I can't think of anything off the top of my head though. I could walk you through using lam on just one node if you need a hand, I have done it before while testing things.
On Fri, 3 Sep 2004 09:05:44 -0700, Jenny Aquilino <[EMAIL PROTECTED]> wrote:> information is automatically populated into the tables and I wouldHi,
I was hoping that someone out there might be able to help me out with this question. I am currently working on a rather strange cluster...it's only one node. I know that sounds strange but there is a reason for it. The user only has one system to start out with but would like to add more nodes very soon. The problem is that he can't wait for the other nodes to come on-line before he starts developing on it so I need to make this one node cluster functional for him.
I saw the FAQ on the OSCAR sourceforge website that says to modify the /var/spool/pbs/server_priv/nodes file and then restart the pbs_mom, pbs_server in order to make the master a node so I did that. When I run "pbsnodes -a" it does reflect my one node. However, when I run xpbs to see how many processors are allocated to my server, it reflects 0. It also shows the workq as not having access to any processors. I added a print statement to one of the mpich example programs to print out the number of cpus recognized by the "MPI_Comm_size" command and it returned 1 when in fact my system has 2 processors. Does anyone know why this might be? Is one of the processors on the master node always reserved? More generally, is there a set of steps anyone could recommend to be able to get the master node setup like a client as far as the monitoring and scheduling tools go? I did go into qmgr and by hand can enter the information but it seems that when other nodes get built, thatreally like it if I could get the master to populate this information on itself when it is setup to also be a compute node.
I hope this e-mail makes sense. I'm a little crazy after several nights of tossing and turning trying to figure out why things aren't working the way I would expect them to. Thanks in advance for any help you can offer. =)
-Jenny Aquilino =)
------------------------------------------------------- This SF.Net email is sponsored by BEA Weblogic Workshop FREE Java Enterprise J2EE developer tools! Get your free copy of BEA WebLogic Workshop 8.1 today. http://ads.osdn.com/?ad_id=5047&alloc_id=10808&op=click _______________________________________________ Oscar-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/oscar-users
------------------------------------------------------- This SF.Net email is sponsored by BEA Weblogic Workshop FREE Java Enterprise J2EE developer tools! Get your free copy of BEA WebLogic Workshop 8.1 today. http://ads.osdn.com/?ad_id=5047&alloc_id=10808&op=click _______________________________________________ Oscar-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/oscar-users
