r5.4.3 come with oge6.2u5p2 in sge roll
if you want to  use different version of ge e.g. ge2011.11
just make sure these two version doesnot use the same SGE*_PORT
or,
IIRC you can just replace sge_execd with the newer one
regards

SGE_CELL=default; export SGE_CELL
SGE_CLUSTER_NAME=r53; export SGE_CLUSTER_NAME
SGE_QMASTER_PORT=536; export SGE_QMASTER_PORT
SGE_EXECD_PORT=537; export SGE_EXECD_PORT

On 8/1/2012 10:08 PM, Joseph Farran wrote:
Thanks Simon and Rayson.

That was pretty much what I was doing. On my new freshly installed node, I placed a copy of my sgeexecd.HPC in /etc/init.d, chkconfig to make sure it starts up on next boot, created /var/spool/oge and starting oge would then create the "compute-x-x" directory and files in /var/spool/oge.

The above worked just fine, except that I either found a bug or I am missing something else.


If I install OGE on a compute node using start_gui_installer and I then run:

$  qrsh -q sf "module list; hostname"
No Modulefiles Currently Loaded.
compute-1-1.local

All is well and my environment is correctly set. However, if on compute-1-1 I then stop OGE ( service sgeexecd.HPC stop ) and re-start it again, part of my environment setup is hosed:

When I re-run the same qrsh command after stopping and starting sge service on compute-1-1, I now get:

$  qrsh -q sf "module list; hostname"
bash: module: command not found
compute-1-1.local


So I thought that it was not as simple as dropping a copy of sgeexecd.HPC and creating the directory, etc, and starting the service.

I can duplicate the above with the same results.

Joseph



On 08/01/2012 06:38 PM, Rayson Ho wrote:
Hi Joseph,

If you want to add a node from scratch, then you can try
"install_execd" - it should create all the needed files& the needed
queues, etc for the node.

If the qmaster already has the queues defined, and everything is the
same (node name, etc) except that the node's filesystem is gone, then
it should be real easy. The execd should automatically re-create the
local spool dir and the sub-directories. And as mentioned by Simon,
you can copy the init script by hand.

Rayson



On Wed, Aug 1, 2012 at 9:12 PM, Joseph Farran<[email protected]>  wrote:
Hi.

I originally ran "start_gui_installer" which is a great and easy gui toll to
add compute nodes.

What is the proper way to re-add nodes but from the command line? I am running Rocks 5.4.3 and when a node is re-imaged all is gone, so is there an
easy way to re-add the node via command line?

Joseph

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

--

<<attachment: laotsao.vcf>>

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to