My OGE software resides on a shared NFS directory /data/hpc/oge.

When I run the ./start_gui_installer script set OGE up with:

    Qmaster Spool:  /var/spool/oge/default/spool/qmaster
    Global execd:    /var/spool/oge/default/spool
    Spooling: classic

The head node installs correctly, but compute nodes installation fails.   The 
error for the compute nodes show:

   AILED: Task failed.

   OUTPUT:
   Your $SGE_ROOT directory: /data/hpc/oge
   Using cell: >default<
   Creating local configuration for host >compute-1-1.local<
   [email protected] added "compute-1-1.local" to configuration list
   Local configuration for host >compute-1-1.local< created.
   Adding submit host >compute-1-1<
   compute-1-1.local added to submit host list
   cp /data/hpc/oge/default/common/sgeexecd /etc/init.d/sgeexecd.HPC
   /usr/lib/lsb/install_initd /etc/init.d/sgeexecd.HPC
       starting sge_execd
   [email protected] modified "@allhosts" in host group list
   [email protected] modified "all.q" in cluster queue list
   got select error: Connection refused
   got select error: closing "compute-1-1.local/execd/1"
   Execd on host compute-1-1.local is not started!

   ERROR:
   Warning: untrusted X11 forwarding setup failed: xauth key data not generated
   Warning: No xauth data; using fake authentication data for X11 forwarding.
   TERM environment variable not set.


If I setup OGE with

    Qmaster Spool:  /var/spool/oge/default/spool/qmaster
    Global execd:    /data/hpc/oge/default/spool
    Spooling: classic

Using the NFS share directory for "Global execd", then everything works just 
fine - compute nodes are setup correctly.

What am I doing wrong?

Joseph


On 06/04/2012 02:51 PM, Reuti wrote:
Hi,

Am 04.06.2012 um 22:59 schrieb Joseph Farran:

When installing OGE with respect to the Spooling Configuration, one can select:

    Qmaster spool directory
    Global execd spool directory

I installed OGE from the head node on a shared NFS directory ( /data/oge ) and 
like to make the spooling to be on the head node /var file system while leaving 
oge executables in the NFS share directory.

Would the options be to change "Qmaster spool directory" to something like 
"/var/oge"
Yes, or /var/spool/oge.


and leave the "Global execd spool directory" as is which is the shared NFS 
directory?
Well, this could also be /var/spool/oge, then it would be local on each node.

http://arc.liv.ac.uk/SGE/howto/nfsreduce.html

-- Reuti

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to