My OGE software resides on a shared NFS directory /data/hpc/oge.
When I run the ./start_gui_installer script set OGE up with:
Qmaster Spool: /var/spool/oge/default/spool/qmaster
Global execd: /var/spool/oge/default/spool
Spooling: classic
The head node installs correctly, but compute nodes installation fails. The
error for the compute nodes show:
AILED: Task failed.
OUTPUT:
Your $SGE_ROOT directory: /data/hpc/oge
Using cell: >default<
Creating local configuration for host >compute-1-1.local<
[email protected] added "compute-1-1.local" to configuration list
Local configuration for host >compute-1-1.local< created.
Adding submit host >compute-1-1<
compute-1-1.local added to submit host list
cp /data/hpc/oge/default/common/sgeexecd /etc/init.d/sgeexecd.HPC
/usr/lib/lsb/install_initd /etc/init.d/sgeexecd.HPC
starting sge_execd
[email protected] modified "@allhosts" in host group list
[email protected] modified "all.q" in cluster queue list
got select error: Connection refused
got select error: closing "compute-1-1.local/execd/1"
Execd on host compute-1-1.local is not started!
ERROR:
Warning: untrusted X11 forwarding setup failed: xauth key data not generated
Warning: No xauth data; using fake authentication data for X11 forwarding.
TERM environment variable not set.
If I setup OGE with
Qmaster Spool: /var/spool/oge/default/spool/qmaster
Global execd: /data/hpc/oge/default/spool
Spooling: classic
Using the NFS share directory for "Global execd", then everything works just
fine - compute nodes are setup correctly.
What am I doing wrong?
Joseph
On 06/04/2012 02:51 PM, Reuti wrote:
Hi,
Am 04.06.2012 um 22:59 schrieb Joseph Farran:
When installing OGE with respect to the Spooling Configuration, one can select:
Qmaster spool directory
Global execd spool directory
I installed OGE from the head node on a shared NFS directory ( /data/oge ) and
like to make the spooling to be on the head node /var file system while leaving
oge executables in the NFS share directory.
Would the options be to change "Qmaster spool directory" to something like
"/var/oge"
Yes, or /var/spool/oge.
and leave the "Global execd spool directory" as is which is the shared NFS
directory?
Well, this could also be /var/spool/oge, then it would be local on each node.
http://arc.liv.ac.uk/SGE/howto/nfsreduce.html
-- Reuti
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users