Hello list,
Since about 3 weeks I try to make GE working (step by step, first the
master then client...)
I install it on a Debian (inux crosswords.ns42.fr 2.6.32-5-amd64 #1 SMP
Sun Sep 23 10:07:46 UTC 2012 x86_64 GNU/Linux) using the default packages:
ii gridengine-client 6.2u5-1squeeze1
Utilities for Grid Engine queue management
ii gridengine-common 6.2u5-1squeeze1
Distributed resource management - common files
ii gridengine-exec 6.2u5-1squeeze1
Distributed resource management - Execution Server
ii gridengine-master 6.2u5-1squeeze1
Distributed resource management - Master Server
ii gridengine-qmon 6.2u5-1squeeze1
Graphical utilities for Grid Engine queue management
I read that I need to populate several variables in
/etc/profile.d/gridengine.sh file
SGE_ROOT, is where the installation leave, in debian /usr/lib/gridengine
SGE_ARCH which should be $SGE_ROOT/util/arch. Those folders (util/arch)
does not exist. Did I have to create them?
I already create $SGE_ROOT/default/commont and link here
/etc/gridengine/bootstrap.
I can start gridengine-master (/etc/init.d/gridengine-master) and
gridengine-exec (/etc/init.d/gridengine-exec)
# ps aux | grep grid
root 4728 0.0 0.9 220152 10140 pts/0 Sl+ 07:27 0:00 vi
/etc/profile.d/gridengine.sh
sgeadmin 5751 0.0 0.5 127756 5288 ? Sl 07:36 0:00
/usr/lib/gridengine/sge_qmaster
sgeadmin 6108 0.0 0.1 50208 1596 ? Sl 08:08 0:00
/usr/lib/gridengine/sge_execd
root 6114 0.0 0.0 7548 836 pts/1 S+ 08:08 0:00 grep grid
When I try to launch something (qmon for exemple) it fail with this message:
error: fopen("/usr/lib/gridengine/default/common/act_qmaster") failed:
No such file or directory
error: unable to read qmaster name: can't open
"/usr/lib/gridengine/default/common/act_qmaster" for reading qmaster
hostname
error: fopen("/usr/lib/gridengine/default/common/act_qmaster") failed:
No such file or directory
unable to send message to qmaster using port 6444 on host
"/usr/lib/gridengine": got unexpected parameters
error: fopen("/usr/lib/gridengine/default/common/act_qmaster") failed:
No such file or directory
The only act_master file I can find leave in:
/var/lib/gridengine/default/common/act_qmaster and content crossword
(the machine hostname without domain).
I now that my question are ridiculous and confusing. If you have a good
tutorial that I can read, I will be very happy with that.
Thanks is advance,
Jacques Foucry
--
Jacques Foucry
*NOVΛSPARKS *
IT Manager
Tel : +33 (0)1 42 68 12 61
[email protected]
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users