Hello all,

Several weeks ago I asked you about my configuration. The conclusion was that the installation was too broken to repair and that I should start over with a fresh one.

That is what I did, but I still need your help.

I redid the installation, following this page: <http://wiki.unixh4cks.com/index.php/Setting_up_Sun_Grid_Engine_on_Ubuntu>


My first VM runs Debian 6.0.7 and will be the master node. The exec node is a CentOS 6.3 VM.

Everything works until the "Configuration" step:

# sudo -u sgeadmin qconf -am jacques
error: commlib error: got select error (Connection refused)
ERROR: unable to send message to qmaster using port 6444 on host "sge0": got send error

It looks like qmaster is not running:
# ps aux | grep qmaster
root 2218 0.0 0.0 7548 832 pts/0 S+ 17:32 0:00 grep qmaster
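
The only match in that listing is the grep command itself. As a minimal sketch of a check that ignores it (the function name qmaster_in_listing is my own, not part of Grid Engine):

```shell
# Succeeds only if a process listing read from stdin mentions qmaster
# on a line that is not the grep command itself.
qmaster_in_listing() {
    grep -v 'grep' | grep -q 'qmaster'
}
```

It would be used as: ps aux | qmaster_in_listing && echo "qmaster is running"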

So I try to launch it:
# /etc/init.d/gridengine-master start

It does not start, and I found this in /var/spool/gridengine/qmaster/messages:

03/01/2013 17:33:04| main|sge0|W|local configuration sge0 not defined - using global configuration
03/01/2013 17:33:04|  main|sge0|E|global configuration not defined
03/01/2013 17:33:04|  main|sge0|C|setup failed
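
To pull only the serious entries out of that file, a small filter along these lines could be used. It is a sketch that assumes the fourth "|"-separated field is the severity letter, as in the lines above (E and C here, which I take to mean error and critical); the function name sge_errors is my own.

```shell
# Print only the lines whose severity field (4th "|"-separated field)
# is E or C. The file defaults to the path quoted above.
sge_errors() {
    awk -F'|' '$4 == "E" || $4 == "C"' "${1:-/var/spool/gridengine/qmaster/messages}"
}
```

Run against the three lines above, it keeps the last two and drops the W line.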

If I understand correctly, I cannot configure qmaster because it is not running, and it cannot run because it is not configured.

My /etc/hosts:
127.0.0.1       sge0
#127.0.1.1      localhost.localdomain   localhost
192.168.77.111  sge0
192.168.77.113  sge1

# The following lines are desirable for IPv6 capable hosts
::1     ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
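
Note that sge0 appears on two lines of this file. To list every address a hosts-format file assigns to a name, something like the following could be used. It is only a sketch; the function name hosts_addrs is my own, and I do not know whether the double entry matters to qmaster.

```shell
# Print every address that a hosts-format file maps to the given name.
# $1 = host name, $2 = file to read (defaults to /etc/hosts)
hosts_addrs() {
    awk -v name="$1" '
        { sub(/#.*/, "") }    # drop comments, including commented-out lines
        { for (i = 2; i <= NF; i++) if ($i == name) print $1 }
    ' "${2:-/etc/hosts}"
}
```

With the file above, hosts_addrs sge0 prints 127.0.0.1 and 192.168.77.111, while hosts_addrs sge1 prints only 192.168.77.113.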

I also added sge0 and sge1 to my local DNS, and both names resolve.

What can I do? Where can I look to find a solution?

Thanks in advance for your help.

--
Jacques Foucry
*NOVΛSPARKS *
IT Manager
Tel : +33 (0)1 42 68 12 61