On 07.07.2011 at 19:44, Carlos Scaloni wrote:

> It's a virtual machine. I want to install the qmaster and the execd on the 
> same machine. 192.168.56.101 is connected with the outside world!
> 
> ifconfig
> eth0      Link encap:Ethernet  HWaddr 08:00:27:F3:80:43  
>           inet addr:10.0.2.15  Bcast:10.0.2.255  Mask:255.255.255.0
>           inet6 addr: fe80::a00:27ff:fef3:8043/64 Scope:Link
>           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>           RX packets:1635 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:1506 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:1000 
>           RX bytes:1390858 (1.3 MiB)  TX bytes:104832 (102.3 KiB)

Okay, so this is the main interface and the one which gives the name under 
which the qmaster can be accessed. But the log line below shows it trying to 
access it under "|  main|proyecto-192|C|". There are some tools in 
$SGE_ROOT/utilbin/lx24-amd64:

$ ./gethostname -all

and then:

$ ./gethostbyname <name>
$ ./gethostbyaddr <addr>

with these names and addresses. Is there any firewall installed that blocks 
traffic to the machine itself? We need to resolve why it is running under eth1. 
There is a file to map hostnames to interfaces and make an alias to them, but I 
think in your case we have to look elsewhere, as you want to run it on the main 
interface. A plain:

$ hostname

gives you the name from eth0?
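
As a rough cross-check of the lookups above, here is a minimal Python sketch. It is not part of SGE; the function names `parse_hosts` and `round_trip` are made up for illustration. `parse_hosts` only reads /etc/hosts-style text (the content quoted further down in this thread, with the wrapped localhost line joined), and `round_trip` does a forward and then a reverse lookup through the system resolver, roughly what gethostbyname followed by gethostbyaddr would show. The real resolver may also consult DNS or other sources, so treat the file check as a hint only.

```python
import socket


def parse_hosts(text):
    """Map every name in /etc/hosts-style text to its address (first entry wins)."""
    table = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        fields = line.split()
        address, names = fields[0], fields[1:]
        for name in names:
            table.setdefault(name, address)
    return table


def round_trip(name):
    """Forward lookup, then reverse lookup of the result, via the system resolver."""
    address = socket.gethostbyname(name)
    canonical, aliases, _ = socket.gethostbyaddr(address)
    return address, canonical, aliases


# /etc/hosts as quoted in this thread (assumption: nothing else is in the file).
HOSTS = """\
127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1         localhost
192.168.56.101 proyecto-192.local
10.0.2.15 proyecto-10.local
"""

table = parse_hosts(HOSTS)
```

With this input, `table` has entries for "proyecto-192.local" and "proyecto-10.local", but none for the short name "proyecto-192" that shows up in the log line. Whether that matters depends on how the resolver on the machine is set up; calling `round_trip(socket.gethostname())` on the machine itself would show which address and name the system actually returns.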

-- Reuti


> eth1      Link encap:Ethernet  HWaddr 08:00:27:14:C4:0C  
>           inet addr:192.168.56.101  Bcast:192.168.56.255  Mask:255.255.255.0
>           inet6 addr: fe80::a00:27ff:fe14:c40c/64 Scope:Link
>           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>           RX packets:1805 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:1152 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:1000 
>           RX bytes:162007 (158.2 KiB)  TX bytes:186807 (182.4 KiB)
> 
> lo        Link encap:Local Loopback  
>           inet addr:127.0.0.1  Mask:255.0.0.0
>           inet6 addr: ::1/128 Scope:Host
>           UP LOOPBACK RUNNING  MTU:16436  Metric:1
>           RX packets:92 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:92 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:0 
>           RX bytes:4600 (4.4 KiB)  TX bytes:4600 (4.4 KiB)
> 
> 
> 2011/7/7 Reuti <[email protected]>
> On 07.07.2011 at 19:27, Carlos Scaloni wrote:
> 
> > Hi, thanks for answering!
> >
> > I have a file in /tmp called sge_messages with this content:
> >
> > 07/07/2011 18:59:41|  main|proyecto-192|C|abort qmaster startup due to 
> > communication errors
> 
> Well, you listed the complete /etc/hosts, i.e. there is no entry for 
> 127.0.0.2 (having no such entry is good)?
> 
> What is the primary interface in the master node? As I see two entries I 
> assume you have at least two network interfaces, and one of them is connected 
> to the outside world, the other to the nodes. Maybe it's addressing the 
> cluster on the wrong one.
> 
> -- Reuti
> 
> 
> > 2011/7/7 Reuti <[email protected]>
> > Hi,
> >
> > On 07.07.2011 at 19:11, Carlos Scaloni wrote:
> >
> > > Hi friends! I can't install SGE, I need your help, please. Thanks a lot 
> > > in advance!
> > >
> > > Options I chose:
> > >
> > > admin user is sgeadmin
> > > set network ports with environment
> > > sge_qmaster port 6444
> > > sge_execd port 6445
> > > say no to pkgadd and verify permissions
> > > classic spooling, not berkeley db
> > > gid range 20000-21000
> > > enter list of execution hosts node01 thru node##
> >
> > did you get any error log in /tmp?
> >
> > -- Reuti
> >
> >
> > > Error:
> > >
> > > Grid Engine qmaster startup
> > > ---------------------------
> > >
> > > Starting qmaster daemon. Please wait ...
> > >    starting sge_qmaster
> > >
> > > sge_qmaster start problem
> > >
> > > sge_qmaster didn't start!
> > > sge_qmaster start problem
> > >
> > > cat /etc/hosts
> > >
> > > 127.0.0.1   localhost localhost.localdomain localhost4 
> > > localhost4.localdomain4
> > > ::1         localhost
> > > 192.168.56.101 proyecto-192.local
> > > 10.0.2.15 proyecto-10.local
> > >
> > >
> > > _______________________________________________
> > > users mailing list
> > > [email protected]
> > > https://gridengine.org/mailman/listinfo/users
> >
> >
> >
> 
> 


