Re: [Linux-HA] glib: ucast: error binding socket. Retrying: Address already in use

Dejan Muhamedagic Wed, 10 Jun 2009 05:08:53 -0700

Hi,

On Wed, Jun 10, 2009 at 01:17:19PM +0200, jeroen groenewegen van der weyden 
wrote:
> no I did not try lsof  or fuser (next time I will). But shouldn-t 
> netstat show the process also.


Yes, netstat should show connections, but with lsof you'll see
which processes are holding them.

> further it would be strange for an other 
> proces to keep this port. "randomly" after a reboot it should occupy the 
> same again, shouldn-t it?

Yes. Though there are processes which get dynamically assigned
ports from portmapper (yellow pages and similar).

Thanks,

Dejan

> 
> regards
> 
> jeroen
> 
> Dejan Muhamedagic wrote:
> > Hi,
> >
> > On Wed, Jun 10, 2009 at 12:20:14PM +0200, jeroen groenewegen van der weyden 
> > wrote:
> >   
> >> Hi everybody,
> >>
> >> I just experienced a strange behavior, after rebooting our server manual 
> >> the heart beat came not into service after the reboot. The message log 
> >> show Retrying already in use? but in netstat nothing shows up on port 
> >>     
> >
> > Did you try lsof or fuser?
> >
> >   
> >> 694? The nodes were able to see each other. On both nodes services were 
> >> connecting using the same link (br0).
> >>
> >> A heartbeart stop/start did not help and resulted in the same log messages
> >> After the a second reboot the phenomenon was gone
> >>
> >> heartbeat V2.99.2
> >> openSUSE 11.1
> >>
> >> Anybody seen this before? or know the cause of it?
> >>     
> >
> > No. The only explanation I can imagine is that another process is
> > using this port.
> >
> > Thanks,
> >
> > Dejan
> >
> >   
> >> best regards
> >>
> >> jeroen
> >>
> >>  ====== log =========
> >> ClusterNode1:/ # tail /var/log/messages
> >> Jun 10 12:00:08 ClusterNode1 heartbeat: [5315]: ERROR: glib: ucast: 
> >> error binding socket. Retrying: Address already in use
> >> Jun 10 12:00:09 ClusterNode1 heartbeat: [5315]: ERROR: glib: ucast: 
> >> error binding socket. Retrying: Address already in use
> >> Jun 10 12:00:10 ClusterNode1 heartbeat: [5315]: ERROR: glib: ucast: 
> >> error binding socket. Retrying: Address already in use
> >> Jun 10 12:00:11 ClusterNode1 heartbeat: [5315]: ERROR: glib: ucast: 
> >> error binding socket. Retrying: Address already in use
> >> Jun 10 12:00:12 ClusterNode1 heartbeat: [5315]: ERROR: glib: ucast: 
> >> error binding socket. Retrying: Address already in use
> >> Jun 10 12:00:13 ClusterNode1 heartbeat: [5315]: ERROR: glib: ucast: 
> >> unable to bind socket. Giving up: Address already in use
> >> Jun 10 12:00:13 ClusterNode1 heartbeat: [5315]: ERROR: 
> >> make_io_childpair: cannot open ucast br0
> >> Jun 10 12:00:14 ClusterNode1 heartbeat: [5317]: CRIT: Emergency 
> >> Shutdown: Master Control process died.
> >> Jun 10 12:00:14 ClusterNode1 heartbeat: [5317]: CRIT: Killing pid 5315 
> >> with SIGTERM
> >> Jun 10 12:00:14 ClusterNode1 heartbeat: [5317]: CRIT: Emergency 
> >> Shutdown(MCP dead): Killing ourselves.
> >>
> >>
> >> ========= netstat -ntlp ============
> >>
> >> ClusterNode1:/ # netstat -ntlp
> >> Active Internet connections (only servers)
> >> Proto Recv-Q Send-Q Local Address           Foreign Address         
> >> State       PID/Program name
> >> tcp        0      0 0.0.0.0:5801            0.0.0.0:*               
> >> LISTEN      4039/xinetd
> >> tcp        0      0 0.0.0.0:5901            0.0.0.0:*               
> >> LISTEN      4039/xinetd
> >> tcp        0      0 0.0.0.0:111             0.0.0.0:*               
> >> LISTEN      3063/rpcbind
> >> tcp        0      0 0.0.0.0:6004            0.0.0.0:*               
> >> LISTEN      4823/Xvnc
> >> tcp        0      0 0.0.0.0:22              0.0.0.0:*               
> >> LISTEN      3907/sshd
> >> tcp        0      0 127.0.0.1:631           0.0.0.0:*               
> >> LISTEN      3841/cupsd
> >> tcp        0      0 127.0.0.1:25            0.0.0.0:*               
> >> LISTEN      3868/master
> >> tcp        0      0 :::111                  :::*                    
> >> LISTEN      3063/rpcbind
> >> tcp        0      0 :::6004                 :::*                    
> >> LISTEN      4823/Xvnc
> >> tcp        0      0 :::22                   :::*                    
> >> LISTEN      3907/sshd
> >>
> >>
> >> ======= ha.cf ==========
> >>
> >> use_logd yes
> >> ucast br0 192.168.1.1
> >> ucast br0 192.168.1.2
> >> ucast br1 172.27.74.136
> >> ucast br1 172.27.74.137
> >> #serial /dev/ttyS0
> >> node ClusterNode1
> >> node ClusterNode2
> >> respawn root /usr/lib64/heartbeat/hbagent
> >> apiauth mgmtd uid=root
> >> respawn root /usr/lib64/heartbeat/mgmtd -v
> >> crm on
> >>
> >> _______________________________________________
> >> Linux-HA mailing list
> >> [email protected]
> >> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> >> See also: http://linux-ha.org/ReportingProblems
> >>     
> > _______________________________________________
> > Linux-HA mailing list
> > [email protected]
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> > ------------------------------------------------------------------------
> >
> >
> > No virus found in this incoming message.
> > Checked by AVG - www.avg.com 
> > Version: 8.5.339 / Virus Database: 270.12.60/2166 - Release Date: 06/09/09 
> > 18:08:00
> >
> >   
> 
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] glib: ucast: error binding socket. Retrying: Address already in use

Reply via email to