A "problem" I am aware of is that a died peer (or connection dropped an interim firewall) is not detected as broken, because no messages are exchanged any longer. An often-used solution is KEEPALIVE, but this can also take some time to timeout (and may have bad effects on slow connection or those with outages of interim systems). I know that I wanted to implement the capability to activate KEEPALIVE, but I am not sure if I found time to actually do it. Will let you know once I can check that.
Rainer > -----Original Message----- > From: [email protected] [mailto:rsyslog- > [email protected]] On Behalf Of Kenneth Marshall > Sent: Monday, January 11, 2010 2:52 PM > To: rsyslog-users > Subject: Re: [rsyslog] rsyslog hang with imklog (/dev/log) on a RHEL5 > machine > > It does seem to act like the RELP problem, but my use is only > with a regular TCP connection using @@logmachine. It had the > same symptom and restarting rsyslog cleared it up. > > Regards, > Ken > > On Mon, Jan 11, 2010 at 12:15:55PM +0100, Rainer Gerhards wrote: > > I think there is a patch (or a recommendation) regarding RELP in my > mail > > backlog. If I got it right, RELP does not necessarily detect a broken > > connection, and thus no recovery action is initiated. I'll try to get > to this > > ASAP, but I am now the second day in office and there is still a pile > of > > things I need to look into ... > > > > Rainer > > > > > -----Original Message----- > > > From: [email protected] [mailto:rsyslog- > > > [email protected]] On Behalf Of [email protected] > > > Sent: Tuesday, January 05, 2010 10:13 PM > > > To: rsyslog-users > > > Cc: [email protected] > > > Subject: Re: [rsyslog] rsyslog hang with imklog (/dev/log) on a > RHEL5 > > > machine > > > > > > this sounds like rsyslog is failing to send the logs out to the > RELP > > > server, and so is building up a large queue. restarting rsyslog > would > > > clear the queued up log messages and make it fast again. > > > > > > David Lang > > > > > > > > > On Tue, 5 Jan 2010, Kenneth Marshall wrote: > > > > > > > Date: Tue, 5 Jan 2010 13:53:49 -0600 > > > > From: Kenneth Marshall <[email protected]> > > > > Reply-To: rsyslog-users <[email protected]> > > > > To: [email protected] > > > > Cc: [email protected] > > > > Subject: [rsyslog] rsyslog hang with imklog (/dev/log) on a RHEL5 > > > machine > > > > > > > > I am running rsyslog version 4.2.0 on a Redhat 5 machine > > > > and noticed slow logins to the box. The strace on the login > > > > sshd shows the following: > > > > > > > > 9937 0.000045 socket(PF_FILE, SOCK_DGRAM, 0) = 4 > > > > 9937 0.000025 fcntl(4, F_SETFD, FD_CLOEXEC) = 0 > > > > 9937 0.000019 connect(4, {sa_family=AF_FILE, > > > path="/dev/log"...}, 110) = 0 > > > > 9937 0.000040 sendto(4, "<86>Jan 5 13:36:12 sshd[9937]: > "..., > > > 90, MSG_NOSIGNAL, NULL, 0) = ? ERESTARTSYS (To be restarted) > > > > 9937 0.000042 --- SIGCHLD (Child exited) @ 0 (0) --- > > > > 9937 0.000018 sendto(4, "<86>Jan 5 13:36:12 sshd[9937]: > "..., > > > 90, MSG_NOSIGNAL, NULL, 0 <unfinished ...> > > > > 5095 7.001495 <... select resumed> ) = ? ERESTARTNOHAND (To > be > > > restarted) > > > > 5095 0.000040 --- SIGCHLD (Child exited) @ 0 (0) --- > > > > 5095 0.000025 wait4(-1, [{WIFEXITED(s) && WEXITSTATUS(s) == > > > 0}], WNOHANG, NULL) = 9844 > > > > 5095 0.000055 wait4(-1, 0x7fffbf6d198c, WNOHANG, NULL) = 0 > > > > 5095 0.000021 rt_sigaction(SIGCHLD, NULL, {0x2ad5c3ab2740, > [], > > > SA_RESTORER, 0x2ad5c65922d0}, 8) = 0 > > > > 5095 0.000028 rt_sigreturn(0x11) = -1 EINTR (Interrupted > > > system call) > > > > 5095 0.000027 select(7, [3 5], NULL, NULL, NULL <unfinished > > > ...> > > > > 9937 8.001608 <... sendto resumed> ) = 90 > > > > 9937 0.000028 close(4) = 0 > > > > 9937 0.000039 read(6, "\0\0\5\36", 4) = 4 > > > > 9937 0.000037 read(6, "\31\0\0\0\24'\363w{\376B\364Ye > > > !\365\232\216\220\352\343\"\262\334\0\0\0\20\0\0\0"..., 1310) = > 1310 > > > > 9937 0.000104 close(6) = 0 > > > > 9937 0.000029 mmap(NULL, 1310720, PROT_READ|PROT_WRITE, > > > MAP_SHARED|MAP_ANONYMOUS, -1, 0) = 0x2ad8627d9000 > > > > 9937 0.000074 munmap(0x2ad85caed000, 65536) = 0 > > > > 9937 0.000037 wait4(9938, [{WIFEXITED(s) && WEXITSTATUS(s) > == > > > 0}], 0, NULL) = 9938 > > > > 9937 0.000032 alarm(0) = 102 > > > > 9937 0.000023 rt_sigaction(SIGALRM, NULL, {0x2ad85c8637a0, > [], > > > SA_RESTORER|SA_INTERRUPT, 0x2ad85f3432d0}, 8) = 0 > > > > 9937 0.000029 rt_sigaction(SIGALRM, {SIG_DFL, [], > > > SA_RESTORER|SA_INTERRUPT, 0x2ad85f3432d0}, NULL, 8) = 0 > > > > ... > > > > > > > > The problem seems to be caused by writing to /dev/log which > should > > > > be being managed by the rsyslog program. I see a similar problem > > > > reported earlier on the forum: > > > > > > > > rsyslog hangs with imklog + omrelp (Same bug a imuxlog FC ?) > > > > > > > > This was for version 3.18.4 but the symptom sounded very similar. > > > > I restarted the rsyslog process and the login times returned to > > > normal. > > > > Let me know if there is something further I can do to help you > debug > > > > this matter. > > > > > > > > Regards, > > > > Ken > > > > _______________________________________________ > > > > rsyslog mailing list > > > > http://lists.adiscon.net/mailman/listinfo/rsyslog > > > > http://www.rsyslog.com > > > > > > > _______________________________________________ > > > rsyslog mailing list > > > http://lists.adiscon.net/mailman/listinfo/rsyslog > > > http://www.rsyslog.com > > _______________________________________________ > > rsyslog mailing list > > http://lists.adiscon.net/mailman/listinfo/rsyslog > > http://www.rsyslog.com > > > _______________________________________________ > rsyslog mailing list > http://lists.adiscon.net/mailman/listinfo/rsyslog > http://www.rsyslog.com _______________________________________________ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com

