> -----Original Message----- > From: [email protected] [mailto:rsyslog- > [email protected]] On Behalf Of [email protected] > Sent: Tuesday, November 03, 2009 4:06 PM > To: rsyslog-users > Subject: Re: [rsyslog] Queuing bug (4.5.5) > > On Tue, 3 Nov 2009, Aaron Wiebe wrote: > > > This is still taking place this hour - though I obviously can't > > restart onto a newer version without losing this case. Is there > > anything I can do in the current configuration to try and debug this > > situation? > > if you can strace each thread for a few seconds it may give you an idea > what's happening.
This is a good suggestion. It would also potentially be enlightening to attach gdb to the process at various points in time and get a backtrace from all running threads. Other than that, there is little you can do on a running version. If it were compiled for debug, and the environment setup so that we could capture stdout/stderr, sending SIGUSR2 would yield to much the same information as the gdb backtrace BUT when runtime instrumentation is enabled, the build is operating at a third to a quarter of its normal speed. > in the meantime you need to stop the system from getting further > behind, > can you redirect or reconfigure the senders so that they are not > sending > new logs to this box so that it can dig itself out (stopping the input > may > be enough by itself, rsyslog has historicly done a LOT of locking > around > the main queue, and if you have a full queue with more data trying to > be > delivered it can really slow things down. I wouldn't expect it to be > this > much, but it could be part of it) > This may clean up things, but I really doubt it, given the magnitude of delays. Also, Aaron runs 4.4.5, which already has lots of the locking removed/restructure (not to compare with the recent v5-devel, but much more effcient than early v4 and v3). Rainer > David Lang > > > (We're up to 18:46:51 now!) > > > > -Aaron > > > > On Tue, Nov 3, 2009 at 1:46 AM, Rainer Gerhards > > <[email protected]> wrote: > >> mhhh... This is obviously not intended behavior. There are some > rate-limiting > >> features that can deliberately slow down a queue, but I guess you > have not > >> configured these. So there either is a bug that activates them at > some point > >> during processing (e.g. an invalid memory access could do that) or > there is > >> some other bug that causes the dequeue to happen very slowly. In any > case, it > >> is hard to guess. > >> > >> Given the volume of the messages, a debug log probably does not > help. We > >> could gain a little insight in at least the queue sizes that rsyslog > sees via > >> imdiag plus the (very rudamentary) v5 debug front-end (it doesn't > require the > >> engine to be v5!). I would need to spend at least a little work on > the > >> front-end to enable remote access, but that's not really a problem. > >> > >> One other thing is that I am still holding a v4-beta with additional > fixes as > >> I didn't want to put these in the middle of some other debugging. > However, > >> these fixes address potential memory problems, so the most > appropriate course > >> of action would be to give that version at least a try. It needs to > be > >> released in the next days in any case. > >> > >> I have uploaded that (pre-4.5.6) version so that you can give it a > try if you > >> like: > >> > >> http://www.rsyslog.com/download/rsyslog/pre/rsyslog-4.5.6.tar.gz > >> > >> I think it would good if you could try it at least once, because I > think any > >> other troubleshooting will boil down to looking at the fixes this > version > >> contains. > >> > >> Rainer > >> > >>> -----Original Message----- > >>> From: [email protected] [mailto:rsyslog- > >>> [email protected]] On Behalf Of Aaron Wiebe > >>> Sent: Monday, November 02, 2009 11:52 PM > >>> To: rsyslog-users > >>> Subject: [rsyslog] Queuing bug (4.5.5) > >>> > >>> Greetings all, > >>> > >>> I appear to have run into an issue with 4.5.5 where queues are not > >>> being flushed in a timely manner. In this specific case, I have > data > >>> from last wednesday that is being written to disk in small chunks > >>> today since last wednesday. Unfortunately I cannot be too specific > in > >>> details, but here is what I can see: > >>> > >>> Using omfile+gzip, there appears to have been a decent burst in > >>> traffic sometime last wednesday. The rsyslog instance has grown to > >>> 1.8GB of memory and stayed there for a while. I have both main > >>> message and action queues defined using fixedarray, and I see no > >>> on-disk queues (appears to be entirely in memory). > >>> > >>> I've got templates writing out to filenames using an hourly > timestamp > >>> (filenames like: [token]-[host]-YYYYMMDD-HH.txt.gz) In some of > those > >>> files, all of them less than 5k in size, there are between 5 and 15 > >>> lines of content, all of them from last wednesday, and within a few > >>> seconds of each other. It's almost like there was a significant > queue > >>> built up, the hour rolled over, and only the first block of lines > were > >>> pulled from the queue. Then the hour rolled over again, and > another > >>> block of lines were pulled from the queue. Then the next hour, > then > >>> another 5-15 lines. > >>> > >>> Is it possible that one of the queues still has a good chunk of > data > >>> built up, and is flushing it out very slowly? It hasn't been > >>> consistantly at the top of the hour either, and not every hour. > But > >>> the log lines themselves are sequentially written out, and usually > the > >>> lines are within a few seconds of each other. > >>> > >>> For example: > >>> > >>> syslog-myhost-20091102-18.txt.gz: 3 lines, 2 with TS Oct 21 > 18:46:34 > >>> and one 18:46:35 > >>> syslog-myhost-20091102-19.txt.gz: 17 lines, 3 Oct 21 18:46:35, 14 > >>> lines Oct 21 18:46:36 > >>> syslog-myhost-20091102-20.txt.gz: 12 lines, 8 Oct 21 18:46:36, 4 > >>> lines Oct 21 18:46:37 > >>> > >>> Thoughts? > >>> -Aaron > >>> _______________________________________________ > >>> rsyslog mailing list > >>> http://lists.adiscon.net/mailman/listinfo/rsyslog > >>> http://www.rsyslog.com > >> _______________________________________________ > >> rsyslog mailing list > >> http://lists.adiscon.net/mailman/listinfo/rsyslog > >> http://www.rsyslog.com > >> > > _______________________________________________ > > rsyslog mailing list > > http://lists.adiscon.net/mailman/listinfo/rsyslog > > http://www.rsyslog.com > > _______________________________________________ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com

