I switched from syslog-ng to rsyslog a few months ago and amazed at
performance/features of rsyslog and responsive-ness of the dev team. The
documentation sure is spotty but the mailing list very well compensates for
lack of documentation.

Specifically about impstats, yes, I have used it to troubleshoot issues
upstream with flume/hadoop and disk IO issues. It is a very valuable tool.




On Sat, Oct 12, 2013 at 10:01 AM, Robert J. McIntyre
<[email protected]>wrote:

> Not a question, but sharing a story/experience.
>
>
>
> My rsyslog server (v. 7.4.4) takes in about 60K mps right now, does some
> light filtering, and then writes the messages out to two different file
> shares on Windows servers over CIFS.  I'm using dynamic filenames for both
> destinations, and separate disk-assisted action queues.  It can generally
> keep up, but during peak hours the two omfile queues usually run about 4
> million events behind, and from time-to-time one will spike into the
> hundred+ million events behind.  To make matters worse, the remote ruleset
> will periodically get backed up as well.
>
>
>
> I've been beating my head against these issues for several months, and just
> nursing the system along.  It's my production system, so I have to be
> careful about making changes to it, so I can't just willy-nilly try things
> out.  But, because of a recent post by Rainer, I found this page
> (http://www.rsyslog.com/rsyslog-statistic-counter/), and started to go
> through my stats log with a fresh eye.
>
>
>
> Looking at my dynafile stats outputs, I was getting millions of missed and
> evicted files over time, but I didn't know what that meant.  After reading
> the stats description, I knew immediately what was going on.
>
>
>
> My dynafile template is (in pseudo-code, because I don't have the template
> in front of me):
>
>
>
> <sending devicename  pulled from the message header using the %msg:F
> feature>-firewall-YYYY-MM-DDTHH.QH
>
>
>
> We have between 11 and 15 devices sending logs at any given time.  So, you
> can see that we need to have, at any time, at least 11, but possibly 15
> files open at any time.  With the default dynafile cache size of 10, we
> were
> guaranteed to have a steady stream of misses and evictions as the file(s)
> that weren't in the cache needed to be accessed.  After raising the cache
> queue to 20 (comfortably hold one set of files at a time), we've eliminated
> the excess cache misses/evictions, and our omfile queues rarely have any
> back up at all, with transient peak (queue maxsize) values in the low
> hundreds of thousands, vs. 10's or 100's of millions.  Rest assured, we'll
> be continuing to look at our impstats output with a magnifying glass now.
> :)
>
>
>
>
> I just thought I'd share this with the group, in case it helps anyone else,
> or inspires you to take a closer look at what's happening with your server.
>
>
>
> Cheers!
>
> Robert
>
> _______________________________________________
> rsyslog mailing list
> http://lists.adiscon.net/mailman/listinfo/rsyslog
> http://www.rsyslog.com/professional-services/
> What's up with rsyslog? Follow https://twitter.com/rgerhards
> NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad
> of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you
> DON'T LIKE THAT.
>
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards
NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of 
sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE 
THAT.

Reply via email to