Re: [rsyslog] Measuring performance of Rsyslog 3

david Tue, 15 Feb 2011 12:31:08 -0800

On Tue, 15 Feb 2011, Dirk wrote:

Am 15.02.11 10:32, schrieb Rainer Gerhards:
I guess you look for performance counters. Unfortunately, there are not
present before the introduction of impstat in v5.
We will have a look at that, thanks. We start trying to implant v5 intoSLES10.
We stumbled upon a strange phenomenon: rsyslog reads messages from a log fileand sends them to rsyslog on a central log server which parses them andwrites them to log files. The client needs 1 % CPU for that, and the serverneeds 100 % CPU for that - with only the messages from this one client! Bothmachines are exactly the same.
The configuration on the server is quite complex, so the messages we testwith have to be parsed by 550 rules before they match, get written anddiscarded.
Is this asynchronous resource usage "normal"? Or is it specially v3doing it thus - would we benefit from using v5? Does it depend on thenumber of rules to be parsed - would we benefit from using regularexpressions (assuming this is possible)?

yes, it is very normal for the receiver to use much more CPU than thesender.

if you think about what's happening, all the sender needs to do is to readthe text, add a bit of formatting, and then send it over the network

the receiver needs to receive arbatrary text, parse it to decide what sortof message it is and how it is formatted, then process the rules todecide if each rule applies, and then if the rule does apply, assemble anew output message (potentially changing the text that it has) and writingit out.


that being said, there are a lot of ways to improve this.

there is a fair amount of overhead in rsyslog when receiving messages asthey get moved to and from the queue, the newer versions will movemultiple messages at once, so they cut down this overhead a lot. There area lot of other performance improvements since version 3.

you can save 5-10% CPU by having predefined templates for writing the logsto a file instead of using the very flexible runtime defined templates

but the big cost (and therefor the big win) will be in working to optimizethe rules that you have to evaluate.


why do you have so many rules?

can you say that once a rule has matched the log none of the other rulesapply? (or if you can't say this as a blanket statement, are there caseswhere you can say this?)

do you have some rules that are much more common to match than others?(especially important in combination with the prior question)

if you think of your rules logically, do they (or portions of them) form atree where you can look for something and then branch into two differentsets of rules to then evaluate after that (if so, then the new rulesetsfeature may be the right thing for you)

As part of this, the different types of matching rules have very differentcosts (an if (regex) then arrangement being the highest overhead). it maybe worth trying to use different types of matching rules, especially forthe most common cases.

once we can get an idea of what your rules look like, we may be able tosuggest other optimizations.


David Lang
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com

Re: [rsyslog] Measuring performance of Rsyslog 3

Reply via email to