Re: [rsyslog] rsyslog performance as receiver, heavily using regex in templates

Rainer Gerhards Thu, 31 Jan 2013 05:44:10 -0800

On Thu, 2013-01-31 at 14:51 +0200, Radu Gheorghe wrote:
> Hi Ben,
> 
> 2013/1/31 Ben Bradley <[email protected]>
> 
> > Hi everyone
> >
> > I'm currently using logstash as the log collector from a few rsyslog
> > sender clients. I'd like to use rsyslog to receive the remote logs instead
> > of logstash. This means I'm keeping things simple and can possibly also use
> > RELP.
> >
> > If the rsyslog receiver is doing alot of regex parsing on each message
> > received (i.e. parsing Apache logs into ElasticSearch fields) at what sort
> > of volume of log messages would I start to notice performance problems?
> >
> > Eventually I'm expecting about 5-10GB per day to be received by our
> > centralised rsyslog log server.
> >
> 
> I guess it all comes down to performance testing, but 10GB would probably
> mean ~20M logs or something like that. If the majority of those will be
> sent during the day (say 10 hours), my poor math says if you handle 500-600
> logs/sec you should be fine.


seeing that number, I'd say it requires quite some regexpes to get
rsyslog to sweat. HOWEVER... do we really need regexpes? Can you post a
couple of samples?

Rainer
> 
> I've never used regex with rsyslog in a performance situation, so I can't
> say, but it seems to me like it should easily handle that amount.
> 
> 
> >
> > Should I actually get the rsyslog senders to parse the regex patterns of
> > Apache logs into JSON then forward that JSON to the receiver? So the
> > sender's got the regex overhead?
> >
> > Or will an rsyslog receiver easily be able to parse all the regex patterns
> > with my volume of logging?
> > Having the regex patterns parsed in one place would make for easier
> > management. If necessary we can just throw more vCPUs and memory at the log
> > server without needing to touch the web nodes.
> >
> 
> I suspect the load won't be too high, but making the clients to that will
> scale a lot better and - especially since we don't expect the total load to
> be high - nobody will feel that load if it's that distributed. And if you
> add more web nodes, you won't have to touch anything. Not even adding vCPUs
> and memory.
> 
> Personally, I'd try the "centralized" method first, because it's easier to
> get started. If all works smoothly, you can push the same(ish) config to
> the web nodes. If you ever feel the need to do that :) By then, configuring
> them might get easier because of natural evolutions of packaging, testing
> and documentation.
> 
> Best regards,
> Radu
> _______________________________________________
> rsyslog mailing list
> http://lists.adiscon.net/mailman/listinfo/rsyslog
> http://www.rsyslog.com/professional-services/
> What's up with rsyslog? Follow https://twitter.com/rgerhards
> NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of 
> sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T 
> LIKE THAT.

_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards
NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of 
sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE 
THAT.

Re: [rsyslog] rsyslog performance as receiver, heavily using regex in templates

Reply via email to