Re: [rsyslog] A solution to action load balancing

Pavel Levshin Sun, 20 Oct 2013 04:24:55 -0700

My exact problem has been decribed two days ago in a thread named"mmnormalize under high load".

We are dealing with just one huge stream of syslog messages. All theyshare same source "host:port" pair (in fact, it is spoofed source), anda single destination "host:port" (our syslog server). These messages arevery similar, having the same PRI, and, to make things even worse, theyare not RFC-compliant. Rsyslog is unable to parse them properly.

For now, we just have to write incoming messages into files, one fileper minute. This works fine. But if we want (and we definitely will) toanalyze messages in real time, there is a place when somethingCPU-intensive kicks in. Something like mmnormalize. There will beexactly one heavy action, which cannot be paralleled.

Then, in the future, we will forward messages to a few backend syslog ORdatabase servers. To spread the load, we again must do some distinctionbetween messages to select one of predefined actions.



--
Pavel Levshin


20.10.2013 14:38, David Lang:

I can see other uses for a sequence number, so thanks for creating this.

However:
The picture is not quite as bleak as you are making it sound. Rsyslogalready scales pretty well to large numbers of cores.
The key thing to remember is that you are almost always going to bedoing more than one thing, so while any one thing may end up beingsingle threaded, you can still have many threads operating at a time.
most action modules have some point where they cannot be singlethreaded (think writing to a file or TCP socket).
The key to doing a lot of things in parallel is the rsyslog queueparameters.
If you configure multiple queue workers, they may not be doing thesame action at the same time, but they can be working on differentactions at the same time.
With some action modules, such as the ones that do database inserts,the module does support having multiple threads, because the remoteend is able to handle parallel writes.
With file output, you can enable async writes, so that you have onethread writing the output to disk (potentially with compression,signing, etc) while another thread is crafting the strings to be written.
It's very common that the bottleneck ends up being in stringgeneration (complex template patterns for the file format or for thedynamic filename). Rsyslog supports string modules, which can besignificantly more efficient in creating these strings than thetemplate languange. The built-in templates were implemented this wayand resulted in a noticable improvement on the peak performance ofrsyslog, and they are relatively simple templates. With more complextemplates the gains can be substantially bigger.
What action are you doing that is running into a problem?

David Lang


_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards
NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of 
sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE 
THAT.

Re: [rsyslog] A solution to action load balancing

Reply via email to