On Wed, 14 Dec 2016, David Lang wrote:
this will slow rsyslog down by a factor of >1000x. I ran tests on
this using a very high-end PCI-based SSD and saw a 2000x - 8000x
slowdown, and that was with v5. The performance improvements made
since then would cause an even bigger slowdown, because they reduce
the amount of time rsyslog spends processing the message outside of
the queue manipulation; I would expect the slowdown to be
significantly above 10,000x nowadays.
you also have to set batch size = 1
The problem is that in a crash you can't count on anything in memory
making it to disk, so you have to write each message to disk, do an
fsync on the data, possibly do an fsync on the directory [1], and only
then respond with the ack.
doing anything else exposes you to a window where the message could be
lost.
But the performance of filesystems is such that doing an fsync for
every write to the disk is crippling, even with SSDs [2]. You also
need to make sure that the SSDs don't buffer the write at all (many
do), and that you are writing to a RAID set (so that the failure of a
single SSD doesn't lose the data), which in turn means you need to
make sure your RAID controller either doesn't do any buffering, or
has a battery-backed cache.
If you are willing to lose messages in the case of a complete system
crash/power outage/hardware failure, then things get much simpler:
you don't need to use a disk-type queue, just use a disk-assisted
queue (type FixedArray or LinkedList with a filename) plus
saveonshutdown, and you get very good performance (except for the
known performance issues with retrieving data from a disk queue).
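For reference, a disk-assisted queue along those lines might look like this in the newer rsyslog config syntax; the action target, queue filename and size here are illustrative placeholders, not recommendations:

```
# disk-assisted in-memory queue: stays in RAM, spills to disk only
# on overflow, and is written out to disk on (clean) shutdown
action(type="omfwd" target="collector.example.com" port="514"
       queue.type="LinkedList"
       queue.filename="fwdq"          # a filename enables disk assistance
       queue.saveOnShutdown="on"
       queue.maxDiskSpace="1g")
```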
LinkedList with saveonshutdown will be the way to go.
About the "creating temporary queues" behaviour when a ruleset
doesn't have a queue defined, I'll wait for rgerhards to give a deep
explanation, full of examples, colors and music all around.
would imrelp->file->mmjson+mmnorm->elastic be a better approach?
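That pipeline could be sketched roughly as below (the module names are real rsyslog modules, but the port, file paths, rulebase and server are made-up placeholders, and the two stages would normally be tuned further):

```
# stage 1: receive via RELP and spool to a plain file
module(load="imrelp")
input(type="imrelp" port="2514" ruleset="spool")
ruleset(name="spool") {
    action(type="omfile" file="/var/spool/rsyslog/relp.log")
}

# stage 2: tail the file, parse, and ship to Elasticsearch
module(load="imfile")
module(load="mmjsonparse")
module(load="mmnormalize")
module(load="omelasticsearch")
input(type="imfile" file="/var/spool/rsyslog/relp.log"
      tag="relp" ruleset="ship")
ruleset(name="ship") {
    action(type="mmjsonparse")
    action(type="mmnormalize" rulebase="/etc/rsyslog.d/norm.rb")
    action(type="omelasticsearch" server="127.0.0.1")
}
```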
[1] it's actually even worse than this with filesystems newer than
ext2: you write the data to the file and do an fsync on the data.
If this changes the size of the file in disk blocks, you then need to
do an fsync on the directory the file is in so that the size change
is written.
Each of these writes is written to the filesystem journal as a
temporary thing, and then the filesystem needs to write a copy of the
data to the final location and (once it knows the data is safe there,
possibly involving another fsync for both the data and metadata) do
another write to the journal to record that the data was safely saved
in the final location.
So processing one log message can require 6 writes, each of which can
require an fsync.
[2] without SSDs you are limited to <rpm/60 fsyncs/sec due to the
simple physics of a write requiring the disk to rotate once (a 7200
rpm disk therefore tops out at 120 fsyncs/sec), and since you aren't
infinitely fast, you end up a fair bit below that.
ext3 had a bug where an fsync required writing all pending data to
the disk, including data that was written after the fsync started.
People documented a single fsync taking >30 seconds.
These are enough reasons to definitively totally absolutely use fsync as
much as we can
:P
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards
NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of
sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE
THAT.