On Wed, 18 Apr 2012, Ski Kacoroski wrote:

I am in the process of redoing my logging architecture to support both *nix and Windows platforms. We currently have Splunk, but because of the per-GB pricing we have already decided that we cannot use it for all our logs (which kind of defeats the purpose of a central logging system). So I was looking at Graylog2 when my intern found NXlog. If anyone has used it, either for a complete system or just as a system to forward Windows logs to a Unix-style logging system, I would appreciate your comments. If you have any other ideas for centralized logging infrastructures that support easy ad-hoc queries via a graphical interface, please let me know.

I have not used nxlog yet (I just learned about it recently), but it sounds like it's a strong contender.

I think you will not go very wrong with any of nxlog, rsyslog, or syslog-ng, as you should be able to replace any one of them with one of the others if you run into serious trouble.

A few years ago I evaluated syslog options and ended up going with rsyslog; rsyslog performance has improved by almost two orders of magnitude since then, so I'm confident that it can transport your logs fast enough.

However, all of these are just going to solve your log transport problem, not your complete logging solution.

What I ended up building uses syslog as the transport mechanism, but then sends the logs to multiple destinations, one of which is Splunk for its easy searching. If you can filter out a lot of the noise, you may find that Splunk is still a valuable piece of your logging infrastructure.
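To make the fan-out concrete, here is a minimal rsyslog.conf sketch in the legacy selector syntax (the hostnames and paths are made up for illustration, not my actual config). Each action line is independent, so every message goes to all three destinations:

```shell
# Write a hypothetical rsyslog.conf fragment: one local archive copy,
# one TCP forward (@@) to a Splunk indexer, one UDP forward (@) to an
# alerting box. Hostnames and paths are invented for this sketch.
cat > /tmp/rsyslog-fanout.conf <<'EOF'
# local archive copy
*.*  /var/log/archive/all.log
# forward everything to Splunk over TCP
*.*  @@splunk-indexer:514
# forward everything to the alerting farm over UDP
*.*  @sec-alert-box:514
EOF
# each "*.*" selector line is a separate delivery action
grep -c '^\*\.\*' /tmp/rsyslog-fanout.conf
```

In legacy rsyslog syntax, `@host` means UDP forwarding and `@@host` means TCP; repeating the selector is what sends the same stream to multiple places.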

That being said, any of the modern syslog daemons can also write to many different types of destinations, including Hadoop, PostgreSQL, Elasticsearch, and others, so the ultimate destination of the logs is a decision independent of the log transport.


Going into a bit more detail on my logging infrastructure:

It was designed to handle at least 100K logs/sec of ~250-byte log messages. It has hit a peak of 92K logs in a second, so it seems to be holding up, and all measurements show that it can handle peaks up to ~400K logs/sec, which is wire speed for my gig-E network; I just don't know how long such a burst could be sustained.
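As a quick sanity check on the wire-speed claim, 400K messages/sec at ~250 bytes each works out to roughly 800 Mbit/sec, which is indeed about all a gig-E link can carry once you set aside framing overhead:

```shell
# 400,000 logs/sec * 250 bytes/log * 8 bits/byte = 800,000,000 bits/sec
bits_per_sec=$((400000 * 250 * 8))
echo "$bits_per_sec"   # 800000000, i.e. ~0.8 Gbit/sec of payload
```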

I have a first tier of log relay boxes. These boxes receive the logs from all my different networks and do whatever cleanup is needed (custom log formats to fix broken senders, running rsyslog cleanup parser modules to deal with things like Cisco routers adding an extra field if they log by name, etc.).

The first tier then delivers the logs over a dedicated switch (Cisco 3550) via UDP to a multicast MAC address, where multiple farms of servers receive them (using the iptables CLUSTERIP feature). This puts a single copy of each log message on the wire, no matter how many farms are receiving it, and each of these farms can have multiple boxes splitting the inbound traffic between them.
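For reference, the CLUSTERIP setup on each receiving box looks roughly like the following (the address, MAC, and node counts here are invented; this is a sketch of the mechanism, not my exact rules). The multicast cluster MAC is what makes the switch flood one copy of each packet to every node; the source-IP hash then decides which node in a farm actually processes it:

```shell
# Hypothetical CLUSTERIP rule for node 1 of a 2-node receiving farm
# listening on the shared address 10.0.0.100 for syslog/UDP traffic.
# --clustermac is a multicast MAC, so all nodes see every packet;
# --hashmode sourceip picks which node handles a given sender.
iptables -I INPUT -d 10.0.0.100 -p udp --dport 514 \
    -j CLUSTERIP --new \
    --hashmode sourceip \
    --clustermac 01:00:5e:00:00:20 \
    --total-nodes 2 --local-node 1
```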

The farms that I currently have are:

1. Archive (simple store to disk)

2. Reporting (actually the same box as #1, runs periodic scripts against the logs to create hourly and daily reports)

3. Alerting (Simple Event Correlator to generate alerts on specific messages or combinations of messages)

4. Searching (Splunk for easy ad-hoc searching of the logs)
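As an example of what the alerting farm (#3) does, here is a SEC rule sketch; the log pattern and the alert script path are hypothetical, invented just to show the rule shape:

```shell
# Hypothetical SEC rule: if the same device logs 5 "link down" events
# within 60 seconds, run an alert script. Pattern and script path are
# made up for illustration.
cat > /tmp/linkdown.sec <<'EOF'
type=SingleWithThreshold
ptype=RegExp
pattern=^(\S+) .*%LINK-3-UPDOWN: Interface .* changed state to down
desc=Repeated link flaps on $1
action=shellcmd /usr/local/bin/page-oncall.sh "$1"
window=60
thresh=5
EOF
# one rule definition written out
grep -c '^type=' /tmp/linkdown.sec
```

The SingleWithThreshold rule type is what lets you alert on combinations ("N occurrences within a window") rather than on every single matching line.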

In the past I've had a couple of other alerting and reporting farms running proprietary tools, but they've since been phased out.

This approach allows me to add an additional farm of receiving boxes without having to make any configuration changes on the relay boxes, and according to my testing it can scale up to gig-E wire speed with no packet loss after sending several billion test log messages (at least as long as the disk I/O can keep up with rsyslog writing the data out to disk).

The drawback to this approach is that since it uses UDP, if a receiving farm dies entirely, the sending systems don't know it. I have the relay boxes write a copy of the logs locally as well as forwarding them, so I have always been able to recover using those logs (I really need to finish getting all of my farms HA :-).

If a log is extremely long, I currently allow it to be truncated to one packet. I've thought about going to jumbo frames on the log delivery network so that I could handle log lines up to 9K cleanly, but it hasn't been enough of an issue for me to do so yet. The other thing I could do is detect extra-long lines and switch from the efficient UDP multicast delivery to sending multiple copies (one to each farm) via TCP, but I probably won't do that until after going to jumbo frames (if ever).
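The one-packet limit follows directly from the MTU: on standard 1500-byte Ethernet, a UDP datagram that avoids IP fragmentation can carry at most 1472 bytes of syslog payload (1500 minus 20 for the IPv4 header and 8 for the UDP header), while 9000-byte jumbo frames would raise that to 8972:

```shell
# Max unfragmented UDP payload = MTU - 20 (IPv4 header) - 8 (UDP header)
echo $((1500 - 20 - 8))   # 1472 bytes on standard Ethernet
echo $((9000 - 20 - 8))   # 8972 bytes with jumbo frames
```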

David Lang
_______________________________________________
Discuss mailing list
[email protected]
https://lists.lopsa.org/cgi-bin/mailman/listinfo/discuss
This list provided by the League of Professional System Administrators
http://lopsa.org/
