2012/5/22 <[email protected]>

> On Tue, 22 May 2012, Juan Jose Pavlik wrote:
>
>  2012/5/22 Tomas Heinrich <[email protected]>
>>
>>  Hi. Just some quick notes.
>>>
>>> Upgrade if you can. 5.8.5 is way too old.
>>>
>>
>>
>> I'm running OpenSuSE 12.1 and that's the rsyslog version that comes with
>> it, i don't like using software out of the repositories. What version
>> should i try??? I've heard something about bugs in this version..., this
>> is
>> a good start.
>>
>
> rsyslog improves rapidly, unfortunantly the distros upgrade very slowly
> and don't backport many of the fixes.
>
> the current version 5 release is 5.8.11, you should upgrade to that.
>
>
>  On 05/22/2012 03:56 PM, Juan Jose Pavlik wrote:
>>>
>>>  Right after the queues filled up, it stoped sending logs to the second
>>>> log
>>>> server too.
>>>>
>>>>
>>> My guess was that is hangs on one action which fills the main queue which
>>> slows message processing. But forwarding would be the one to suspect if
>>> other outputs are just plain files.
>>>
>>
>>
>> I thought that too, that's way i dissabled the database writing (the db is
>> in a remote server), i don't think that the other rsyslog is the one
>> slowing it down. Maybe is a network problem...?
>>
>
> do you have batch writes enabled to the database server? It's very
> possible that rsyslog is just unable to write it's output fast enough and
> that causes everything to stall. Configuring batch writes (and adjusting
> the batch size) will allow you to insert multiple logs in one transaction.
> In past tests I've been able to write 100+ logs in a transaction at
> approximatly the same transactions/sec rate as writing single logs per
> transaction.
>

No clue about batch writting, i don't have to much experience with
postgress, but i will check this out.


> What is the rate of logs that you are getting? I just write to flat files
> and post process them (on a one minute cron cycle), but I routinly handle
> 30K logs/second and have handled >92K logs in a single second.
>
> I would need more details of your systems to point out other possible
> issues. For example, if you have your database on the same filesystem that
> rsyslog is writing to, and that filesystem is ext3, there's a chance that
> the fsync calls that your database is performing is stalling all I/O to the
> filesystem. Ext3 is pathalogicly bad at performing fsyncs and can stall all
> I/O for up to 30 seconds at a time. This sort of delay can cause a huge
> backlog to build up in rsyslog.
>
> If you look at the different rsyslog threads (the 'H' option in top), you
> may see that one of them is going haywire trying to do something when you
> have trouble, or you may find that they all go to zero cpu when you have
> problems. what they do will indicate different sorts of problems. Based on
> what you are describing, I would guess they go to zero CPU, but you should
> check.
>

My munin graphs confirm that cpu usage went to zero, when that happened. I
will take a look with top if this happen again.


>
> David Lang
>
>   in my centralized logging server and im getting some troubles i'd really
>>>
>>>> love to figure out. I've around 170 servers/switches/otherthings
>>>>>> logging on
>>>>>> this server, most of them just send auth.* logs, some apaches sending
>>>>>> the
>>>>>> access and error logs, and switches sending warns and errors.
>>>>>> Sometimes
>>>>>> the
>>>>>> rsyslog queues get complettly filled up and it stops writing logs to
>>>>>> disk,
>>>>>> this is the exact logs of what happened:
>>>>>>
>>>>>>
>>>>>  Stops completely or just writes them incredibly slowly?
>>>
>>>
>> It writes them incredibly slow, right.
>>
>>
>>
>>>  Once *size* reaches 10000 (the default max as far as i know) things get
>>>
>>>>
>>>>>> complicated, rsyslog starts to drop logs and misbehave. The rsyslog
>>>>>>
>>>>>>
>>>>>  Dropping is a default action in case of congestion, do you really see
>>> some
>>> misbehavior?
>>>
>>>
>>>  What i see is (i've munin graphs of the server):
>>
>> -disk writing goes down, to almost zero.
>> -rsyslog queues starts to grow dramatically fast.
>>
>>
>>
>>
>>>  configuration write a per host files into /var/log/servidores/, it also
>>>
>>>> sends some logs to another rsyslog server and a postgress database
>>>>>> running
>>>>>> in another server. 2 weeks ago, i disabled sending logs to the
>>>>>> postgress
>>>>>> databse, because i had this same problem and we lost too many hours of
>>>>>> logs. Most of the servers are sending logs by TCP and a few servers
>>>>>> and
>>>>>> other devices use UDP.
>>>>>>
>>>>>> Is there a way i can avoid this problem? should i increase the
>>>>>> mainqueue
>>>>>> size? use other queues? Any help will be great. Thanks
>>>>>>
>>>>>>
>>>>>  Any particular size of a queue (or available ram) is finite. If you
>>> can
>>> identify the output that blocks the processing, put it in a separate
>>> queue
>>> and configure enqueuing to have a short timeout. This should mitigate the
>>> issue to some degree, not to be an ideal solution.
>>>
>>>
>> How can i identify the blocking proccess? any idea?
>>
>>
>>
>>> e.g.
>>> $ActionQueueTimeoutEnqueue <milisec>
>>>
>>> This could also be a bug, but I can't recall all the issues all the way
>>> back to 5.8.5. Do you encrypt the forwarded logs?
>>>
>>>
>>>  I'm not encrypting logs, i think it's a bug too.
>>
>>
>>  Tomas
>>> ______________________________****_________________
>>> rsyslog mailing list
>>> http://lists.adiscon.net/****mailman/listinfo/rsyslog<http://lists.adiscon.net/**mailman/listinfo/rsyslog>
>>> <http:**//lists.adiscon.net/mailman/**listinfo/rsyslog<http://lists.adiscon.net/mailman/listinfo/rsyslog>
>>> >
>>> http://www.rsyslog.com/****professional-services/<http://www.rsyslog.com/**professional-services/>
>>> <http://**www.rsyslog.com/professional-**services/<http://www.rsyslog.com/professional-services/>
>>> >
>>>
>>> What's up with rsyslog? Follow https://twitter.com/rgerhards
>>>
>>>
>>
>>
>>
>>  ______________________________**_________________
> rsyslog mailing list
> http://lists.adiscon.net/**mailman/listinfo/rsyslog<http://lists.adiscon.net/mailman/listinfo/rsyslog>
> http://www.rsyslog.com/**professional-services/<http://www.rsyslog.com/professional-services/>
> What's up with rsyslog? Follow https://twitter.com/rgerhards
>

Should i go for 5.8.11 or 6.2.1? I have to read a bit about queues, that's
for sure.

-- 
Pavlik Salles Juan José
Prosecretaría de Informática - UNC
Área Redes y Servidores
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards

Reply via email to