On Thu, 29 Jan 2009, Rainer Gerhards wrote:

> Hi all,
>
> I had another interesting discussion with Lorenzo today. Those of you
> interested in details my find the chatlog interesting:
>
> http://blog.gerhards.net/2009/01/some-more-on-rsyslog-data-race.html

so, distilling this down I think I am reading the following.

1. mixing mutex and atomic operations is a problem, one or the other is 
safe

2. reliable duplication of the problem requires

fast machine
multiple cores _not_ sharing L1 cache (early Intel 4-core machines or 
multi-socket machines)
a complex rsyslog config that uses multiple thread heavily
high traffic log volume to heavily load rsyslog
high system load external to rsyslog increases the chancesof the race

question, have you tried enabling/disabling preemption in the kernel on 
these systems to see if that affects the probability of having a problem?

I'm eagerly waiting for the fixes to appear in the 4.1 branch to test them 
out.

David Lang


> Rainer
>
>> -----Original Message-----
>> From: [email protected] [mailto:rsyslog-
>> [email protected]] On Behalf Of Rainer Gerhards
>> Sent: Wednesday, January 28, 2009 6:32 PM
>> To: rsyslog-users
>> Subject: Re: [rsyslog] rsyslog still crashes
>>
>> Hi all,
>>
>> thanks to Lorenzo's help, we made good progress. It is too much to
> post
>> inside a mail, please have a look at my analysis of the bug:
>>
>> http://blog.gerhards.net/2009/01/rsyslog-data-race-analysis.html
>>
>> The short story is that we have at least improved the situation very
>> much and I hope to have fixes for all branches within the next couple
>> of
>> days.
>>
>> Rainer
>>
>>> -----Original Message-----
>>> From: [email protected] [mailto:rsyslog-
>>> [email protected]] On Behalf Of Rainer Gerhards
>>> Sent: Friday, January 16, 2009 3:22 PM
>>> To: rsyslog-users
>>> Subject: Re: [rsyslog] rsyslog still crashes
>>>
>>> Lorenzo,
>>>
>>> I have created a new branch "raceDebug" and done a first commit to
>> it.
>>> The change is very lightweight. Please pull, compile as usual and
>> give
>>> it a try. It spits out some info to stdout from time to time
>>> (hopefully). I am not sure if it aborts, depending on the output it
>> may
>>> or may not. Even if we get messages, they are probably not enough to
>>> pinpoint the bug, but I wanted to do something very light to see if
>> the
>>> bug stays.
>>>
>>> Feedback appreciated.
>>>
>>> Rainer
>> _______________________________________________
>> rsyslog mailing list
>> http://lists.adiscon.net/mailman/listinfo/rsyslog
>> http://www.rsyslog.com
> _______________________________________________
> rsyslog mailing list
> http://lists.adiscon.net/mailman/listinfo/rsyslog
> http://www.rsyslog.com
>
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com

Reply via email to