Hi!

Sometimes people on this list proudly state that "you always get an answer to 
your questions" or similar. As you can see, this case is a very good example of 
this. Problem solved in three emails, including the initial question - great 
job! [Big smiley here!]

/Fredrik

> 4 feb 2014 kl. 17:10 skrev Joel Hansell <joel.hans...@gmail.com>:
> 
> Hello,
> I was unable to reproduce the problem on 5.7.1, so I assume that this was 
> already fixed. I'll upgrade our production installation to 5.7.2 as soon as 
> possible.
> BR,
> Joel Hansell
> 
> 
>> On Tue, Feb 4, 2014 at 3:09 PM, Joel Hansell <joel.hans...@gmail.com> wrote:
>> Hi again,
>> 
>> As an update, I was able to reproduce the error by writing a script that 
>> issues 4096 HUP signals to snmptrapd. The process starts issuing the 
>> "maximum conf file count (4096) exceeded" error, and forgets the configured 
>> logging format.
>> I don't suppose this is a known bug which was fixed some time after 5.6.1.1? 
>> It looks like an issue with the signal handling - there is only one config 
>> file, which is read multiple times. Seems like the config file counter 
>> should be reset when SIGHUP is received.
>> I can't find the source for 5.6.1.1, but in the 5.6.2 code, I don't see what 
>> the problem could be since "files" is in function-local scope. It should be 
>> zeroed every time the read_config() function is called. I guess there could 
>> be some scope confusion (or confusion on my part, I'm not a habitual C 
>> programmer), or the problem was already fixed in 5.6.2...
>> 
>> I'm trying to reproduce the bug on 5.7.1 now, but unfortunately, since 
>> snmptrapd takes about a second to restart on SIGHUP, the test takes just 
>> over an hour to execure.
>> 
>> BR,
>> Joel Hansell
>> 
>> 
>> 
>> 
>>> On Tue, Feb 4, 2014 at 10:58 AM, Joel Hansell <joel.hans...@gmail.com> 
>>> wrote:
>>> Hi list,
>>> 
>>> Here's one I've been scratching my head over lately.
>>> 
>>> We have a bit of an oddball solution, where we've got snmptrapd logging a 
>>> lot of traps into a file, which is parsed by an external tool. This is all 
>>> running on a HP-UX 11.2 system, with Net-SNMP version 5.6.1.1 delivered 
>>> with the "HP-UX Internet Express" package.
>>> 
>>> A cron job runs "logrotate" every 15 minutes, and if the file is too big, 
>>> it's rotated, and the postrotate script issues a SIGHUP to snmptrapd. That 
>>> normally triggers the daemon to re-read its config and to restart the 
>>> logging into a new file.
>>> 
>>> The snmptrapf.config is set up to use a particular one-line trap logging 
>>> format.
>>> 
>>> It seems that every so often, the snmptrapd fails subtly on SIGHUP. It only 
>>> seems to happen after more than a couple of months have passed. 60 days, 91 
>>> days, 101 days, 113 days are some of the fault intervals.
>>> 
>>> I've observed the following about the failure state after it happens:
>>> - Snmptrapd is executing and logging traps
>>> - The trap logging format has changed to the default trap logging format 
>>> (three lines per trap). This causes our parser to fail
>>> - snmptrapd logs the error "[...]/snmptrapd.conf: line 0: Error: maximum 
>>> conf file count (4096) exceeded" at the start of the log file.
>>> 
>>> The flow of traps is such that the log file is usually rotated every 30 
>>> minutes, but it goes up and down a bit. Could this failure be happening 
>>> after 4096 SIGHUPs? That would explain the varying time between failures.
>>> 
>>> I'm grateful for any input.
>>> 
>>> Regards,
>>> Joel Hansell
> 
> ------------------------------------------------------------------------------
> Managing the Performance of Cloud-Based Applications
> Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
> Read the Whitepaper.
> http://pubads.g.doubleclick.net/gampad/clk?id=121051231&iu=/4140/ostg.clktrk
> _______________________________________________
> Net-snmp-users mailing list
> Net-snmp-users@lists.sourceforge.net
> Please see the following page to unsubscribe or change other options:
> https://lists.sourceforge.net/lists/listinfo/net-snmp-users
------------------------------------------------------------------------------
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231&iu=/4140/ostg.clktrk
_______________________________________________
Net-snmp-users mailing list
Net-snmp-users@lists.sourceforge.net
Please see the following page to unsubscribe or change other options:
https://lists.sourceforge.net/lists/listinfo/net-snmp-users

Reply via email to