Re: [Pool] Odd problems today - Leap Second UTC

Fabian Wenk Sun, 01 Jul 2012 08:25:27 -0700

Hello Antonio

On 01.07.2012 00:47, Antonio M. Moreiras wrote:

I had two odd events today, just after midnight (utc).


My nagios monitoring went crazy (all servers really ok, but alarming on
nagios). I think some regular expression in the ntp monitoring plugin
didn't like some new response triggered at entering the day. Probably it
will be OK again after todays's midnight.

I also monitor ntpd with Nagios with check_ntp_peer andcheck_ntp_time. I mostly monitor servers (which have the leapsecond file from NIST) and only one client (Mac OS X 10.6), whichdoes not have the leap second file. Nagios triggered an errorjust after the leap second (local time is in CEST aka UTC+2):

[01-07-2012 02:01:49] flashback CRITICAL NTP time NTP CRITICAL:Offset 0.99893713 secs[01-07-2012 02:08:19] flashback CRITICAL NTP peer NTP CRITICAL:Offset -1.001178 secs, jitter=816.351000, stratum=1[01-07-2012 02:16:49] flashback OK NTP time NTP OK: Offset0.000156879425 secs[01-07-2012 02:18:19] flashback OK NTP peer NTP OK: Offset-0.000113 secs, jitter=0.073000, stratum=1

This is also one of the systems I had setup my script to reporttime during leap second (using unix date, as ntptime was notavailable). It was the only system which did not do the leapsecond, so it went off this 1 second for a short time, whichNagios reported.


Output during leap second from:
for i in `/opt/local/bin/gseq 0 240`; do date ; sleep 0.5 ; done

Sun Jul  1 01:59:57 CEST 2012
Sun Jul  1 01:59:57 CEST 2012
Sun Jul  1 01:59:58 CEST 2012
Sun Jul  1 01:59:58 CEST 2012
Sun Jul  1 01:59:59 CEST 2012
Sun Jul  1 01:59:59 CEST 2012
Sun Jul  1 02:00:00 CEST 2012
Sun Jul  1 02:00:00 CEST 2012
Sun Jul  1 02:00:01 CEST 2012
Sun Jul  1 02:00:01 CEST 2012
Sun Jul  1 02:00:02 CEST 2012
Sun Jul  1 02:00:02 CEST 2012


Output during leap second on a FreeBSD 7.4 system from (on one line):

for i in `/usr/local/bin/gseq 0 240`; do ntptime | awk '/time d3/{print $3" "$4" "$5" "$6" "$7}' | sed 's/,$//' ; sleep 0.5 ; done


Sun, Jul 1 2012 1:59:57.030
Sun, Jul 1 2012 1:59:57.540
Sun, Jul 1 2012 1:59:58.046
Sun, Jul 1 2012 1:59:58.556
Sun, Jul 1 2012 1:59:59.065
Sun, Jul 1 2012 1:59:59.578
Sun, Jul 1 2012 1:59:59.085
Sun, Jul 1 2012 1:59:59.594
Sun, Jul 1 2012 2:00:00.105
Sun, Jul 1 2012 2:00:00.615
Sun, Jul 1 2012 2:00:01.135
Sun, Jul 1 2012 2:00:01.641
Sun, Jul 1 2012 2:00:02.151
Sun, Jul 1 2012 2:00:02.907

The similar output was on a recent Gentoo system. Even a secondscript running on this systems with unix date did report the:59th second twice (four times in the output) and not reportingthe leap second with :60.

Two and half hours later, one of my stratum one servers just crashed:
version="ntpd [email protected] Mon Jan  9 15:21:05 UTC 2012 (1)",
processor="x86_64", system="Linux/2.6.33.7-rt29-pps536-ts22". I didn't
believe in coincidences, and think that it could be related to the
treatment of the leap second in the linux kernel.

I also have one 'older' Linux System in my network, which crashedone hour after the leap second. Seems to be this known LinuxKernel bug (prior to 2.6.29) which is mention in this Slashdotposting [1].

[1]http://it.slashdot.org/story/12/06/30/2123248/the-leap-second-is-here-are-your-systems-ready



bye
Fabian
_______________________________________________
pool mailing list
[email protected]
http://lists.ntp.org/listinfo/pool

Re: [Pool] Odd problems today - Leap Second UTC

Reply via email to