Re: mail stopped coming in

Jeroen Geilman Thu, 05 Jul 2012 17:12:58 -0700

On 07/03/2012 07:10 PM, myron wrote:

On Jul 3, 2012, at 1:00 PM, Bill Cole wrote:
On 3 Jul 2012, at 12:07, myron wrote:
On Jul 3, 2012, at 11:52 AM, Ralf Hildebrandt wrote:
* myron <kowal...@cs.moravian.edu>:
This morning I came in to find no new mail in my inbox. I ended up
reboot the server. These were the log entries that seem to indicate
when and the
problem was. I checked the time on the server and it was not off at
all. Can someone suggest from these entries what the problem was?

Jul  2 20:52:11 errol dovecot: dovecot: Fatal: Time just moved
backwards by 9 seconds. This might cause a lot of problems, so I'll
just kill myself now. http://wiki.dovecot.org/TimeMovedBackwards
Dovevot killed itself because the time moved backwards by 9 seconds.
BTW: why? Use NTP!
FWIW, using NTP (badly) can be the proximate cause of such abackwards jump. It's probably the most common one behind a humanmanually changing the system time. An unregulated clock chip jumpingbackwards is definitionally defective.
I am (was) using ntp. I don't know why the time moved backwards. Itwasn't when I checked the time before rebooting.
root@errol:/var/log# ps -ef | grep ntpd
ntp 1049 1 0 08:25 ? 00:00:00 /usr/sbin/ntpd -p/var/run/ntpd.pid -g -u 107:116
root      4457  1996  0 12:03 pts/0    00:00:00 grep --color=auto ntpd
running ntpd != functional ntpd.
It used to be that firewalls and trivial configurations were the mostcommon causes of NTP running without actually keeping the systemclock stable and accurate. These days it is more common forvirtualization to be the cause of trouble which can be highly opaque.Subjectively, I have also been convinced that server RTC chips havehad a huge drop in accuracy and stability in the past decade, causingthe drift assumptions in traditionally good-enough ntpdconfigurations to break down. It also doesn't help that apparently alot of software can't handle leap seconds, and that caused lots oftrouble at 00:00 GMT 7/1.
A deep discussion of solid time synch would be off-topic, but as youcan see it is absolutely critical when using Dovecot SASL to eithersolve the time synch problem or catch the well-documented failuremode of Dovecot shooting itself in the head to avoid doing bad things.
It definitely reset more than a few seconds. It looks like I lostinternet connection down the line with the ntp servers.

That has absolutely no impact on the accuracy or continuity of the*local ntp server*.

Keep in mind that ntpd contains thousands of lines of code to ensurethat the time keeps running ACCURATELY, even in prolonged absence ofhigher-stratum peers.

syslog.1:Jul  2 20:52:11 errol ntpd[24983]: time reset -10.407874 s
syslog.1:Jul 2 20:52:11 errol ntpd[24983]: kernel time sync statuschange 6001syslog.1:Jul 2 20:57:56 errol ntpd[24983]: synchronized to91.189.94.4, stratum 2syslog.1:Jul 2 20:57:56 errol ntpd[24983]: kernel time sync statuschange 2001
syslog.1:Jul  2 21:07:31 errol ntpd[24983]: no servers reachable
syslog.1:Jul 2 21:14:59 errol ntpd[24983]: synchronized to91.189.94.4, stratum 2
syslog.1:Jul  2 21:16:13 errol ntpd[24983]: time reset +10.403478 s
syslog.1:Jul 2 21:26:33 errol ntpd[24983]: synchronized to91.189.94.4, stratum 2

Instead, I suspect this is one more symptom of the leap second bug thatkeeps cropping up this week.

Consult The Interwebs for your version of ntpd and see if others reportthe same.



--
J.

Re: mail stopped coming in

Reply via email to