Our production dovecot/postfix server has been stable for a number of years.
In the last month or so, we are seeing increasing errors such as these:
Dec 6 15:51:20 mail dovecot: imap(redac...@lilythicket.com): Warning:
Transaction log file
/home/vmail/lilythicket.com/diana/Maildir/dovecot.index.log was locked for 322
seconds
Dec 6 15:50:54 mail dovecot: imap(redac...@theormans.com): Warning: Maildir
/home/vmail/theormans.com/connieorman/Maildir/.Junk: Synchronization took 66
seconds (1 new msgs, 0 flag change attempts, 0 expunge attempts)
Dec 6 15:51:43 mail dovecot: master: Error: service(pop3-login): Initial
status notification not received in 30 seconds, killing the process
Dec 6 15:51:43 mail dovecot: master: Error: service(pop3-login): command
startup failed, throttling
Dec 6 15:51:43 mail dovecot: master: Error: service(imap-login): child 5868
killed with signal 9
Dec 6 15:51:43 mail dovecot: master: Error: service(imap-login): command
startup failed, throttling
Dec 6 15:55:31 mail dovecot: imap-login: Fatal: Corrupted SSL
ssl-parameters.dat in state_dir: Truncated file
Dec 6 15:55:32 mail dovecot: pop3-login: Fatal: Error reading configuration:
Timeout reading config from /var/run/dovecot/config
And so forth. Seems to be all over the place. The server slows down to a
crawl. Restarting dovecot or postfix has no effect on the problem. Only a
server reboot solves it, temporarily. Sometimes for weeks, sometimes for
hours. The hard drive SMART status reads okay.
During this time, of course, users cannot connect to check their email.
Thoughts on where to go to troubleshoot this and why it’s happening?
Ethon