Hi,

I've got a stand-alone SmartOS/Triton hypervisor that's been having some unscheduled, seemingly random reboots. This feels more like a core issue, though, which is why I'm raising it here rather than on the SmartOS ML.

This is a custom build of SmartOS 20220310T004022Z. The two changes we apply are to SEGPDEFSIZE (to 8G) and PORT_DEFAULT_PORTS (to 0x08000). This image is running without any issue on multiple other machines.

There is no evidence of anything in syslog, messages, auth.log, or the BMC logs. There is nothing in /var/crash. fmadm faulty is clean both before and after the reboots. There is, however, some weird logging in last, where it appears as if the system went down, or was told to go down, hours before it actually rebooted.

Entries like:
reboot  system boot             Thu Jun  8 00:22
reboot  system down             Wed Jun  7 15:49
and
reboot  system boot             Sat May 27 23:58
reboot  system down             Fri May 26 13:30
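For reference, here is a rough Python sketch that works out the gap between each "system down" record and the "system boot" record that follows it, using the lines quoted above. The last output does not show a year, so 2023 is an assumption, and the function names are just illustrative.

```python
from datetime import datetime, timedelta

# Sample lines copied from the `last` output quoted above.
SAMPLE = """\
reboot  system boot             Thu Jun  8 00:22
reboot  system down             Wed Jun  7 15:49
reboot  system boot             Sat May 27 23:58
reboot  system down             Fri May 26 13:30
"""


def parse_events(text, year):
    """Return (kind, timestamp) tuples for 'system boot' / 'system down' lines.

    `year` is an assumption supplied by the caller; `last` does not print it.
    """
    events = []
    for line in text.splitlines():
        fields = line.split()
        # Expected shape: reboot system <boot|down> <dow> <mon> <day> <HH:MM>
        if len(fields) != 7 or fields[:2] != ["reboot", "system"]:
            continue
        stamp = datetime.strptime(
            f"{year} {fields[4]} {fields[5]} {fields[6]}", "%Y %b %d %H:%M"
        )
        events.append((fields[2], stamp))
    return events


def down_to_boot_gaps(events):
    """Pair each 'boot' with the 'down' listed directly after it.

    `last` prints newest records first, so the 'down' line that follows a
    'boot' line is the shutdown record preceding that boot.
    """
    gaps = []
    for (kind_a, t_a), (kind_b, t_b) in zip(events, events[1:]):
        if kind_a == "boot" and kind_b == "down":
            gaps.append((t_b, t_a, t_a - t_b))
    return gaps


if __name__ == "__main__":
    for down, boot, gap in down_to_boot_gaps(parse_events(SAMPLE, 2023)):
        print(f"down {down:%b %d %H:%M} -> boot {boot:%b %d %H:%M}: gap {gap}")
```

On the four entries above, this gives gaps of 8 hours 33 minutes and 1 day 10 hours 28 minutes between the recorded "down" and "boot" times.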

Both of these reboots actually occurred a minute or so before the system boot timestamp, so each reboot only took about a minute, and there was no end-user impact at that time of night. The system is just over a year old. The BMC logs show good time, and a scheduled shutdown and boot logged the correct time, so I've no reason to think it's a BIOS battery or hardware clock issue.

I wondered if this could happen if someone scheduled a reboot with a long timeout on a shutdown command (and did it for the GZ rather than a zone by accident), but I've been unable to replicate the last entries in my experiments with either /usr/sbin/shutdown or /usr/ucb/shutdown. One of the shutdowns came close, as it didn't leave any log evidence, but it didn't reproduce those weird last entries.

Has anyone seen this before?

Thanks,
Jon
