Hi, Had a fun time this morning trying to recover a pve node.
I suspect a hardware issue on the host itself (cpu or memory issue). All the kvm guests had a kernel panic, and upon rebooting them all of them had filesystem errors. A preliminary checkup on the raid controller and the disks, I did not find any issues. I do see these entries in the logs: Mar 17 11:14:19 vm004 kernel: kvm: 857701: cpu0 unhandled rdmsr: 0xc0010112 Mar 17 11:14:19 vm004 kernel: kvm: 857701: cpu0 unhandled rdmsr: 0xc0010001 Googling that I found that these should be harmless, but I don't find any of those in the logs of our other nodes. Anyone got any clues? Also, on the node itself, syslog was not running any more, and possibly other processes as well. Cheers, Frederic _______________________________________________ pve-user mailing list [email protected] http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user
