Hi,

How old are the physical servers? I've had crashes due to old BIOSes (and bugs with CPU ACPI state management if I recall correctly).

Cheers
Eneko

On 07/11/14 11:01, [email protected] wrote:
Hello,

we have the problem that from time to time our Proxmox node crash badly (kernel panic). We have checked the machines involved without error and the crash is wandering if along with the main load, so harware should be fine. The load consist of about 10-15 VMs with Linux/Windows and nearly all us virtio block devices which are located at a local RAID1 or RAID10. There are some "special" configurations with serial line access (USV Monitoring) and one machine with a LPT-Dongle but beside this nothing fancy. Shortly before the latest crash we have this in the log:

daemon.log.1:Nov 6 17:05:20 pve-intern-02 pvestatd[2796]: WARNING: unable to connect to VM 30002 socket - timeout after 31 retries daemon.log.1:Nov 6 17:05:30 pve-intern-02 pvestatd[2796]: WARNING: unable to connect to VM 30002 socket - timeout after 31 retries daemon.log.1:Nov 6 17:05:33 pve-intern-02 pvestatd[2796]: WARNING: unable to connect to VM 30005 socket - timeout after 31 retries daemon.log.1:Nov 6 17:05:36 pve-intern-02 pvestatd[2796]: WARNING: unable to connect to VM 31030 socket - timeout after 31 retries daemon.log.1:Nov 6 17:05:39 pve-intern-02 pvestatd[2796]: WARNING: unable to connect to VM 30004 socket - timeout after 31 retries daemon.log.1:Nov 6 17:05:42 pve-intern-02 pvestatd[2796]: WARNING: unable to connect to VM 31013 socket - timeout after 31 retries daemon.log.1:Nov 6 17:05:45 pve-intern-02 pvestatd[2796]: WARNING: unable to connect to VM 30002 socket - timeout after 31 retries daemon.log.1:Nov 6 17:05:48 pve-intern-02 pvestatd[2796]: WARNING: unable to connect to VM 30005 socket - timeout after 31 retries daemon.log.1:Nov 6 17:05:51 pve-intern-02 pvestatd[2796]: WARNING: unable to connect to VM 31030 socket - timeout after 31 retries

Proxmox is latest with community subscription. We will now try to get a panic log with netconsole if this is supported by proxmox

https://openvz.org/Remote_console_setup

Any further ideas are welcome

Thanks

Andreas



_______________________________________________
pve-user mailing list
[email protected]
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user



--
Zuzendari Teknikoa / Director Técnico
Binovo IT Human Project, S.L.
Telf. 943575997
      943493611
Astigarraga bidea 2, planta 6 dcha., ofi. 3-2; 20180 Oiartzun (Gipuzkoa)
www.binovo.es

_______________________________________________
pve-user mailing list
[email protected]
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user

Reply via email to