[pve-devel] need help to debug random host freeze on multiple hosts

2014-12-28 Thread Alexandre DERUMIER
Hi, I have random host freeze since around 1 week, all hosts are opteron 63XX or 61XX. kernel is pve-kernel-3.10-4 or pve-kernel-3.10-5, qemu 2.1. They were all fine since months. I have also tried last rhel 7.1beta kernel with no success. I really don't known how to debug that, because the

Re: [pve-devel] need help to debug random host freeze on multiple hosts

2014-12-28 Thread Stefan Priebe - Profihost AG
What about magic sysrq ? Does that still react? Screen frozen or black? Stefan Excuse my typo sent from my mobile phone. Am 28.12.2014 um 17:37 schrieb Alexandre DERUMIER aderum...@odiso.com: Hi, I have random host freeze since around 1 week, all hosts are opteron 63XX or 61XX.

Re: [pve-devel] need help to debug random host freeze on multiple hosts

2014-12-28 Thread Michael Rasmussen
On Sun, 28 Dec 2014 17:37:50 +0100 (CET) Alexandre DERUMIER aderum...@odiso.com wrote: I really don't known how to debug that, because the system freeze, and I don't have any kernel panic output in display or serial. Can somebody help me to add something to have debug output ? Bad RAM

Re: [pve-devel] need help to debug random host freeze on multiple hosts

2014-12-28 Thread Michael Rasmussen
On Sun, 28 Dec 2014 19:02:04 +0100 Michael Rasmussen m...@datanom.net wrote: On Sun, 28 Dec 2014 17:37:50 +0100 (CET) Alexandre DERUMIER aderum...@odiso.com wrote: I really don't known how to debug that, because the system freeze, and I don't have any kernel panic output in display or

Re: [pve-devel] need help to debug random host freeze on multiple hosts

2014-12-28 Thread Alexandre DERUMIER
Bad RAM stick? Bad PSU? Overheating of the CPU? No errors reporting in dell Idrac. (I have the problem on 6 differents nodes.) I was also thinking of electrical problem, but voltages don't report any error. Maybe the only difference is that I have more load currently on all my nodes

Re: [pve-devel] need help to debug random host freeze on multiple hosts

2014-12-28 Thread Cesar Peschiera
Maybe i ask you a silly question, did you see the syslog and kern.log file? - Original Message - From: Alexandre DERUMIER aderum...@odiso.com To: datanom.net m...@datanom.net Cc: pve-devel pve-devel@pve.proxmox.com Sent: Monday, December 29, 2014 1:49 AM Subject: Re: [pve-devel] need

Re: [pve-devel] need help to debug random host freeze on multiple hosts

2014-12-28 Thread Alexandre DERUMIER
Maybe i ask you a silly question, did you see the syslog and kern.log file? Yes sure , I have nothing in logs. (That's why I thinked of kdump to try to have more info). I'll really don't known if it's a software real kernel panic, or a hardware bug. I just see on vmware forum some amd

Re: [pve-devel] need help to debug random host freeze on multiple hosts

2014-12-28 Thread Cesar Peschiera
I know that this isn't a solution, but i will tell you only as a comment for future decisions: Long time ago, when i worked with Novell Netware, i had a problem of cache in the AMD processor, so i had that disable it, and after, this server was very slow, but was stable. Since that time i never

Re: [pve-devel] need help to debug random host freeze on multiple hosts

2014-12-28 Thread Cesar Peschiera
I know that this isn't a solution, but i will tell you only as a comment for future decisions: Long time ago, when i worked with Novell Netware, i had a problem of cache in the AMD processor, so i had that disable it, and after, this server was very slow, but was stable. Since that time i never