Re: [PVE-User] hardware issue?
> hmm, mpt2sas seems to be handling perc (LSI) controllers so I'd guess you have a raid controller.

It's a PERC H200I, FW Revision: 7.15.08.00-IR

> Please check which SAS controller you have (lspci -nn) and its configuration.

# lspci -nn
00:00.0 Host bridge [0600]: Intel Corporation 5500 I/O Hub to ESI Port [8086:3403] (rev 13)
00:01.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 1 [8086:3408] (rev 13)
00:03.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 3 [8086:340a] (rev 13)
00:07.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 [8086:340e] (rev 13)
00:09.0 PCI bridge [0604]: Intel Corporation 7500/5520/5500/X58 I/O Hub PCI Express Root Port 9 [8086:3410] (rev 13)
00:0a.0 PCI bridge [0604]: Intel Corporation 7500/5520/5500/X58 I/O Hub PCI Express Root Port 10 [8086:3411] (rev 13)
00:14.0 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub System Management Registers [8086:342e] (rev 13)
00:14.1 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub GPIO and Scratch Pad Registers [8086:3422] (rev 13)
00:14.2 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub Control Status and RAS Registers [8086:3423] (rev 13)
00:16.0 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset QuickData Technology Device [8086:3430] (rev 13)
00:16.1 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset QuickData Technology Device [8086:3431] (rev 13)
00:16.2 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset QuickData Technology Device [8086:3432] (rev 13)
00:16.3 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset QuickData Technology Device [8086:3433] (rev 13)
00:16.4 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset QuickData Technology Device [8086:3429] (rev 13)
00:16.5 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset QuickData Technology Device [8086:342a] (rev 13)
00:16.6 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset QuickData Technology Device [8086:342b] (rev 13)
00:16.7 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset QuickData Technology Device [8086:342c] (rev 13)
00:1a.0 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #4 [8086:3a37]
00:1a.1 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #5 [8086:3a38]
00:1a.7 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #2 [8086:3a3c]
00:1d.0 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #1 [8086:3a34]
00:1d.1 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #2 [8086:3a35]
00:1d.2 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #3 [8086:3a36]
00:1d.3 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #6 [8086:3a39]
00:1d.7 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #1 [8086:3a3a]
00:1e.0 PCI bridge [0604]: Intel Corporation 82801 PCI Bridge [8086:244e] (rev 90)
00:1f.0 ISA bridge [0601]: Intel Corporation 82801JIR (ICH10R) LPC Interface Controller [8086:3a16]
01:00.0 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5716 Gigabit Ethernet [14e4:163b] (rev 20)
01:00.1 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5716 Gigabit Ethernet [14e4:163b] (rev 20)
02:00.0 Serial Attached SCSI controller [0107]: LSI Logic / Symbios Logic SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] [1000:0072] (rev 03)
04:00.0 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5709 Gigabit Ethernet [14e4:1639] (rev 20)
04:00.1 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5709 Gigabit Ethernet [14e4:1639] (rev 20)
05:00.0 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5709 Gigabit Ethernet [14e4:1639] (rev 20)
05:00.1 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5709 Gigabit Ethernet [14e4:1639] (rev 20)
06:03.0 VGA compatible controller [0300]: Matrox Electronics Systems Ltd. MGA G200eW WPCM450 [102b:0532] (rev 0a)

http://mirrors.verycdn.com.cn/pub/1.jpg
http://mirrors.verycdn.com.cn/pub/2.jpg

lyt_y...@126.com

From: Alessandro Briosi
Date: 2013-09-12 14:52
To: lyt_y...@126.com
CC: pve-user
Subject: Re: [PVE-User] hardware issue?

On 12/09/2013 04:11, lyt_y...@126.com wrote:
> Thank you very much for your reply!
>
>> It obviously is caused by the raid (either hardware or software)
>
> No RAID; each disk is used on its own.

hmm, mpt2sas seems to be handling perc (LSI) controllers so I'd guess you have a raid controller.

>> Does it say something on the console when you reboot the server?
>
> No, nothing; everything looks normal.

ok

>> I'd try removing the BBU as a test, and if it does not crash ask dell to replace it.
>
> I'll try it and keep following up.

well, if you do
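For anyone reading a long `lspci -nn` listing like the one in this message, the storage controller can be picked out by its PCI class code: base class 01 is mass storage, and 0107 is Serial Attached SCSI. Below is a minimal sketch in Python, assuming Python 3 is available; the names `LINE_RE`, `storage_controllers` and `SAMPLE` are made up for illustration, and the sample lines are copied from the listing in this message.

```python
import re

# One line of `lspci -nn` output looks like:
#   02:00.0 Serial Attached SCSI controller [0107]: LSI ... [1000:0072] (rev 03)
# The "(rev xx)" suffix is optional (the USB controllers here have none).
LINE_RE = re.compile(
    r"^(?P<slot>\S+)\s+(?P<class_name>.+?)\s+\[(?P<class_id>[0-9a-f]{4})\]:\s+"
    r"(?P<device>.+)\s+\[(?P<vendor>[0-9a-f]{4}):(?P<product>[0-9a-f]{4})\]"
    r"(?:\s+\(rev (?P<rev>[0-9a-f]+)\))?$"
)


def storage_controllers(lspci_output):
    """Return parsed entries whose PCI base class is 01 (mass storage)."""
    found = []
    for line in lspci_output.splitlines():
        match = LINE_RE.match(line.strip())
        if match and match.group("class_id").startswith("01"):
            found.append(match.groupdict())
    return found


# Three lines taken from the listing above; SAMPLE is a hypothetical name.
SAMPLE = """\
01:00.0 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5716 Gigabit Ethernet [14e4:163b] (rev 20)
02:00.0 Serial Attached SCSI controller [0107]: LSI Logic / Symbios Logic SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] [1000:0072] (rev 03)
06:03.0 VGA compatible controller [0300]: Matrox Electronics Systems Ltd. MGA G200eW WPCM450 [102b:0532] (rev 0a)
"""

if __name__ == "__main__":
    for dev in storage_controllers(SAMPLE):
        print(dev["slot"], dev["device"], dev["vendor"] + ":" + dev["product"])
```

On this machine the only match is the device at 02:00.0, the LSI SAS2008 with ID 1000:0072, which is the chip the mpt2sas driver is handling.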
Re: [PVE-User] hardware issue?
On 12/09/2013 09:33, lyt_y...@126.com wrote:
> It's PERC H200I, FW Revision:7.15.08.00-IR

OK. I think the H200 does not have a BBU, so it can't be that.

First, check the firmware.

I'd run some tests with another distro that has a more up-to-date kernel (there has been activity on the driver, but I don't know how much of it has been backported). If that works, it is probably a bug in mpt2sas, though Proxmox uses the Red Hat kernel, so somebody else should have run into it by now :)

It could also be because you are using the RAID controller as a plain controller and not as a RAID.

There's also a Dell version of the driver for the RH6 kernel, which seems pretty recent. You could try that one too (but then be careful when upgrading, and be sure to have console access to the server). [1]

But I have no experience with the H200, only bigger models, so these are simply some guesses.

Alessandro

[1] http://www.dell.com/support/drivers/us/en/bmdhs1/DriverDetails?driverId=NTNH8
___
pve-user mailing list
pve-user@pve.proxmox.com
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user
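Before comparing against another distro or the Dell driver package, it helps to record which kernel and which mpt2sas version are actually running. A minimal sketch in Python, assuming the loaded module exports its version under `/sys/module/mpt2sas/version` (modules that declare a version normally do, but treat that as an assumption for this kernel); `read_module_version` is a hypothetical helper name.

```python
import os
import platform


def read_module_version(name, sysfs_root="/sys/module"):
    """Return the version string a loaded kernel module exports, or None.

    Returns None when the module is not loaded or exports no version file.
    """
    path = os.path.join(sysfs_root, name, "version")
    try:
        with open(path) as handle:
            return handle.read().strip()
    except OSError:
        return None


if __name__ == "__main__":
    print("kernel: ", platform.release())
    version = read_module_version("mpt2sas")
    print("mpt2sas:", version or "not loaded, or no version exported")
```

Running the same script after booting the other distro, or after installing the Dell driver, gives a like-for-like record of what changed between a crashing and a non-crashing setup.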
Re: [PVE-User] hardware issue?
On 11/09/2013 03:30, lyt_y...@126.com wrote:
> hi,
> This device configuration is Dell R510:
> 2TB SAS Disk x 12
> 64G Mem
> Intel Xeon CPU E5620 x 2
> 6Gbps SAS Controller (MPT2BIOS-7.11.10.00 (2011.06.02))
> Recently the kernel on this machine has been crashing, about once every two days.
> I have checked the hardware and found no faults.

It obviously is caused by the raid (either hardware or software).
I had similar problems in the past, and it was the BBU unit of the raid controller.

Does it say something on the console when you reboot the server?

I'd try removing the BBU as a test, and if it does not crash ask Dell to replace it.

Alessandro
Re: [PVE-User] hardware issue?
Thank you very much for your reply!

> It obviously is caused by the raid (either hardware or software)

No RAID; each disk is used on its own.

> Does it say something on the console when you reboot the server?

No, nothing; everything looks normal.

> I'd try removing the BBU as a test, and if it does not crash ask dell to replace it.

I'll try it and keep following up.

lyt_y...@126.com

From: Alessandro Briosi
Date: 2013-09-11 22:56
To: lyt_y...@126.com
CC: pve-user
Subject: Re: [PVE-User] hardware issue?

On 11/09/2013 03:30, lyt_y...@126.com wrote:
> hi,
> This device configuration is Dell R510:
> 2TB SAS Disk x 12
> 64G Mem
> Intel Xeon CPU E5620 x 2
> 6Gbps SAS Controller (MPT2BIOS-7.11.10.00 (2011.06.02))
> Recently the kernel on this machine has been crashing, about once every two days.
> I have checked the hardware and found no faults.

It obviously is caused by the raid (either hardware or software).
I had similar problems in the past, and it was the BBU unit of the raid controller.

Does it say something on the console when you reboot the server?

I'd try removing the BBU as a test, and if it does not crash ask Dell to replace it.

Alessandro