Re: [PVE-User] hardware issue?

2013-09-12 Thread lyt_y...@126.com
hmm, mpt2sas seems to be handling perc (LSI) controllers so I'd guess 
you have a raid controller.
It's PERC H200I, FW Revision:7.15.08.00-IR 

Please check which SAS controller you have (lspci -nn) and it's 
configuration.
#lspci -nn
00:00.0 Host bridge [0600]: Intel Corporation 5500 I/O Hub to ESI Port 
[8086:3403] (rev 13)
00:01.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express 
Root Port 1 [8086:3408] (rev 13)
00:03.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express 
Root Port 3 [8086:340a] (rev 13)
00:07.0 PCI bridge [0604]: Intel Corporation 5520/5500/X58 I/O Hub PCI Express 
Root Port 7 [8086:340e] (rev 13)
00:09.0 PCI bridge [0604]: Intel Corporation 7500/5520/5500/X58 I/O Hub PCI 
Express Root Port 9 [8086:3410] (rev 13)
00:0a.0 PCI bridge [0604]: Intel Corporation 7500/5520/5500/X58 I/O Hub PCI 
Express Root Port 10 [8086:3411] (rev 13)
00:14.0 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub System 
Management Registers [8086:342e] (rev 13)
00:14.1 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub GPIO and 
Scratch Pad Registers [8086:3422] (rev 13)
00:14.2 PIC [0800]: Intel Corporation 7500/5520/5500/X58 I/O Hub Control Status 
and RAS Registers [8086:3423] (rev 13)
00:16.0 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset 
QuickData Technology Device [8086:3430] (rev 13)
00:16.1 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset 
QuickData Technology Device [8086:3431] (rev 13)
00:16.2 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset 
QuickData Technology Device [8086:3432] (rev 13)
00:16.3 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset 
QuickData Technology Device [8086:3433] (rev 13)
00:16.4 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset 
QuickData Technology Device [8086:3429] (rev 13)
00:16.5 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset 
QuickData Technology Device [8086:342a] (rev 13)
00:16.6 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset 
QuickData Technology Device [8086:342b] (rev 13)
00:16.7 System peripheral [0880]: Intel Corporation 5520/5500/X58 Chipset 
QuickData Technology Device [8086:342c] (rev 13)
00:1a.0 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB 
UHCI Controller #4 [8086:3a37]
00:1a.1 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB 
UHCI Controller #5 [8086:3a38]
00:1a.7 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB2 
EHCI Controller #2 [8086:3a3c]
00:1d.0 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB 
UHCI Controller #1 [8086:3a34]
00:1d.1 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB 
UHCI Controller #2 [8086:3a35]
00:1d.2 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB 
UHCI Controller #3 [8086:3a36]
00:1d.3 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB 
UHCI Controller #6 [8086:3a39]
00:1d.7 USB controller [0c03]: Intel Corporation 82801JI (ICH10 Family) USB2 
EHCI Controller #1 [8086:3a3a]
00:1e.0 PCI bridge [0604]: Intel Corporation 82801 PCI Bridge [8086:244e] (rev 
90)
00:1f.0 ISA bridge [0601]: Intel Corporation 82801JIR (ICH10R) LPC Interface 
Controller [8086:3a16]
01:00.0 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5716 
Gigabit Ethernet [14e4:163b] (rev 20)
01:00.1 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5716 
Gigabit Ethernet [14e4:163b] (rev 20)
02:00.0 Serial Attached SCSI controller [0107]: LSI Logic / Symbios Logic 
SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] [1000:0072] (rev 03)
04:00.0 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5709 
Gigabit Ethernet [14e4:1639] (rev 20)
04:00.1 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5709 
Gigabit Ethernet [14e4:1639] (rev 20)
05:00.0 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5709 
Gigabit Ethernet [14e4:1639] (rev 20)
05:00.1 Ethernet controller [0200]: Broadcom Corporation NetXtreme II BCM5709 
Gigabit Ethernet [14e4:1639] (rev 20)
06:03.0 VGA compatible controller [0300]: Matrox Electronics Systems Ltd. MGA 
G200eW WPCM450 [102b:0532] (rev 0a)

http://mirrors.verycdn.com.cn/pub/1.jpg
http://mirrors.verycdn.com.cn/pub/2.jpg





lyt_y...@126.com

From: Alessandro Briosi
Date: 2013-09-12 14:52
To: lyt_y...@126.com
CC: pve-user
Subject: Re: [PVE-User] hardware issue?
Il 12/09/2013 04:11, lyt_y...@126.com ha scritto:
 very tks you reply!
  
It obviously is caused by the raid (either hardware or software)
 no raid, each disk alone use
  

hmm, mpt2sas seems to be handling perc (LSI) controllers so I'd guess
you have a raid controller.

Does it say something on the console when you reboot the server?
 no,nothing, all normal
  
ok

I'd try removing the BBU as a test, and if it does not crash ask dell to
 replace it.
 I'll try it,and continue to follow up

well, if you do

Re: [PVE-User] hardware issue?

2013-09-12 Thread Alessandro Briosi
Il 12/09/2013 09:33, lyt_y...@126.com ha scritto:
 It's PERC H200I, FW Revision:7.15.08.00-IR

ok. I think the h200 does not have a BBU, so it can't be that one.

first check the firmware

I'd do some tests with some other distro which has a more updated kernel
(there has been activity on the driver, but dunno how much of this has
been backported).

If that works then probably it might be a bug in mpt2sas, though proxmox
is using RedHat kernel so it should have come up by now from somebody
else :)

It could also be 'cause you are using the raid as a simple controller
and not as a raid.

There's also a dell version of the driver for RH6 kernel, which seems
pretty recent. You could try that one too (but then be carefull when
upgrading, and be sure to have console access to the server). [1]

But I have no experience with h200, only bigger models, so this are
simply some guesses.

Alessandro

[1]
http://www.dell.com/support/drivers/us/en/bmdhs1/DriverDetails?driverId=NTNH8


___
pve-user mailing list
pve-user@pve.proxmox.com
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user


Re: [PVE-User] hardware issue?

2013-09-11 Thread Alessandro Briosi
Il 11/09/2013 03:30, lyt_y...@126.com ha scritto:
 hi,
 This device configuration is Dell R510:
 2TB SAS Disk x 12
 64G Mem
 Intel Xeon CPU E5620 x 2
 6Gbps SAS Controller(MPT2BIOS-7.11.10.00(2011.06.02))
  
 Recently,the kernel of the device is crashed,and occurs once every two days.
  
 I have checked the hardware without exception.

It obviously is caused by the raid (either hardware or software)

I had similar problems in the past, and it was the BBU unit of the raid
controller.

Does it say something on the console when you reboot the server?

I'd try removing the BBU as a test, and if it does not crash ask dell to
replace it.

Alessandro
___
pve-user mailing list
pve-user@pve.proxmox.com
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user


Re: [PVE-User] hardware issue?

2013-09-11 Thread lyt_y...@126.com
very tks you reply!

It obviously is caused by the raid (either hardware or software)
no raid, each disk alone use

Does it say something on the console when you reboot the server?
no,nothing, all normal

I'd try removing the BBU as a test, and if it does not crash ask dell to
replace it.
I'll try it,and continue to follow up




lyt_y...@126.com

From: Alessandro Briosi
Date: 2013-09-11 22:56
To: lyt_y...@126.com
CC: pve-user
Subject: Re: [PVE-User] hardware issue?
Il 11/09/2013 03:30, lyt_y...@126.com ha scritto:
 hi,
 This device configuration is Dell R510:
 2TB SAS Disk x 12
 64G Mem
 Intel Xeon CPU E5620 x 2
 6Gbps SAS Controller(MPT2BIOS-7.11.10.00(2011.06.02))
  
 Recently,the kernel of the device is crashed,and occurs once every two days.
  
 I have checked the hardware without exception.

It obviously is caused by the raid (either hardware or software)

I had similar problems in the past, and it was the BBU unit of the raid
controller.

Does it say something on the console when you reboot the server?

I'd try removing the BBU as a test, and if it does not crash ask dell to
replace it.

Alessandro___
pve-user mailing list
pve-user@pve.proxmox.com
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user