Hi, I have a SmartOS server with an old Supermicro motherboard and an LSI HBA. It has been hanging occasionally, and now, after rebooting for unrelated maintenance, I got an old fault management event email that wasn’t delivered earlier for some reason. I ran fmadm faulty and here’s a shortened result:

TIME            EVENT-ID                             MSG-ID         SEVERITY
--------------- ------------------------------------ -------------- ---------
Nov 25 11:47:32 b69129f2-e166-69a9-9270-bb26d90c2bda SUNOS-8000-J0  Major

Fault class : fault.sunos.eft.unexpected_telemetry 50%
              defect.sunos.eft.unexpected_telemetry 50%
Problem in  : dev:////pci@0,0
                  faulted and taken out of service

TIME            EVENT-ID                             MSG-ID         SEVERITY
--------------- ------------------------------------ -------------- ---------
Nov 25 11:47:32 c0745d40-c50a-42e7-e8a6-c6ebc3720114 PCIEX-8000-DJ  Major

Fault class : fault.io.pciex.device-interr max 40%
              fault.io.pciex.bus-noresp 20%
              fault.io.pciex.device-noresp 20%
Affects     : dev:////pci@0,0/pci8086,3408@1/pci15d9,600@0
              dev:////pci@0,0/pci8086,3408@1
                  faulted but still in service
FRU         : "MB" (hc://:product-id=X8DTU:server-id=smartos01:chassis-id=[xxxxxxxxxx]/motherboard=0)
                  faulty

Does this indicate that the problem is likely with the HBA or the motherboard itself? I couldn’t find much with those pci ids.

Thanks,
Perttu

------------------------------------------
illumos: illumos-discuss
Permalink: https://illumos.topicbox.com/groups/discuss/T9109d9d6615bd15c-M00fa00376a32da4ee5a0aa12
Delivery options: https://illumos.topicbox.com/groups/discuss/subscription