Hi All I am running into an issue on a new system Its a Dell R730xd fitted with a Perc h730p ( mr_sas based ) on platform 20170330T015208Z It has been running fine for a while hosting native zones that host postgres slave databases. However we starting seeing this issue where the servers would become unresponsive under high disk load, and while scrubbing.
The serial console is printing out this mr_sas timeout over and over 2017-04-09T01:23:30.813196+00:00 ch2-c105-n02 mr_sas: [ID 270009 kern.warning] WARNING: mr_sas0: io_timeout_checker: FW Fault, calling reset adapter 2017-04-09T01:23:30.813213+00:00 ch2-c105-n02 mr_sas: [ID 643100 kern.notice] mr_sas0: io_timeout_checker: fw_outstanding 0x35 max_fw_cmds 0x39F 2017-04-09T01:23:44.012892+00:00 ch2-c105-n02 mr_sas: [ID 887724 kern.warning] WARNING: mr_sas0: mrsas_tbolt_reset_ppc:resetadapter bit is set already check retry count 101 2017-04-09T01:23:45.012867+00:00 ch2-c105-n02 mr_sas: [ID 270009 kern.warning] WARNING: mr_sas0: io_timeout_checker: FW Fault, calling reset adapter 2017-04-09T01:23:45.012891+00:00 ch2-c105-n02 mr_sas: [ID 643100 kern.notice] mr_sas0: io_timeout_checker: fw_outstanding 0x35 max_fw_cmds 0x39F 2017-04-09T01:23:58.212525+00:00 ch2-c105-n02 mr_sas: [ID 887724 kern.warning] WARNING: mr_sas0: mrsas_tbolt_reset_ppc:resetadapter bit is set already check retry count 101 2017-04-09T01:23:59.212501+00:00 ch2-c105-n02 mr_sas: [ID 270009 kern.warning] WARNING: mr_sas0: io_timeout_checker: FW Fault, calling reset adapter 2017-04-09T01:23:59.212518+00:00 ch2-c105-n02 mr_sas: [ID 643100 kern.notice] mr_sas0: io_timeout_checker: fw_outstanding 0x35 max_fw_cmds 0x39F 2017-04-09T01:24:12.412180+00:00 ch2-c105-n02 mr_sas: [ID 887724 kern.warning] WARNING: mr_sas0: mrsas_tbolt_reset_ppc:resetadapter bit is set already check retry count 101 2017-04-09T01:24:13.412154+00:00 ch2-c105-n02 mr_sas: [ID 270009 kern.warning] WARNING: mr_sas0: io_timeout_checker: FW Fault, calling reset adapter 2017-04-09T01:24:13.412173+00:00 ch2-c105-n02 mr_sas: [ID 643100 kern.notice] mr_sas0: io_timeout_checker: fw_outstanding 0x35 max_fw_cmds 0x39F The dell support systems all say there are no faults in the hardware so I am at a loss on what is going on here. A thread from sept of last year has a similar looking error with another Dell Perc that is also mr_sas based https://www.mail-archive.com/[email protected]/msg03735.html I tried upgrading the mr_sas firmware and there is no change. Anyone have any ideas here ? -- --- Mark Saad [email protected] ------------------------------------------- smartos-discuss Archives: https://www.listbox.com/member/archive/184463/=now RSS Feed: https://www.listbox.com/member/archive/rss/184463/25769125-55cfbc00 Modify Your Subscription: https://www.listbox.com/member/?member_id=25769125&id_secret=25769125-7688e9fb Powered by Listbox: http://www.listbox.com
