Hi all,
  I am running into an issue on a new system. It's a Dell R730xd fitted
with a PERC H730P (mr_sas based) on platform 20170330T015208Z. It has
been running fine for a while, hosting native zones that host postgres
slave databases. However, we started seeing an issue where the
servers become unresponsive under high disk load and while
scrubbing.

The serial console prints this mr_sas timeout over and over:

2017-04-09T01:23:30.813196+00:00 ch2-c105-n02 mr_sas: [ID 270009
kern.warning] WARNING: mr_sas0: io_timeout_checker: FW Fault, calling
reset adapter
2017-04-09T01:23:30.813213+00:00 ch2-c105-n02 mr_sas: [ID 643100
kern.notice] mr_sas0: io_timeout_checker: fw_outstanding 0x35
max_fw_cmds 0x39F
2017-04-09T01:23:44.012892+00:00 ch2-c105-n02 mr_sas: [ID 887724
kern.warning] WARNING: mr_sas0: mrsas_tbolt_reset_ppc:resetadapter bit
is set already check retry count 101
2017-04-09T01:23:45.012867+00:00 ch2-c105-n02 mr_sas: [ID 270009
kern.warning] WARNING: mr_sas0: io_timeout_checker: FW Fault, calling
reset adapter
2017-04-09T01:23:45.012891+00:00 ch2-c105-n02 mr_sas: [ID 643100
kern.notice] mr_sas0: io_timeout_checker: fw_outstanding 0x35
max_fw_cmds 0x39F
2017-04-09T01:23:58.212525+00:00 ch2-c105-n02 mr_sas: [ID 887724
kern.warning] WARNING: mr_sas0: mrsas_tbolt_reset_ppc:resetadapter bit
is set already check retry count 101
2017-04-09T01:23:59.212501+00:00 ch2-c105-n02 mr_sas: [ID 270009
kern.warning] WARNING: mr_sas0: io_timeout_checker: FW Fault, calling
reset adapter
2017-04-09T01:23:59.212518+00:00 ch2-c105-n02 mr_sas: [ID 643100
kern.notice] mr_sas0: io_timeout_checker: fw_outstanding 0x35
max_fw_cmds 0x39F
2017-04-09T01:24:12.412180+00:00 ch2-c105-n02 mr_sas: [ID 887724
kern.warning] WARNING: mr_sas0: mrsas_tbolt_reset_ppc:resetadapter bit
is set already check retry count 101
2017-04-09T01:24:13.412154+00:00 ch2-c105-n02 mr_sas: [ID 270009
kern.warning] WARNING: mr_sas0: io_timeout_checker: FW Fault, calling
reset adapter
2017-04-09T01:24:13.412173+00:00 ch2-c105-n02 mr_sas: [ID 643100
kern.notice] mr_sas0: io_timeout_checker: fw_outstanding 0x35
max_fw_cmds 0x39F
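
For anyone reading the counters in those messages, here is a minimal
sketch that decodes the two hex values into decimal. The regex and the
helper name decode_counters are my own for illustration, not part of
mr_sas, and it assumes the wrapped message has been joined back onto
one line.

```python
import re

# Hypothetical helper (not part of the mr_sas driver): extracts the two
# hex counters from an io_timeout_checker message.
COUNTERS = re.compile(
    r"fw_outstanding (0x[0-9A-Fa-f]+)\s+max_fw_cmds (0x[0-9A-Fa-f]+)"
)

def decode_counters(line):
    """Return (fw_outstanding, max_fw_cmds) as integers, or None if absent."""
    m = COUNTERS.search(line)
    if m is None:
        return None
    return int(m.group(1), 16), int(m.group(2), 16)

# Message text taken from the console output above, unwrapped.
sample = ("kern.notice] mr_sas0: io_timeout_checker: "
          "fw_outstanding 0x35 max_fw_cmds 0x39F")
print(decode_counters(sample))  # (53, 927)
```

If I am reading the counter names right, that is 53 commands
outstanding against a limit of 927, so the command queue does not look
full when the timeout fires.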


The Dell support systems all say there are no faults in the hardware,
so I am at a loss as to what is going on here.

A thread from September of last year has a similar-looking error with
another Dell PERC that is also mr_sas based:

https://www.mail-archive.com/[email protected]/msg03735.html

I tried upgrading the mr_sas firmware, but there was no change.

Does anyone have any ideas here?


-- 
---
Mark Saad
[email protected]

