Re: [gentoo-user] 4.4.2-hardened and Areca ARC-1110 problems

2016-04-16 Thread J. Roeleveld
On Saturday, April 16, 2016 06:17:28 PM Calum wrote:
> Hello all,
> 
> I have a server (in another country) running Gentoo with 3.17.7-hardened-r1.
> It has a "RAID bus controller: Areca Technology Corp. ARC-1110 4-Port PCI-X
> to SATA RAID Controller" (17d3:1110).
> 
> I have no physical access to it (only a serial console and a Debian
> rescue image to recover with).
> I don't have any access or info about the settings on the RAID controller
> (unless I can get them from the OS).
> 
> This works well enough most of the time, although I do get some random
> hangs that I have to hard reset to recover from.
> 
> I tried updating to linux-4.4.2-hardened (with the same kernel .config) but
> I get the following errors on boot:
> 
> [   35.805083] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.817201] arcmsr0: scsi id = 0 lun = 0 ccb = '0x8800d5120800' poll command abort successfully
> [   35.835617] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.847700] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.859783] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.871870] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.883953] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.896035] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.908117] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.920199] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.932283] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.944364] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.956446] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.968529] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.980611] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   35.992693] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.004775] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.016858] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.028939] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.041021] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.053102] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.065187] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.077269] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.089351] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.101439] arcmsr0: abort device command of scsi id = 0 lun = 0
> [   36.113543] arcmsr: executing bus reset eh.num_resets = 0, num_aborts = 24
> [   36.128326] arcmsr0: executing hw bus reset .
> [   49.149028] arcmsr0: waiting for hw bus reset return, retry=0
> [   59.160983] arcmsr0: waiting for hw bus reset return, retry=1
> [   69.172962] arcmsr0: waiting for hw bus reset return, retry=2
> [   79.184902] arcmsr0: waiting for hw bus reset return, retry=3
> [   89.212917] Areca RAID Controller0: Model ARC-1110, F/W V1.49 2010-12-02
> [   89.240861] arcmsr: scsi  bus reset eh returns with success
> [   96.108524] random: nonblocking pool is initialized
> [  109.252788] arcmsr0: abort device command of scsi id = 0 lun = 0
> [  109.264881] arcmsr0: scsi id = 0 lun = 0 ccb = '0x8800d5120e80' poll command abort successfully
> [  109.283294] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.296764] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.310231] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.323700] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.337176] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.350645] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.364111] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.377580] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.391048] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.404517] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.417986] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.431455] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.444924] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.458393] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.471862] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.485330] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.498799] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.512267] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.525736] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.539206] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.552675] sd 0:0:0:0: Device offlined - not ready after error recovery
> [  109.566144] sd 0:0:0:0: Device offlined - not 

[gentoo-user] 4.4.2-hardened and Areca ARC-1110 problems

2016-04-16 Thread Calum
Hello all,

I have a server (in another country) running Gentoo with 3.17.7-hardened-r1.
It has a "RAID bus controller: Areca Technology Corp. ARC-1110 4-Port PCI-X
to SATA RAID Controller" (17d3:1110).

I have no physical access to it (only a serial console and a Debian
rescue image to recover with).
I don't have any access or info about the settings on the RAID controller
(unless I can get them from the OS).

This works well enough most of the time, although I do get some random
hangs that I have to hard reset to recover from.

I tried updating to linux-4.4.2-hardened (with the same kernel .config) but
I get the following errors on boot:

[   35.805083] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.817201] arcmsr0: scsi id = 0 lun = 0 ccb = '0x8800d5120800' poll command abort successfully
[   35.835617] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.847700] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.859783] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.871870] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.883953] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.896035] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.908117] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.920199] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.932283] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.944364] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.956446] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.968529] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.980611] arcmsr0: abort device command of scsi id = 0 lun = 0
[   35.992693] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.004775] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.016858] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.028939] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.041021] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.053102] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.065187] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.077269] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.089351] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.101439] arcmsr0: abort device command of scsi id = 0 lun = 0
[   36.113543] arcmsr: executing bus reset eh.num_resets = 0, num_aborts = 24
[   36.128326] arcmsr0: executing hw bus reset .
[   49.149028] arcmsr0: waiting for hw bus reset return, retry=0
[   59.160983] arcmsr0: waiting for hw bus reset return, retry=1
[   69.172962] arcmsr0: waiting for hw bus reset return, retry=2
[   79.184902] arcmsr0: waiting for hw bus reset return, retry=3
[   89.212917] Areca RAID Controller0: Model ARC-1110, F/W V1.49 2010-12-02
[   89.240861] arcmsr: scsi  bus reset eh returns with success
[   96.108524] random: nonblocking pool is initialized
[  109.252788] arcmsr0: abort device command of scsi id = 0 lun = 0
[  109.264881] arcmsr0: scsi id = 0 lun = 0 ccb = '0x8800d5120e80' poll command abort successfully
[  109.283294] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.296764] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.310231] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.323700] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.337176] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.350645] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.364111] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.377580] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.391048] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.404517] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.417986] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.431455] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.444924] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.458393] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.471862] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.485330] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.498799] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.512267] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.525736] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.539206] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.552675] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.566144] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.579612] sd 0:0:0:0: Device offlined - not ready after error recovery
[  109.593087] sd 0:0:0:0: [sda] tag#9 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
[