Re: mptsas crash on expander hot-remove

2017-06-20 Thread John Garry

On 19/06/2017 09:23, Johannes Thumshirn wrote:

On Fri, Jun 16, 2017 at 07:57:08PM -0400, Will Simoneau wrote:

Is this is a known / obvious issue, or should I try to bisect it?




Out of curiousity, does this issue also occur when only the disk is 
hot-removed from the expander?



This is a known issue with SAS (and FC) drivers, although no soulution exists
to this point.

Johannes






Re: mptsas crash on expander hot-remove

2017-06-19 Thread Johannes Thumshirn
On Fri, Jun 16, 2017 at 07:57:08PM -0400, Will Simoneau wrote:
> Is this is a known / obvious issue, or should I try to bisect it?

This is a known issue with SAS (and FC) drivers, although no soulution exists
to this point.

Johannes

-- 
Johannes Thumshirn  Storage
jthumsh...@suse.de+49 911 74053 689
SUSE LINUX GmbH, Maxfeldstr. 5, 90409 Nürnberg
GF: Felix Imendörffer, Jane Smithard, Graham Norton
HRB 21284 (AG Nürnberg)
Key fingerprint = EC38 9CAB C2C4 F25D 8600 D0D0 0393 969D 2D76 0850


mptsas crash on expander hot-remove

2017-06-16 Thread Will Simoneau
I've got a disk box with a SAS expander connected to a card based on the 
LSI SAS1068E chip. I recently upgraded the machine from 4.5.7 to 4.11.5, 
and now disconnecting (i.e. hot removing) the SAS expander from the HBA 
results in fireworks like this:


[ 4738.044119]  end_device-6:0:0: mptsas: ioc2: removing sata device: 
fw_channel 0, fw_id 8, phy 0,sas_addr 0x500194752200
[ 4738.044126]  phy-6:0:8: mptsas: ioc2: delete phy 0, phy-obj 
(0x883f21a80c00)
[ 4738.044143]  port-6:0:0: mptsas: ioc2: delete port 0, sas_addr 
(0x500194752200)
[ 4738.051435]  end_device-6:0:3: mptsas: ioc2: removing ssp device: fw_channel 
0, fw_id 13, phy 5,sas_addr 0x5000c5000184240d
[ 4738.051442]  phy-6:0:13: mptsas: ioc2: delete phy 5, phy-obj 
(0x883f21a86c00)
[ 4738.051456]  port-6:0:3: mptsas: ioc2: delete port 3, sas_addr 
(0x5000c5000184240d)
[ 4738.054796]  end_device-6:0:2: mptsas: ioc2: removing sata device: 
fw_channel 0, fw_id 12, phy 4,sas_addr 0x500194752204
[ 4738.054801]  phy-6:0:12: mptsas: ioc2: delete phy 4, phy-obj 
(0x883f21a85400)
[ 4738.054814]  port-6:0:2: mptsas: ioc2: delete port 2, sas_addr 
(0x500194752204)
[ 4738.062425]  end_device-6:0:1: mptsas: ioc2: removing ssp device: fw_channel 
0, fw_id 9, phy 1,sas_addr 0x5000c5000182d58d
[ 4738.062432]  phy-6:0:9: mptsas: ioc2: delete phy 1, phy-obj 
(0x883f21a82400)
[ 4738.062446]  port-6:0:1: mptsas: ioc2: delete port 1, sas_addr 
(0x5000c5000182d58d)
[ 4738.062558]  end_device-6:0:0: mptsas: ioc2: removing sata device: 
fw_channel 0, fw_id 8, phy 0,sas_addr 0x500194752200
[ 4738.062560]  phy-6:0:8: mptsas: ioc2: delete phy 0, phy-obj 
(0x883f21a80c00)
[ 4738.062564]  port-6:0:0: mptsas: ioc2: delete port 0, sas_addr 
(0x500194752200)
[ 4738.062937]  end_device-6:0:7: mptsas: ioc2: removing ssp device: fw_channel 
0, fw_id 32, phy 24,sas_addr 0x50019475223e
[ 4738.062939]  phy-6:0:32: mptsas: ioc2: delete phy 24, phy-obj 
(0x883f21a78800)
[ 4738.062946]  port-6:0:7: mptsas: ioc2: delete port 7, sas_addr 
(0x50019475223e)
[ 4738.065679]  end_device-6:0:5: mptsas: ioc2: removing sata device: 
fw_channel 0, fw_id 20, phy 12,sas_addr 0x50019475220c
[ 4738.065683]  phy-6:0:20: mptsas: ioc2: delete phy 12, phy-obj 
(0x883f21a9a000)
[ 4738.065698]  port-6:0:5: mptsas: ioc2: delete port 5, sas_addr 
(0x50019475220c)
[ 4738.074843]  end_device-6:0:4: mptsas: ioc2: removing sata device: 
fw_channel 0, fw_id 16, phy 8,sas_addr 0x500194752208
[ 4738.074856]  phy-6:0:16: mptsas: ioc2: delete phy 8, phy-obj 
(0x883f21a82000)
[ 4738.074883]  port-6:0:4: mptsas: ioc2: delete port 4, sas_addr 
(0x500194752208)
[ 4738.136115] sd 6:0:3:0: [sdl] Synchronizing SCSI cache
[ 4738.136192] sd 6:0:3:0: [sdl] Synchronize Cache(10) failed: Result: 
hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
[ 4738.225172] [ cut here ]
[ 4738.225188] WARNING: CPU: 0 PID: 19546 at fs/sysfs/group.c:237 
sysfs_remove_group+0x89/0x90
[ 4738.225189] sysfs group 'power' not found for kobject 'target6:0:0'
[ 4738.225191] Modules linked in: iscsi_tcp libiscsi_tcp libiscsi 
scsi_transport_iscsi serpent_sse2_x86_64 serpent_generic ablk_helper 
algif_skcipher af_alg vmnet(O) vmblock(O) vmmon(O) vmw_vsock_vmci_transport 
vsock vmw_vmci nfsd rpcsec_gss_krb5 auth_rpcgss oid_registry nfsv4 nfs lockd 
grace sunrpc bonding cachefiles fscache binfmt_misc usb_storage ipmi_ssif 
pl2303 amdgpu usbserial snd_oxygen dcdbas i2c_algo_bit snd_oxygen_lib 
drm_kms_helper snd_hda_codec_hdmi coretemp snd_mpu401_uart cfbfillrect 
syscopyarea snd_rawmidi cfbimgblt sr_mod sysfillrect snd_seq_device sysimgblt 
cdrom snd_hda_intel fb_sys_fops mptsas cfbcopyarea snd_hda_codec ttm 
snd_hda_core drm snd_hwdep snd_pcm snd_timer snd soundcore i7300_edac bnx2 
edac_core mptspi mptscsih ipmi_si mptbase ipmi_devintf xhci_pci ipmi_msghandler 
xhci_hcd
[ 4738.225270] CPU: 0 PID: 19546 Comm: kworker/0:9 Tainted: G   O
4.11.5+ #8
[ 4738.225272] Hardware name: Dell Inc. PowerEdge R900/0TT975, BIOS 1.2.0 
11/11/2010
[ 4738.225283] Workqueue: mpt/2 mptsas_firmware_event_work [mptsas]
[ 4738.225285] Call Trace:
[ 4738.225297]  dump_stack+0x4d/0x65
[ 4738.225303]  __warn+0xc7/0xf0
[ 4738.225304]  warn_slowpath_fmt+0x46/0x50
[ 4738.225306]  sysfs_remove_group+0x89/0x90
[ 4738.225310]  dpm_sysfs_remove+0x52/0x60
[ 4738.225313]  device_del+0x119/0x320
[ 4738.225315]  ? kobject_release+0x4c/0x80
[ 4738.225319]  scsi_target_reap_ref_release+0x28/0x40
[ 4738.225320]  scsi_target_reap+0x29/0x30
[ 4738.225322]  scsi_remove_target+0x189/0x1a0
[ 4738.225325]  sas_rphy_remove+0x5b/0x70
[ 4738.225328]  sas_port_delete+0x28/0x160
[ 4738.225331]  ? sysfs_remove_link+0x14/0x30
[ 4738.225334]  mptsas_del_end_device+0x16c/0x1a0 [mptsas]
[ 4738.225336]  mptsas_expander_delete+0x129/0x310 [mptsas]
[ 4738.225338]  mptsas_firmware_event_work+0x69f/0xcda [mptsas]
[ 4738.225340]  ? mptsas_firmware_event_work+0x69f/0xcda [mptsas]
[ 4738.225346]  ?