Tziporet, UNH-IOL completed the testing of the new daily build and here is what we found.
+++++++++++++++++++++++++++++++++++ SL 6.3 2.6.32-279.el6.x86_64 OFED-3.5-20121016-0341.tgz 16-Oct-2012 03:42 18M The new build now allows you to load and unload successfully with no system crashes. We were also able to run the OFA Interop SRP tests successfully. However the Module took a long time (~1-2 minutes) to unload 2. Message saying something to the effect of 'stale connection...retrying' was observed The attached file is a capture from the dmesg output. ++++++++++++++++++++++++++++++++++++ This is bug 2374. Thanks Rupert -----Original Message----- From: Rupert Dance [mailto:rsda...@soft-forge.com] Sent: Tuesday, October 16, 2012 6:59 AM To: 'Tziporet Koren' Cc: 'Vladimir Sokolovsky' Subject: RE: [PATCH] ib_srp: Avoid that module removal can trigger a deadlock Tziporet, I have asked them to get this done today. I will let you know as soon as I can confirm. Thanks Rupert -----Original Message----- From: Tziporet Koren [mailto:tzipo...@mellanox.com] Sent: Tuesday, October 16, 2012 6:36 AM To: Vladimir Sokolovsky; Rupert Dance Cc: 'Bart Van Assche'; 'ewg' Subject: RE: [PATCH] ib_srp: Avoid that module removal can trigger a deadlock Rupert I must get your input for SRP stability to know when we can build a new RC Thanks Tziporet -----Original Message----- From: Vladimir Sokolovsky [mailto:v...@dev.mellanox.co.il] Sent: Tuesday, October 16, 2012 12:34 PM To: Rupert Dance Cc: 'Bart Van Assche'; 'ewg'; Tziporet Koren Subject: Re: [PATCH] ib_srp: Avoid that module removal can trigger a deadlock On 10/15/2012 04:46 PM, Rupert Dance wrote: > Vlad, > > Thanks for getting this done. Is this in today's daily build or if not > when will I have access? > > Thanks > > Rupert Hi Rupert, Yes, today's daily build includes this fix. Regards, Vladimir > > -----Original Message----- > From: Vladimir Sokolovsky [mailto:v...@dev.mellanox.co.il] > Sent: Monday, October 15, 2012 9:28 AM > To: Bart Van Assche > Cc: Rupert Dance; ewg; Tziporet Koren > Subject: Re: [PATCH] ib_srp: Avoid that module removal can trigger a > deadlock > > On 10/12/2012 02:03 PM, Bart Van Assche wrote: >> Avoid that scsi_remove_host() is invoked from the context of a work >> queue thread on which work has been queued that scsi_remove_host() >> might be waiting for. That avoids that module removal of ib_srp >> triggers a deadlock on a pre-2.6.36 kernel. This patch has been >> tested on RHEL 6.1, RHEL 6.2, RHEL 6.3 and SLES 11 SP2. >> >> Reported-by: Rupert Dance <rsda...@soft-forge.com> >> Signed-off-by: Bart Van Assche <bvanass...@acm.org> >> --- > > Applied, > > Regards, > Vladimir > > >
scsi host4: ib_srp: new target: id_ext c19d350003c90200 ioc_guid 0002c90300359de0 pkey ffff service_id c19d350003c90200 dgid fe80:0000:0000:0000:0002:c903:0035:9de1 scsi host4: ib_srp: REJ received scsi host4: REJ reason: stale connection scsi host4: ib_srp: retrying stale connection scsi host4: ib_srp: REJ received scsi host4: REJ reason: stale connection scsi host4: ib_srp: retrying stale connection scsi host4: ib_srp: REJ received scsi host4: REJ reason: stale connection scsi host4: ib_srp: retrying stale connection scsi host4: ib_srp: REJ received scsi host4: REJ reason: stale connection scsi host4: ib_srp: giving up on stale connection scsi host4: ib_srp: Connection failed scsi host5: ib_srp: new target: id_ext c19d350003c90200 ioc_guid 0002c90300359de0 pkey ffff service_id c19d350003c90200 dgid fe80:0000:0000:0000:0002:c903:0035:9de2 scsi host5: ib_srp: REJ received scsi host5: REJ reason: stale connection scsi host5: ib_srp: retrying stale connection scsi host5: ib_srp: REJ received scsi host5: REJ reason: stale connection scsi host5: ib_srp: retrying stale connection scsi host5: ib_srp: REJ received scsi host5: REJ reason: stale connection scsi host5: ib_srp: retrying stale connection scsi host5: ib_srp: REJ received scsi host5: REJ reason: stale connection scsi host5: ib_srp: giving up on stale connection scsi host5: ib_srp: Connection failed scsi host6: ib_srp: new target: id_ext c09e350003c90200 ioc_guid 0002c90300359e10 pkey ffff service_id c09e350003c90200 dgid fe80:0000:0000:0000:0002:c903:0035:9e11 scsi6 : SRP.T10:C09E350003C90200 scsi 6:0:0:0: Direct-Access DDN SFA 12000 1.50 PQ: 0 ANSI: 5 sd 6:0:0:0: Attached scsi generic sg2 type 0 sd 6:0:0:0: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatically remap LUN assignments. sd 6:0:0:0: [sdb] Unit Not Ready sd 6:0:0:0: [sdb] Sense Key : Unit Attention [current] sd 6:0:0:0: [sdb] Add. Sense: Reported luns data has changed sd 6:0:0:0: [sdb] 4412407808 512-byte logical blocks: (2.25 TB/2.05 TiB) sd 6:0:0:0: [sdb] Write Protect is off sd 6:0:0:0: [sdb] Mode Sense: 6f 00 10 08 scsi 6:0:0:4: Direct-Access DDN SFA 12000 1.50 PQ: 0 ANSI: 5 sd 6:0:0:0: [sdb] Write cache: enabled, read cache: enabled, supports DPO and FUA sd 6:0:0:4: Attached scsi generic sg3 type 0 sd 6:0:0:4: [sdc] 2197815296 512-byte logical blocks: (1.12 TB/1.02 TiB) sdb: sd 6:0:0:4: [sdc] Write Protect is off sd 6:0:0:4: [sdc] Mode Sense: 6f 00 10 08 sd 6:0:0:4: [sdc] Write cache: enabled, read cache: enabled, supports DPO and FUA unknown partition table sd 6:0:0:0: [sdb] Attached SCSI disk sdc: unknown partition table sd 6:0:0:4: [sdc] Attached SCSI disk scsi host7: ib_srp: new target: id_ext c09e350003c90200 ioc_guid 0002c90300359e10 pkey ffff service_id c09e350003c90200 dgid fe80:0000:0000:0000:0002:c903:0035:9e12 scsi7 : SRP.T10:C09E350003C90200 scsi 7:0:0:0: RAID DDN SFA 12000 1.50 PQ: 0 ANSI: 5 scsi 7:0:0:0: Attached scsi generic sg4 type 12 scsi 7:0:0:1: Direct-Access DDN SFA 12000 1.50 PQ: 0 ANSI: 5 sd 7:0:0:1: Attached scsi generic sg5 type 0 sd 7:0:0:1: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatically remap LUN assignments. sd 7:0:0:1: [sdd] Unit Not Ready sd 7:0:0:1: [sdd] Sense Key : Unit Attention [current] sd 7:0:0:1: [sdd] Add. Sense: Reported luns data has changed sd 7:0:0:1: [sdd] 4412407808 512-byte logical blocks: (2.25 TB/2.05 TiB) sd 7:0:0:1: [sdd] Write Protect is off sd 7:0:0:1: [sdd] Mode Sense: 6f 00 10 08 sd 7:0:0:1: [sdd] Write cache: enabled, read cache: enabled, supports DPO and FUA sdd: unknown partition table sd 7:0:0:1: [sdd] Attached SCSI disk sd 6:0:0:0: [sdb] Synchronizing SCSI cache sd 6:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK sd 6:0:0:4: [sdc] Synchronizing SCSI cache sd 6:0:0:4: [sdc] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK scsi host6: ib_srp: connection closed sd 7:0:0:1: [sdd] Synchronizing SCSI cache sd 7:0:0:1: [sdd] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK scsi host7: ib_srp: connection closed scsi host8: ib_srp: new target: id_ext c19d350003c90200 ioc_guid 0002c90300359de0 pkey ffff service_id c19d350003c90200 dgid fe80:0000:0000:0000:0002:c903:0035:9de1 scsi8 : SRP.T10:C19D350003C90200 scsi 8:0:0:0: RAID DDN SFA 12000 1.50 PQ: 0 ANSI: 5 scsi 8:0:0:0: Attached scsi generic sg2 type 12 scsi 8:0:0:2: Direct-Access DDN SFA 12000 1.50 PQ: 0 ANSI: 5 sd 8:0:0:2: Attached scsi generic sg3 type 0 sd 8:0:0:2: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatically remap LUN assignments. sd 8:0:0:2: [sdb] Unit Not Ready sd 8:0:0:2: [sdb] Sense Key : Unit Attention [current] sd 8:0:0:2: [sdb] Add. Sense: Reported luns data has changed sd 8:0:0:2: [sdb] 4412407808 512-byte logical blocks: (2.25 TB/2.05 TiB) sd 8:0:0:2: [sdb] Write Protect is off sd 8:0:0:2: [sdb] Mode Sense: 6f 00 10 08 sd 8:0:0:2: [sdb] Write cache: enabled, read cache: enabled, supports DPO and FUA sdb: unknown partition table sd 8:0:0:2: [sdb] Attached SCSI disk scsi host9: ib_srp: new target: id_ext c19d350003c90200 ioc_guid 0002c90300359de0 pkey ffff service_id c19d350003c90200 dgid fe80:0000:0000:0000:0002:c903:0035:9de2 scsi9 : SRP.T10:C19D350003C90200 scsi 9:0:0:0: RAID DDN SFA 12000 1.50 PQ: 0 ANSI: 5 scsi 9:0:0:0: Attached scsi generic sg4 type 12 scsi 9:0:0:3: Direct-Access DDN SFA 12000 1.50 PQ: 0 ANSI: 5 sd 9:0:0:3: Attached scsi generic sg5 type 0 sd 9:0:0:3: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatically remap LUN assignments. sd 9:0:0:3: [sdc] Unit Not Ready sd 9:0:0:3: [sdc] Sense Key : Unit Attention [current] sd 9:0:0:3: [sdc] Add. Sense: Reported luns data has changed sd 9:0:0:3: [sdc] 4412407808 512-byte logical blocks: (2.25 TB/2.05 TiB) sd 9:0:0:3: [sdc] Write Protect is off sd 9:0:0:3: [sdc] Mode Sense: 6f 00 10 08 sd 9:0:0:3: [sdc] Write cache: enabled, read cache: enabled, supports DPO and FUA sdc: unknown partition table sd 9:0:0:3: [sdc] Attached SCSI disk scsi host10: ib_srp: new target: id_ext c09e350003c90200 ioc_guid 0002c90300359e10 pkey ffff service_id c09e350003c90200 dgid fe80:0000:0000:0000:0002:c903:0035:9e11 scsi10 : SRP.T10:C09E350003C90200 scsi 10:0:0:0: Direct-Access DDN SFA 12000 1.50 PQ: 0 ANSI: 5 sd 10:0:0:0: Attached scsi generic sg6 type 0 sd 10:0:0:0: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatically remap LUN assignments. sd 10:0:0:0: [sdd] Unit Not Ready sd 10:0:0:0: [sdd] Sense Key : Unit Attention [current] sd 10:0:0:0: [sdd] Add. Sense: Reported luns data has changed sd 10:0:0:0: [sdd] 4412407808 512-byte logical blocks: (2.25 TB/2.05 TiB) scsi 10:0:0:4: Direct-Access DDN SFA 12000 1.50 PQ: 0 ANSI: 5 sd 10:0:0:0: [sdd] Write Protect is off sd 10:0:0:0: [sdd] Mode Sense: 6f 00 10 08 sd 10:0:0:0: [sdd] Write cache: enabled, read cache: enabled, supports DPO and FUA sd 10:0:0:4: Attached scsi generic sg7 type 0 sd 10:0:0:4: [sde] 2197815296 512-byte logical blocks: (1.12 TB/1.02 TiB) sdd: unknown partition table sd 10:0:0:4: [sde] Write Protect is off sd 10:0:0:4: [sde] Mode Sense: 6f 00 10 08 sd 10:0:0:4: [sde] Write cache: enabled, read cache: enabled, supports DPO and FUA sd 10:0:0:0: [sdd] Attached SCSI disk sde: unknown partition table sd 10:0:0:4: [sde] Attached SCSI disk scsi host11: ib_srp: new target: id_ext c09e350003c90200 ioc_guid 0002c90300359e10 pkey ffff service_id c09e350003c90200 dgid fe80:0000:0000:0000:0002:c903:0035:9e12 scsi11 : SRP.T10:C09E350003C90200 scsi 11:0:0:0: RAID DDN SFA 12000 1.50 PQ: 0 ANSI: 5 scsi 11:0:0:0: Attached scsi generic sg8 type 12 scsi 11:0:0:1: Direct-Access DDN SFA 12000 1.50 PQ: 0 ANSI: 5 sd 11:0:0:1: Attached scsi generic sg9 type 0 sd 11:0:0:1: Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatically remap LUN assignments. sd 11:0:0:1: [sdf] Unit Not Ready sd 11:0:0:1: [sdf] Sense Key : Unit Attention [current] sd 11:0:0:1: [sdf] Add. Sense: Reported luns data has changed sd 11:0:0:1: [sdf] 4412407808 512-byte logical blocks: (2.25 TB/2.05 TiB) sd 11:0:0:1: [sdf] Write Protect is off sd 11:0:0:1: [sdf] Mode Sense: 6f 00 10 08 sd 11:0:0:1: [sdf] Write cache: enabled, read cache: enabled, supports DPO and FUA sdf: unknown partition table sd 11:0:0:1: [sdf] Attached SCSI disk sd 8:0:0:2: [sdb] Synchronizing SCSI cache sd 8:0:0:2: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK scsi host8: ib_srp: connection closed sd 9:0:0:3: [sdc] Synchronizing SCSI cache sd 9:0:0:3: [sdc] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK scsi host9: ib_srp: connection closed sd 10:0:0:0: [sdd] Synchronizing SCSI cache sd 10:0:0:0: [sdd] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK sd 10:0:0:4: [sde] Synchronizing SCSI cache sd 10:0:0:4: [sde] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK usb 4-2: new low speed USB device using uhci_hcd and address 4 usb 4-2: New USB device found, idVendor=413c, idProduct=2003 usb 4-2: New USB device strings: Mfr=1, Product=2, SerialNumber=0 usb 4-2: Product: Dell USB Keyboard usb 4-2: Manufacturer: Dell usb 4-2: configuration #1 chosen from 1 choice input: Dell Dell USB Keyboard as /devices/pci0000:00/0000:00:1d.2/usb4/4-2/4-2:1.0/input/input7 generic-usb 0003:413C:2003.0005: input,hidraw2: USB HID v1.10 Keyboard [Dell Dell USB Keyboard] on usb-0000:00:1d.2-2/input0 scsi host10: ib_srp: connection closed sd 11:0:0:1: [sdf] Synchronizing SCSI cache sd 11:0:0:1: [sdf] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK scsi host11: ib_srp: connection closed
_______________________________________________ ewg mailing list ewg@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/ewg