Bug#597276: qla2xxx_eh_abort(5) - kernel NULL pointer dereference

2010-09-22 Thread Josip Rodin
On Sun, Sep 19, 2010 at 11:44:50PM -0700, Giridhar Malavali wrote:
 Thanks for letting us know about this problem. Can u please provide logs
 with ql2xextended_error_logging enabled. Also, can u please provide more
 details about the test case.

OK. The machine has this hardware:

% sudo lspci -v
[...]
0b:00.0 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI 
Express HBA (rev 02)
Subsystem: Hewlett-Packard Company Device 7041
Flags: bus master, fast devsel, latency 0, IRQ 16
I/O ports at 5000 [size=256]
Memory at fdef (64-bit, non-prefetchable) [size=16K]
[virtual] Expansion ROM at d000 [disabled] [size=256K]
Capabilities: [44] Power Management version 2
Capabilities: [4c] Express Endpoint, MSI 00
Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 
Enable-
Capabilities: [74] Vital Product Data ?
Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16
Capabilities: [100] Advanced Error Reporting ?
Capabilities: [138] Power Budgeting ?
Kernel driver in use: qla2xxx
Kernel modules: qla2xxx

0b:00.1 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI 
Express HBA (rev 02)
Subsystem: Hewlett-Packard Company Device 7041
Flags: bus master, fast devsel, latency 0, IRQ 17
I/O ports at 5400 [size=256]
Memory at fdee (64-bit, non-prefetchable) [size=16K]
[virtual] Expansion ROM at d004 [disabled] [size=256K]
Capabilities: [44] Power Management version 2
Capabilities: [4c] Express Endpoint, MSI 00
Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 
Enable-
Capabilities: [74] Vital Product Data ?
Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16
Capabilities: [100] Advanced Error Reporting ?
Capabilities: [138] Power Budgeting ?
Kernel driver in use: qla2xxx
Kernel modules: qla2xxx

13:00.0 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI 
Express HBA (rev 02)
Subsystem: Hewlett-Packard Company Device 7041
Flags: bus master, fast devsel, latency 0, IRQ 17
I/O ports at 6000 [size=256]
Memory at fdff (64-bit, non-prefetchable) [size=16K]
[virtual] Expansion ROM at d020 [disabled] [size=256K]
Capabilities: [44] Power Management version 2
Capabilities: [4c] Express Endpoint, MSI 00
Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 
Enable-
Capabilities: [74] Vital Product Data ?
Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16
Capabilities: [100] Advanced Error Reporting ?
Capabilities: [138] Power Budgeting ?
Kernel driver in use: qla2xxx
Kernel modules: qla2xxx

13:00.1 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI 
Express HBA (rev 02)
Subsystem: Hewlett-Packard Company Device 7041
Flags: bus master, fast devsel, latency 0, IRQ 18
I/O ports at 6400 [size=256]
Memory at fdfe (64-bit, non-prefetchable) [size=16K]
[virtual] Expansion ROM at d024 [disabled] [size=256K]
Capabilities: [44] Power Management version 2
Capabilities: [4c] Express Endpoint, MSI 00
Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 
Enable-
Capabilities: [74] Vital Product Data ?
Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16
Capabilities: [100] Advanced Error Reporting ?
Capabilities: [138] Power Budgeting ?
Kernel driver in use: qla2xxx
Kernel modules: qla2xxx

Anyway, we had been running an earlier 2.6.32 kernel up until a few days
ago, which gave us this on boot:

[2.656008] QLogic Fibre Channel HBA Driver: 8.03.01-k6-debug
[2.656188] qla2xxx :0b:00.0: PCI INT A - GSI 16 (level, low) - IRQ 16
[2.710842] qla2xxx :0b:00.0: Found an ISP2432, irq 16, iobase 
0xc9c6c000
[2.719526] qla2xxx :0b:00.0: MSI-X: Unsupported ISP2432 (0x2, 0x0).
[2.727776]   alloc irq_desc for 61 on node -1
[2.727778]   alloc kstat_irqs on node -1
[2.728002] qla2xxx :0b:00.0: irq 61 for MSI/MSI-X
[2.728184] qla2xxx :0b:00.0: MSI: Enabled.
[2.732040] IRQ 59/cciss0: IRQF_DISABLED is not guaranteed on shared IRQs
[2.732058] cciss0: 0x3230 at PCI :06:00.0 IRQ 59 using DAC
[2.747326] qla2xxx :0b:00.0: Configuring PCI space...
[2.747479]  cciss/c0d0: p1
[2.755773] qla2xxx :0b:00.0: setting latency timer to 64
[2.756280]  p2
[2.760467] qla2xxx :0b:00.0: FLTL[DEF] = 0x11400.
[2.773807] qla2xxx :0b:00.0: FLT[DEF]: boot=0x0 fw=0x2 
vpd_nvram=0x48000 vpd=0x0 nvram=0x0 fdt=0x11000 flt=0x11400
[2.787143] qla2xxx :0b:00.0: FDT[MID]: (0xbf/0x80) erase=0x7ffd0352 
pro=0 upro=0 wrtd=0x9c blk=0x8000.
[2.789701] qla2xxx :0b:00.0: 

Bug#597276: qla2xxx_eh_abort(5) - kernel NULL pointer dereference

2010-09-20 Thread Giridhar Malavali

Hi Josip,

Thanks for letting us know about this problem. Can u please provide logs
with ql2xextended_error_logging enabled. Also, can u please provide more
details about the test case.

-- Giri

On 9/18/10 4:48 AM, Josip Rodin j...@debbugs.entuzijast.net wrote:

 
 so it looks generic.




--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/c8bc50f2.9a54%giridhar.malav...@qlogic.com



Bug#597276: qla2xxx_eh_abort(5) - kernel NULL pointer dereference

2010-09-18 Thread Josip Rodin
Package: linux-2.6
Version: 2.6.32-21~bpo50+1

Hi,

Got this in dmesg on a server:

Sep 18 02:46:52 birdun kernel: [387093.744649] qla2xxx_eh_abort(5): aborting sp 
8801b58013c0 from RISC. pid=46881441.
Sep 18 02:46:56 birdun kernel: [387093.836909] BUG: unable to handle kernel 
NULL pointer dereference at 0040
Sep 18 02:46:56 birdun kernel: [387093.924511] IP: [812f8ea1] 
_spin_lock_irqsave+0x1a/0x34
Sep 18 02:46:56 birdun kernel: [387093.996511] PGD 22d846067 PUD 22d678067 PMD 0
Sep 18 02:46:56 birdun kernel: [387094.048511] Oops: 0002 [#1] SMP
Sep 18 02:46:56 birdun kernel: [387094.086651] last sysfs file: 
/sys/devices/pci:00/:00:04.0/:13:00.0/host4/rport-4:0-3/target4:0:3/fc_transport/target4:0:3/node_name
Sep 18 02:46:56 birdun kernel: [387094.236007] CPU 4
Sep 18 02:46:56 birdun kernel: [387094.260007] Modules linked in: ipmi_devintf 
nf_conntrack_ipv6 ip6t_LOG ip6table_filter ip6_tables xt_tcpudp 
nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack ipt_REJECT ipt_LOG 
iptable_filter ip_tables x_tables bonding xfs exportfs dm_round_robin 
dm_multipath scsi_dh loop snd_pcsp snd_pcm snd_timer psmouse ipmi_si rng_core 
snd soundcore i5000_edac serio_raw hpilo ipmi_msghandler snd_page_alloc 
edac_core evdev container i5k_amb button processor shpchp pci_hotplug ext3 jbd 
mbcache dm_mirror dm_region_hash dm_log dm_snapshot dm_mod st ch osst sd_mod 
crc_t10dif sg sr_mod cdrom ata_piix ata_generic qla2xxx scsi_transport_fc 
libata scsi_tgt cciss usbhid hid bnx2 ehci_hcd uhci_hcd floppy usbcore nls_base 
scsi_mod thermal fan thermal_sys
Sep 18 02:46:56 birdun kernel: [387095.008511] Pid: 763, comm: scsi_eh_5 Not 
tainted 2.6.32-bpo.5-amd64 #1 ProLiant DL360 G5
Sep 18 02:46:56 birdun kernel: [387095.104511] RIP: 0010:[812f8ea1]  
[812f8ea1] _spin_lock_irqsave+0x1a/0x34
Sep 18 02:46:56 birdun kernel: [387095.204007] RSP: 0018:88022b1c5d70  
EFLAGS: 00010082
Sep 18 02:46:56 birdun kernel: [387095.264511] RAX: 0282 RBX: 
0040 RCX: 381d
Sep 18 02:46:56 birdun kernel: [387095.348511] RDX: 0001 RSI: 
0282 RDI: 0040
Sep 18 02:46:56 birdun kernel: [387095.432258] RBP: 8801b58013c0 R08: 
000a26c8 R09: 000a
Sep 18 02:46:56 birdun kernel: [387095.512512] R10:  R11: 
81673868 R12: 0001
Sep 18 02:46:56 birdun kernel: [387095.596512] R13: 88014066e100 R14: 
8801b5801e80 R15: 
Sep 18 02:46:56 birdun kernel: [387095.684513] FS:  () 
GS:880008d0() knlGS:
Sep 18 02:46:56 birdun kernel: [387095.780002] CS:  0010 DS: 0018 ES: 0018 CR0: 
8005003b
Sep 18 02:46:56 birdun kernel: [387095.844512] CR2: 0040 CR3: 
00022d42b000 CR4: 06e0
Sep 18 02:46:56 birdun kernel: [387095.928512] DR0:  DR1: 
 DR2: 
Sep 18 02:46:56 birdun kernel: [387096.012511] DR3:  DR6: 
0ff0 DR7: 0400
Sep 18 02:46:56 birdun kernel: [387096.096005] Process scsi_eh_5 (pid: 763, 
threadinfo 88022b1c4000, task 88022ba39c40)
Sep 18 02:46:56 birdun kernel: [387096.192511] Stack:
Sep 18 02:46:56 birdun kernel: [387096.216511]  381d 
a014cb8b  0286
Sep 18 02:46:56 birdun kernel: [387096.300959] 0 ff10 
8801b58013c0 2002 0286
Sep 18 02:46:56 birdun kernel: [387096.390206] 0 88022df0a900 
88022b1c 88022b881840 a01407e4
Sep 18 02:46:56 birdun kernel: [387096.480511] Call Trace:
Sep 18 02:46:56 birdun kernel: [387096.508511]  [a014cb8b] ? 
qla24xx_abort_command+0x3f/0x1db [qla2xxx]
Sep 18 02:46:56 birdun kernel: [387096.592513]  [a01407e4] ? 
qla2xxx_eh_abort+0xf2/0x250 [qla2xxx]
Sep 18 02:46:56 birdun kernel: [387096.672511]  [a001ccde] ? 
scsi_error_handler+0x302/0x5b5 [scsi_mod]
Sep 18 02:46:56 birdun kernel: [387096.756512]  [a001c9dc] ? 
scsi_error_handler+0x0/0x5b5 [scsi_mod]
Sep 18 02:46:56 birdun kernel: [387096.836513]  [81063601] ? 
kthread+0x79/0x81
Sep 18 02:46:56 birdun kernel: [387096.896512]  [81011baa] ? 
child_rip+0xa/0x20
Sep 18 02:46:56 birdun kernel: [387096.956511]  [81063588] ? 
kthread+0x0/0x81
Sep 18 02:46:56 birdun kernel: [387097.012512]  [81011ba0] ? 
child_rip+0x0/0x20
Sep 18 02:46:56 birdun kernel: [387097.072511] Code: 31 d2 89 d0 c3 f0 83 2f 01 
79 05 e8 ca ae e9 ff c3 48 83 ec 08 9c 58 0f 1f 44 00 00 48 89 c6 fa 66 0f 1f 
44 00 00 ba 00 00 01 00 f0 0f c1 17 0f b7 ca c1 ea 10 39 d1 74 07 f3 90 0f b7 
0f eb f5
Sep 18 02:46:56 birdun kernel: [387097.292511] RIP  [812f8ea1] 
_spin_lock_irqsave+0x1a/0x34
Sep 18 02:46:56 birdun kernel: [387097.364514]  RSP 88022b1c5d70
Sep 18 02:46:56 birdun kernel: [387097.404511] CR2: 0040
Sep 18 02:46:56 birdun kernel: