Bug#597276: qla2xxx_eh_abort(5) - kernel NULL pointer dereference
On Sun, Sep 19, 2010 at 11:44:50PM -0700, Giridhar Malavali wrote: Thanks for letting us know about this problem. Can u please provide logs with ql2xextended_error_logging enabled. Also, can u please provide more details about the test case. OK. The machine has this hardware: % sudo lspci -v [...] 0b:00.0 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA (rev 02) Subsystem: Hewlett-Packard Company Device 7041 Flags: bus master, fast devsel, latency 0, IRQ 16 I/O ports at 5000 [size=256] Memory at fdef (64-bit, non-prefetchable) [size=16K] [virtual] Expansion ROM at d000 [disabled] [size=256K] Capabilities: [44] Power Management version 2 Capabilities: [4c] Express Endpoint, MSI 00 Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 Enable- Capabilities: [74] Vital Product Data ? Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16 Capabilities: [100] Advanced Error Reporting ? Capabilities: [138] Power Budgeting ? Kernel driver in use: qla2xxx Kernel modules: qla2xxx 0b:00.1 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA (rev 02) Subsystem: Hewlett-Packard Company Device 7041 Flags: bus master, fast devsel, latency 0, IRQ 17 I/O ports at 5400 [size=256] Memory at fdee (64-bit, non-prefetchable) [size=16K] [virtual] Expansion ROM at d004 [disabled] [size=256K] Capabilities: [44] Power Management version 2 Capabilities: [4c] Express Endpoint, MSI 00 Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 Enable- Capabilities: [74] Vital Product Data ? Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16 Capabilities: [100] Advanced Error Reporting ? Capabilities: [138] Power Budgeting ? Kernel driver in use: qla2xxx Kernel modules: qla2xxx 13:00.0 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA (rev 02) Subsystem: Hewlett-Packard Company Device 7041 Flags: bus master, fast devsel, latency 0, IRQ 17 I/O ports at 6000 [size=256] Memory at fdff (64-bit, non-prefetchable) [size=16K] [virtual] Expansion ROM at d020 [disabled] [size=256K] Capabilities: [44] Power Management version 2 Capabilities: [4c] Express Endpoint, MSI 00 Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 Enable- Capabilities: [74] Vital Product Data ? Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16 Capabilities: [100] Advanced Error Reporting ? Capabilities: [138] Power Budgeting ? Kernel driver in use: qla2xxx Kernel modules: qla2xxx 13:00.1 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA (rev 02) Subsystem: Hewlett-Packard Company Device 7041 Flags: bus master, fast devsel, latency 0, IRQ 18 I/O ports at 6400 [size=256] Memory at fdfe (64-bit, non-prefetchable) [size=16K] [virtual] Expansion ROM at d024 [disabled] [size=256K] Capabilities: [44] Power Management version 2 Capabilities: [4c] Express Endpoint, MSI 00 Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 Enable- Capabilities: [74] Vital Product Data ? Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16 Capabilities: [100] Advanced Error Reporting ? Capabilities: [138] Power Budgeting ? Kernel driver in use: qla2xxx Kernel modules: qla2xxx Anyway, we had been running an earlier 2.6.32 kernel up until a few days ago, which gave us this on boot: [2.656008] QLogic Fibre Channel HBA Driver: 8.03.01-k6-debug [2.656188] qla2xxx :0b:00.0: PCI INT A - GSI 16 (level, low) - IRQ 16 [2.710842] qla2xxx :0b:00.0: Found an ISP2432, irq 16, iobase 0xc9c6c000 [2.719526] qla2xxx :0b:00.0: MSI-X: Unsupported ISP2432 (0x2, 0x0). [2.727776] alloc irq_desc for 61 on node -1 [2.727778] alloc kstat_irqs on node -1 [2.728002] qla2xxx :0b:00.0: irq 61 for MSI/MSI-X [2.728184] qla2xxx :0b:00.0: MSI: Enabled. [2.732040] IRQ 59/cciss0: IRQF_DISABLED is not guaranteed on shared IRQs [2.732058] cciss0: 0x3230 at PCI :06:00.0 IRQ 59 using DAC [2.747326] qla2xxx :0b:00.0: Configuring PCI space... [2.747479] cciss/c0d0: p1 [2.755773] qla2xxx :0b:00.0: setting latency timer to 64 [2.756280] p2 [2.760467] qla2xxx :0b:00.0: FLTL[DEF] = 0x11400. [2.773807] qla2xxx :0b:00.0: FLT[DEF]: boot=0x0 fw=0x2 vpd_nvram=0x48000 vpd=0x0 nvram=0x0 fdt=0x11000 flt=0x11400 [2.787143] qla2xxx :0b:00.0: FDT[MID]: (0xbf/0x80) erase=0x7ffd0352 pro=0 upro=0 wrtd=0x9c blk=0x8000. [2.789701] qla2xxx :0b:00.0:
Bug#597276: qla2xxx_eh_abort(5) - kernel NULL pointer dereference
Hi Josip, Thanks for letting us know about this problem. Can u please provide logs with ql2xextended_error_logging enabled. Also, can u please provide more details about the test case. -- Giri On 9/18/10 4:48 AM, Josip Rodin j...@debbugs.entuzijast.net wrote: so it looks generic. -- To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org Archive: http://lists.debian.org/c8bc50f2.9a54%giridhar.malav...@qlogic.com
Bug#597276: qla2xxx_eh_abort(5) - kernel NULL pointer dereference
Package: linux-2.6 Version: 2.6.32-21~bpo50+1 Hi, Got this in dmesg on a server: Sep 18 02:46:52 birdun kernel: [387093.744649] qla2xxx_eh_abort(5): aborting sp 8801b58013c0 from RISC. pid=46881441. Sep 18 02:46:56 birdun kernel: [387093.836909] BUG: unable to handle kernel NULL pointer dereference at 0040 Sep 18 02:46:56 birdun kernel: [387093.924511] IP: [812f8ea1] _spin_lock_irqsave+0x1a/0x34 Sep 18 02:46:56 birdun kernel: [387093.996511] PGD 22d846067 PUD 22d678067 PMD 0 Sep 18 02:46:56 birdun kernel: [387094.048511] Oops: 0002 [#1] SMP Sep 18 02:46:56 birdun kernel: [387094.086651] last sysfs file: /sys/devices/pci:00/:00:04.0/:13:00.0/host4/rport-4:0-3/target4:0:3/fc_transport/target4:0:3/node_name Sep 18 02:46:56 birdun kernel: [387094.236007] CPU 4 Sep 18 02:46:56 birdun kernel: [387094.260007] Modules linked in: ipmi_devintf nf_conntrack_ipv6 ip6t_LOG ip6table_filter ip6_tables xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack ipt_REJECT ipt_LOG iptable_filter ip_tables x_tables bonding xfs exportfs dm_round_robin dm_multipath scsi_dh loop snd_pcsp snd_pcm snd_timer psmouse ipmi_si rng_core snd soundcore i5000_edac serio_raw hpilo ipmi_msghandler snd_page_alloc edac_core evdev container i5k_amb button processor shpchp pci_hotplug ext3 jbd mbcache dm_mirror dm_region_hash dm_log dm_snapshot dm_mod st ch osst sd_mod crc_t10dif sg sr_mod cdrom ata_piix ata_generic qla2xxx scsi_transport_fc libata scsi_tgt cciss usbhid hid bnx2 ehci_hcd uhci_hcd floppy usbcore nls_base scsi_mod thermal fan thermal_sys Sep 18 02:46:56 birdun kernel: [387095.008511] Pid: 763, comm: scsi_eh_5 Not tainted 2.6.32-bpo.5-amd64 #1 ProLiant DL360 G5 Sep 18 02:46:56 birdun kernel: [387095.104511] RIP: 0010:[812f8ea1] [812f8ea1] _spin_lock_irqsave+0x1a/0x34 Sep 18 02:46:56 birdun kernel: [387095.204007] RSP: 0018:88022b1c5d70 EFLAGS: 00010082 Sep 18 02:46:56 birdun kernel: [387095.264511] RAX: 0282 RBX: 0040 RCX: 381d Sep 18 02:46:56 birdun kernel: [387095.348511] RDX: 0001 RSI: 0282 RDI: 0040 Sep 18 02:46:56 birdun kernel: [387095.432258] RBP: 8801b58013c0 R08: 000a26c8 R09: 000a Sep 18 02:46:56 birdun kernel: [387095.512512] R10: R11: 81673868 R12: 0001 Sep 18 02:46:56 birdun kernel: [387095.596512] R13: 88014066e100 R14: 8801b5801e80 R15: Sep 18 02:46:56 birdun kernel: [387095.684513] FS: () GS:880008d0() knlGS: Sep 18 02:46:56 birdun kernel: [387095.780002] CS: 0010 DS: 0018 ES: 0018 CR0: 8005003b Sep 18 02:46:56 birdun kernel: [387095.844512] CR2: 0040 CR3: 00022d42b000 CR4: 06e0 Sep 18 02:46:56 birdun kernel: [387095.928512] DR0: DR1: DR2: Sep 18 02:46:56 birdun kernel: [387096.012511] DR3: DR6: 0ff0 DR7: 0400 Sep 18 02:46:56 birdun kernel: [387096.096005] Process scsi_eh_5 (pid: 763, threadinfo 88022b1c4000, task 88022ba39c40) Sep 18 02:46:56 birdun kernel: [387096.192511] Stack: Sep 18 02:46:56 birdun kernel: [387096.216511] 381d a014cb8b 0286 Sep 18 02:46:56 birdun kernel: [387096.300959] 0 ff10 8801b58013c0 2002 0286 Sep 18 02:46:56 birdun kernel: [387096.390206] 0 88022df0a900 88022b1c 88022b881840 a01407e4 Sep 18 02:46:56 birdun kernel: [387096.480511] Call Trace: Sep 18 02:46:56 birdun kernel: [387096.508511] [a014cb8b] ? qla24xx_abort_command+0x3f/0x1db [qla2xxx] Sep 18 02:46:56 birdun kernel: [387096.592513] [a01407e4] ? qla2xxx_eh_abort+0xf2/0x250 [qla2xxx] Sep 18 02:46:56 birdun kernel: [387096.672511] [a001ccde] ? scsi_error_handler+0x302/0x5b5 [scsi_mod] Sep 18 02:46:56 birdun kernel: [387096.756512] [a001c9dc] ? scsi_error_handler+0x0/0x5b5 [scsi_mod] Sep 18 02:46:56 birdun kernel: [387096.836513] [81063601] ? kthread+0x79/0x81 Sep 18 02:46:56 birdun kernel: [387096.896512] [81011baa] ? child_rip+0xa/0x20 Sep 18 02:46:56 birdun kernel: [387096.956511] [81063588] ? kthread+0x0/0x81 Sep 18 02:46:56 birdun kernel: [387097.012512] [81011ba0] ? child_rip+0x0/0x20 Sep 18 02:46:56 birdun kernel: [387097.072511] Code: 31 d2 89 d0 c3 f0 83 2f 01 79 05 e8 ca ae e9 ff c3 48 83 ec 08 9c 58 0f 1f 44 00 00 48 89 c6 fa 66 0f 1f 44 00 00 ba 00 00 01 00 f0 0f c1 17 0f b7 ca c1 ea 10 39 d1 74 07 f3 90 0f b7 0f eb f5 Sep 18 02:46:56 birdun kernel: [387097.292511] RIP [812f8ea1] _spin_lock_irqsave+0x1a/0x34 Sep 18 02:46:56 birdun kernel: [387097.364514] RSP 88022b1c5d70 Sep 18 02:46:56 birdun kernel: [387097.404511] CR2: 0040 Sep 18 02:46:56 birdun kernel: