Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic
We are unable to reproduce. I think Ubuntu might be moving kernels around on us because 'apt install linux-generic-hwe-20.04-edge' gave us 5.8.0-53-generic. I'm going to call this closed unless you're still seeing this with the latest kernels. Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com -Original Message- From: Fujinaka, Todd Sent: Monday, March 1, 2021 10:24 AM To: Dmitry Kravkov Cc: e1000-de...@lists.sf.net Subject: Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic OK, I’ll file an internal ticket. Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com From: Dmitry Kravkov Sent: Monday, March 1, 2021 9:04 AM To: Fujinaka, Todd Cc: e1000-de...@lists.sf.net Subject: Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic Also happens in 20.04 TLS hwe-5.8.0-44-generic kernel kernel installed using 'apt install linux-generic-hwe-20.04-edge' On Thu, Feb 25, 2021 at 11:41 AM Dmitry Kravkov mailto:dmit...@qwilt.com>> wrote: Will do so On Wed, Feb 24, 2021 at 5:00 PM Fujinaka, Todd mailto:todd.fujin...@intel.com>> wrote: We only support the LTS variants. Can you try one of those? Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com<mailto:todd.fujin...@intel.com> From: Dmitry Kravkov mailto:dmit...@qwilt.com>> Sent: Tuesday, February 23, 2021 11:24 PM To: Fujinaka, Todd mailto:todd.fujin...@intel.com>> Cc: e1000-de...@lists.sf.net<mailto:e1000-de...@lists.sf.net> Subject: Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic On Wed, Feb 24, 2021 at 2:28 AM Fujinaka, Todd mailto:todd.fujin...@intel.com>> wrote: What version of Ubuntu is this? It's going to take me a bit to try to find the kernel from the release. Ubuntu 20.10 Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com<mailto:todd.fujin...@intel.com> -Original Message- From: Dmitry Kravkov mailto:dmit...@qwilt.com>> Sent: Sunday, February 21, 2021 11:43 PM To: e1000-de...@lists.sf.net<mailto:e1000-de...@lists.sf.net> Subject: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic Hi All I'm hitting the following bug during unload inbox driver and insmod'ing 5.9.4 (also happens with 5.10.2): [ 1739.889642] BUG: kernel NULL pointer dereference, address: 04f0 [ 1739.897969] #PF: supervisor read access in kernel mode [ 1739.904155] #PF: error_code(0x) - not-present page [ 1739.910327] PGD 0 P4D 0 [ 1739.913648] Oops: [#1] SMP PTI [ 1739.917985] CPU: 16 PID: 0 Comm: swapper/16 Kdump: loaded Tainted: G OE 5.8.0-25-generic #26-Ubuntu [ 1739.929943] Hardware name: /, BIOS 2.2.2 01/16/2014 [ 1739.936043] RIP: 0010:eth_get_headlen+0x26/0xb0 [ 1739.941625] Code: 00 00 00 00 66 66 66 66 90 55 48 89 e5 41 54 53 89 d3 48 83 ec 18 65 48 8b 04 25 28 00 00 00 48 89 45 e8 31 c0 83 fa 0d 76 7e <48> 8b bf f0 04 00 00 6a 01 49 89 f0 49 89 f4 52 48 8d 4d dc 48 c7 [ 1739.963567] RSP: 0018:be2506798db8 EFLAGS: 00010216 [ 1739.969961] RAX: RBX: 05ea RCX: 0002 [ 1739.978453] RDX: 05ea RSI: 9f6fb733c0c0 RDI: [ 1739.986957] RBP: be2506798de0 R08: R09: 9f733306ff00 [ 1739.995423] R10: 05ea R11: 0100 R12: 9f727b2c0740 [ 1740.003871] R13: 9f724b0e6010 R14: 400a838d R15: [ 1740.012330] FS: () GS:9f733fa0() knlGS: [ 1740.021848] CS: 0010 DS: ES: CR0: 80050033 [ 1740.028757] CR2: 04f0 CR3: 0002c740a001 CR4: 000606e0 [ 1740.037209] Call Trace: [ 1740.040425] [ 1740.043154] ixgbe_process_skb_fields+0x55/0x260 [ixgbe] [ 1740.049577] ixgbe_poll+0x52b/0x12c0 [ixgbe] [ 1740.054809] napi_poll+0x96/0x1b0 [ 1740.058985] net_rx_action+0xb8/0x1c0 [ 1740.063575] __do_softirq+0xd0/0x2a1 [ 1740.068055] asm_call_irq_on_stack+0x12/0x20 [ 1740.073345] [ 1740.076223] do_softirq_own_stack+0x3d/0x50 [ 1740.081402] irq_exit_rcu+0x95/0xd0 [ 1740.085829] common_interrupt+0x7c/0x150 [ 1740.090730] asm_common_interrupt+0x1e/0x40 [ 1740.095941] RIP: 0010:cpuidle_enter_state+0xb4/0x3f0 [ 1740.102049] Code: 65 8b 3d 3f fb c6 58 e8 4a 5d 74 ff 48 89 45 d0 66 66 66 66 90 31 ff e8 fa 68 74 ff 80 7d c7 00 0f 85 d3 01 00 00 fb 66 66 90 <66> 66 90 45 85 e4 0f 88 df 01 00 00 49 63 d4 48 8d 04 52 48 8d 0c [ 1740.124194] RSP: 0018:be250634fe48 EFLAGS: 0246 [ 1740.130699] RAX: 9f733fa2c6c0 RBX: de14bfa00f00 RCX: 001f [ 1740.139315] RDX: RSI: 373a RDI: [ 1740.147943] RBP: be250634fe88 R08: 01951980e894 R09: 000
Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic
OK, I’ll file an internal ticket. Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com From: Dmitry Kravkov Sent: Monday, March 1, 2021 9:04 AM To: Fujinaka, Todd Cc: e1000-de...@lists.sf.net Subject: Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic Also happens in 20.04 TLS hwe-5.8.0-44-generic kernel kernel installed using 'apt install linux-generic-hwe-20.04-edge' On Thu, Feb 25, 2021 at 11:41 AM Dmitry Kravkov mailto:dmit...@qwilt.com>> wrote: Will do so On Wed, Feb 24, 2021 at 5:00 PM Fujinaka, Todd mailto:todd.fujin...@intel.com>> wrote: We only support the LTS variants. Can you try one of those? Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com<mailto:todd.fujin...@intel.com> From: Dmitry Kravkov mailto:dmit...@qwilt.com>> Sent: Tuesday, February 23, 2021 11:24 PM To: Fujinaka, Todd mailto:todd.fujin...@intel.com>> Cc: e1000-de...@lists.sf.net<mailto:e1000-de...@lists.sf.net> Subject: Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic On Wed, Feb 24, 2021 at 2:28 AM Fujinaka, Todd mailto:todd.fujin...@intel.com>> wrote: What version of Ubuntu is this? It's going to take me a bit to try to find the kernel from the release. Ubuntu 20.10 Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com<mailto:todd.fujin...@intel.com> -Original Message- From: Dmitry Kravkov mailto:dmit...@qwilt.com>> Sent: Sunday, February 21, 2021 11:43 PM To: e1000-de...@lists.sf.net<mailto:e1000-de...@lists.sf.net> Subject: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic Hi All I'm hitting the following bug during unload inbox driver and insmod'ing 5.9.4 (also happens with 5.10.2): [ 1739.889642] BUG: kernel NULL pointer dereference, address: 04f0 [ 1739.897969] #PF: supervisor read access in kernel mode [ 1739.904155] #PF: error_code(0x) - not-present page [ 1739.910327] PGD 0 P4D 0 [ 1739.913648] Oops: [#1] SMP PTI [ 1739.917985] CPU: 16 PID: 0 Comm: swapper/16 Kdump: loaded Tainted: G OE 5.8.0-25-generic #26-Ubuntu [ 1739.929943] Hardware name: /, BIOS 2.2.2 01/16/2014 [ 1739.936043] RIP: 0010:eth_get_headlen+0x26/0xb0 [ 1739.941625] Code: 00 00 00 00 66 66 66 66 90 55 48 89 e5 41 54 53 89 d3 48 83 ec 18 65 48 8b 04 25 28 00 00 00 48 89 45 e8 31 c0 83 fa 0d 76 7e <48> 8b bf f0 04 00 00 6a 01 49 89 f0 49 89 f4 52 48 8d 4d dc 48 c7 [ 1739.963567] RSP: 0018:be2506798db8 EFLAGS: 00010216 [ 1739.969961] RAX: RBX: 05ea RCX: 0002 [ 1739.978453] RDX: 05ea RSI: 9f6fb733c0c0 RDI: [ 1739.986957] RBP: be2506798de0 R08: R09: 9f733306ff00 [ 1739.995423] R10: 05ea R11: 0100 R12: 9f727b2c0740 [ 1740.003871] R13: 9f724b0e6010 R14: 400a838d R15: [ 1740.012330] FS: () GS:9f733fa0() knlGS: [ 1740.021848] CS: 0010 DS: ES: CR0: 80050033 [ 1740.028757] CR2: 04f0 CR3: 0002c740a001 CR4: 000606e0 [ 1740.037209] Call Trace: [ 1740.040425] [ 1740.043154] ixgbe_process_skb_fields+0x55/0x260 [ixgbe] [ 1740.049577] ixgbe_poll+0x52b/0x12c0 [ixgbe] [ 1740.054809] napi_poll+0x96/0x1b0 [ 1740.058985] net_rx_action+0xb8/0x1c0 [ 1740.063575] __do_softirq+0xd0/0x2a1 [ 1740.068055] asm_call_irq_on_stack+0x12/0x20 [ 1740.073345] [ 1740.076223] do_softirq_own_stack+0x3d/0x50 [ 1740.081402] irq_exit_rcu+0x95/0xd0 [ 1740.085829] common_interrupt+0x7c/0x150 [ 1740.090730] asm_common_interrupt+0x1e/0x40 [ 1740.095941] RIP: 0010:cpuidle_enter_state+0xb4/0x3f0 [ 1740.102049] Code: 65 8b 3d 3f fb c6 58 e8 4a 5d 74 ff 48 89 45 d0 66 66 66 66 90 31 ff e8 fa 68 74 ff 80 7d c7 00 0f 85 d3 01 00 00 fb 66 66 90 <66> 66 90 45 85 e4 0f 88 df 01 00 00 49 63 d4 48 8d 04 52 48 8d 0c [ 1740.124194] RSP: 0018:be250634fe48 EFLAGS: 0246 [ 1740.130699] RAX: 9f733fa2c6c0 RBX: de14bfa00f00 RCX: 001f [ 1740.139315] RDX: RSI: 373a RDI: [ 1740.147943] RBP: be250634fe88 R08: 01951980e894 R09: 2840a000 [ 1740.156580] R10: 02b9 R11: 9f733fa2b364 R12: 0005 [ 1740.165266] R13: a856adc0 R14: 0005 R15: [ 1740.173911] ? cpuidle_enter_state+0xa6/0x3f0 [ 1740.179470] cpuidle_enter+0x2e/0x40 [ 1740.184136] cpuidle_idle_call+0x145/0x200 [ 1740.189359] do_idle+0x7a/0xe0 [ 1740.193426] cpu_startup_entry+0x20/0x30 [ 1740.198466] start_secondary+0xe6/0x100 [ 1740.203425] secondary_startup_64+0xb6/0xc0 [ 1740.208779] Modules linked in: igb_uio(OE) ice(OE) i40e(OE) ixgbe(OE) dell_rbu vxlan ip6_udp_tunne
Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic
Also happens in 20.04 TLS hwe-5.8.0-44-generic kernel kernel installed using 'apt install linux-generic-hwe-20.04-edge' On Thu, Feb 25, 2021 at 11:41 AM Dmitry Kravkov wrote: > Will do so > > On Wed, Feb 24, 2021 at 5:00 PM Fujinaka, Todd > wrote: > >> We only support the LTS variants. Can you try one of those? >> >> >> >> *Todd Fujinaka* >> >> Software Application Engineer >> >> Data Center Group >> >> Intel Corporation >> >> *todd.fujin...@intel.com * >> >> >> >> *From:* Dmitry Kravkov >> *Sent:* Tuesday, February 23, 2021 11:24 PM >> *To:* Fujinaka, Todd >> *Cc:* e1000-de...@lists.sf.net >> *Subject:* Re: [E1000-devel] ixgbe NULL pointer dereference on >> ubuntu-5.8.0-25-generic >> >> >> >> >> >> On Wed, Feb 24, 2021 at 2:28 AM Fujinaka, Todd >> wrote: >> >> What version of Ubuntu is this? It's going to take me a bit to try to >> find the kernel from the release. >> >> Ubuntu 20.10 >> >> >> Todd Fujinaka >> Software Application Engineer >> Data Center Group >> Intel Corporation >> todd.fujin...@intel.com >> >> -Original Message- >> From: Dmitry Kravkov >> Sent: Sunday, February 21, 2021 11:43 PM >> To: e1000-de...@lists.sf.net >> Subject: [E1000-devel] ixgbe NULL pointer dereference on >> ubuntu-5.8.0-25-generic >> >> Hi All >> >> I'm hitting the following bug during unload inbox driver and insmod'ing >> 5.9.4 (also happens with 5.10.2): >> >> [ 1739.889642] BUG: kernel NULL pointer dereference, address: >> 04f0 >> [ 1739.897969] #PF: supervisor read access in kernel mode [ 1739.904155] >> #PF: error_code(0x) - not-present page [ 1739.910327] PGD 0 P4D 0 [ >> 1739.913648] Oops: [#1] SMP PTI [ 1739.917985] CPU: 16 PID: 0 Comm: >> swapper/16 Kdump: loaded Tainted: G >> OE 5.8.0-25-generic #26-Ubuntu >> [ 1739.929943] Hardware name: /, BIOS 2.2.2 01/16/2014 [ 1739.936043] >> RIP: 0010:eth_get_headlen+0x26/0xb0 [ 1739.941625] Code: 00 00 00 00 66 66 >> 66 66 90 55 48 89 e5 41 54 53 89 d3 >> 48 83 ec 18 65 48 8b 04 25 28 00 00 00 48 89 45 e8 31 c0 83 fa 0d 76 7e >> <48> 8b bf f0 04 00 00 6a 01 49 89 f0 49 89 f4 52 48 8d 4d dc 48 c7 [ >> 1739.963567] RSP: 0018:be2506798db8 EFLAGS: 00010216 [ 1739.969961] >> RAX: RBX: 05ea RCX: >> 0002 >> [ 1739.978453] RDX: 05ea RSI: 9f6fb733c0c0 RDI: >> >> [ 1739.986957] RBP: be2506798de0 R08: R09: >> 9f733306ff00 >> [ 1739.995423] R10: 05ea R11: 0100 R12: >> 9f727b2c0740 >> [ 1740.003871] R13: 9f724b0e6010 R14: 400a838d R15: >> >> [ 1740.012330] FS: () GS:9f733fa0() >> knlGS: >> [ 1740.021848] CS: 0010 DS: ES: CR0: 80050033 [ >> 1740.028757] CR2: 04f0 CR3: 0002c740a001 CR4: >> 000606e0 >> [ 1740.037209] Call Trace: >> [ 1740.040425] >> [ 1740.043154] ixgbe_process_skb_fields+0x55/0x260 [ixgbe] [ >> 1740.049577] ixgbe_poll+0x52b/0x12c0 [ixgbe] [ 1740.054809] >> napi_poll+0x96/0x1b0 [ 1740.058985] net_rx_action+0xb8/0x1c0 [ >> 1740.063575] __do_softirq+0xd0/0x2a1 [ 1740.068055] >> asm_call_irq_on_stack+0x12/0x20 [ 1740.073345] [ 1740.076223] >> do_softirq_own_stack+0x3d/0x50 [ 1740.081402] irq_exit_rcu+0x95/0xd0 [ >> 1740.085829] common_interrupt+0x7c/0x150 [ 1740.090730] >> asm_common_interrupt+0x1e/0x40 [ 1740.095941] RIP: >> 0010:cpuidle_enter_state+0xb4/0x3f0 >> [ 1740.102049] Code: 65 8b 3d 3f fb c6 58 e8 4a 5d 74 ff 48 89 45 d0 66 66 >> 66 66 90 31 ff e8 fa 68 74 ff 80 7d c7 00 0f 85 d3 01 00 00 fb 66 66 90 >> <66> 66 90 45 85 e4 0f 88 df 01 00 00 49 63 d4 48 8d 04 52 48 8d 0c [ >> 1740.124194] RSP: 0018:be250634fe48 EFLAGS: 0246 [ 1740.130699] >> RAX: 9f733fa2c6c0 RBX: de14bfa00f00 RCX: >> 001f >> [ 1740.139315] RDX: RSI: 373a RDI: >> >> [ 1740.147943] RBP: be250634fe88 R08: 01951980e894 R09: >> 2840a000 >> [ 1740.156580] R10: 02b9 R11: 9f733fa2b364 R12: >> 0005 >> [ 1740.165266] R13: a856adc0 R14: 0005 R15: >> >> [ 1740.173911] ? cpuidle_enter_state+0xa6/0x3f0 [ 1740.179470] >> cpuidle_enter+0x2e/0x40 [ 1740.184136] cpuidle_idle_call
Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic
Will do so On Wed, Feb 24, 2021 at 5:00 PM Fujinaka, Todd wrote: > We only support the LTS variants. Can you try one of those? > > > > *Todd Fujinaka* > > Software Application Engineer > > Data Center Group > > Intel Corporation > > *todd.fujin...@intel.com * > > > > *From:* Dmitry Kravkov > *Sent:* Tuesday, February 23, 2021 11:24 PM > *To:* Fujinaka, Todd > *Cc:* e1000-de...@lists.sf.net > *Subject:* Re: [E1000-devel] ixgbe NULL pointer dereference on > ubuntu-5.8.0-25-generic > > > > > > On Wed, Feb 24, 2021 at 2:28 AM Fujinaka, Todd > wrote: > > What version of Ubuntu is this? It's going to take me a bit to try to find > the kernel from the release. > > Ubuntu 20.10 > > > Todd Fujinaka > Software Application Engineer > Data Center Group > Intel Corporation > todd.fujin...@intel.com > > -Original Message- > From: Dmitry Kravkov > Sent: Sunday, February 21, 2021 11:43 PM > To: e1000-de...@lists.sf.net > Subject: [E1000-devel] ixgbe NULL pointer dereference on > ubuntu-5.8.0-25-generic > > Hi All > > I'm hitting the following bug during unload inbox driver and insmod'ing > 5.9.4 (also happens with 5.10.2): > > [ 1739.889642] BUG: kernel NULL pointer dereference, address: > 04f0 > [ 1739.897969] #PF: supervisor read access in kernel mode [ 1739.904155] > #PF: error_code(0x) - not-present page [ 1739.910327] PGD 0 P4D 0 [ > 1739.913648] Oops: [#1] SMP PTI [ 1739.917985] CPU: 16 PID: 0 Comm: > swapper/16 Kdump: loaded Tainted: G > OE 5.8.0-25-generic #26-Ubuntu > [ 1739.929943] Hardware name: /, BIOS 2.2.2 01/16/2014 [ 1739.936043] > RIP: 0010:eth_get_headlen+0x26/0xb0 [ 1739.941625] Code: 00 00 00 00 66 66 > 66 66 90 55 48 89 e5 41 54 53 89 d3 > 48 83 ec 18 65 48 8b 04 25 28 00 00 00 48 89 45 e8 31 c0 83 fa 0d 76 7e > <48> 8b bf f0 04 00 00 6a 01 49 89 f0 49 89 f4 52 48 8d 4d dc 48 c7 [ > 1739.963567] RSP: 0018:be2506798db8 EFLAGS: 00010216 [ 1739.969961] > RAX: RBX: 05ea RCX: > 0002 > [ 1739.978453] RDX: 05ea RSI: 9f6fb733c0c0 RDI: > > [ 1739.986957] RBP: be2506798de0 R08: R09: > 9f733306ff00 > [ 1739.995423] R10: 05ea R11: 0100 R12: > 9f727b2c0740 > [ 1740.003871] R13: 9f724b0e6010 R14: 400a838d R15: > > [ 1740.012330] FS: () GS:9f733fa0() > knlGS: > [ 1740.021848] CS: 0010 DS: ES: CR0: 80050033 [ > 1740.028757] CR2: 04f0 CR3: 0002c740a001 CR4: > 000606e0 > [ 1740.037209] Call Trace: > [ 1740.040425] > [ 1740.043154] ixgbe_process_skb_fields+0x55/0x260 [ixgbe] [ > 1740.049577] ixgbe_poll+0x52b/0x12c0 [ixgbe] [ 1740.054809] > napi_poll+0x96/0x1b0 [ 1740.058985] net_rx_action+0xb8/0x1c0 [ > 1740.063575] __do_softirq+0xd0/0x2a1 [ 1740.068055] > asm_call_irq_on_stack+0x12/0x20 [ 1740.073345] [ 1740.076223] > do_softirq_own_stack+0x3d/0x50 [ 1740.081402] irq_exit_rcu+0x95/0xd0 [ > 1740.085829] common_interrupt+0x7c/0x150 [ 1740.090730] > asm_common_interrupt+0x1e/0x40 [ 1740.095941] RIP: > 0010:cpuidle_enter_state+0xb4/0x3f0 > [ 1740.102049] Code: 65 8b 3d 3f fb c6 58 e8 4a 5d 74 ff 48 89 45 d0 66 66 > 66 66 90 31 ff e8 fa 68 74 ff 80 7d c7 00 0f 85 d3 01 00 00 fb 66 66 90 > <66> 66 90 45 85 e4 0f 88 df 01 00 00 49 63 d4 48 8d 04 52 48 8d 0c [ > 1740.124194] RSP: 0018:be250634fe48 EFLAGS: 0246 [ 1740.130699] > RAX: 9f733fa2c6c0 RBX: de14bfa00f00 RCX: > 001f > [ 1740.139315] RDX: RSI: 373a RDI: > > [ 1740.147943] RBP: be250634fe88 R08: 01951980e894 R09: > 2840a000 > [ 1740.156580] R10: 02b9 R11: 9f733fa2b364 R12: > 0005 > [ 1740.165266] R13: a856adc0 R14: 0005 R15: > > [ 1740.173911] ? cpuidle_enter_state+0xa6/0x3f0 [ 1740.179470] > cpuidle_enter+0x2e/0x40 [ 1740.184136] cpuidle_idle_call+0x145/0x200 [ > 1740.189359] do_idle+0x7a/0xe0 [ 1740.193426] cpu_startup_entry+0x20/0x30 > [ 1740.198466] start_secondary+0xe6/0x100 [ 1740.203425] > secondary_startup_64+0xb6/0xc0 [ 1740.208779] Modules linked in: > igb_uio(OE) ice(OE) i40e(OE) ixgbe(OE) dell_rbu vxlan ip6_udp_tunnel > udp_tunnel ip6table_filter ip6table_raw ip6_tables mpt3sas raid_class > scsi_transport_sas mptctl mptbase xt_conntrack iptable_filter xt_tcpudp > xt_CT nf_conntrack nf_defrag_ipv6 > nf_defrag_ipv4 iptable_raw bpfilter intel_rapl_msr intel_rapl_common > sb_edac iTCO_wdt intel_pmc_bxt iTCO_vendor_support x86_pkg_temp_t
Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic
We only support the LTS variants. Can you try one of those? Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com From: Dmitry Kravkov Sent: Tuesday, February 23, 2021 11:24 PM To: Fujinaka, Todd Cc: e1000-de...@lists.sf.net Subject: Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic On Wed, Feb 24, 2021 at 2:28 AM Fujinaka, Todd mailto:todd.fujin...@intel.com>> wrote: What version of Ubuntu is this? It's going to take me a bit to try to find the kernel from the release. Ubuntu 20.10 Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com<mailto:todd.fujin...@intel.com> -Original Message- From: Dmitry Kravkov mailto:dmit...@qwilt.com>> Sent: Sunday, February 21, 2021 11:43 PM To: e1000-de...@lists.sf.net<mailto:e1000-de...@lists.sf.net> Subject: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic Hi All I'm hitting the following bug during unload inbox driver and insmod'ing 5.9.4 (also happens with 5.10.2): [ 1739.889642] BUG: kernel NULL pointer dereference, address: 04f0 [ 1739.897969] #PF: supervisor read access in kernel mode [ 1739.904155] #PF: error_code(0x) - not-present page [ 1739.910327] PGD 0 P4D 0 [ 1739.913648] Oops: [#1] SMP PTI [ 1739.917985] CPU: 16 PID: 0 Comm: swapper/16 Kdump: loaded Tainted: G OE 5.8.0-25-generic #26-Ubuntu [ 1739.929943] Hardware name: /, BIOS 2.2.2 01/16/2014 [ 1739.936043] RIP: 0010:eth_get_headlen+0x26/0xb0 [ 1739.941625] Code: 00 00 00 00 66 66 66 66 90 55 48 89 e5 41 54 53 89 d3 48 83 ec 18 65 48 8b 04 25 28 00 00 00 48 89 45 e8 31 c0 83 fa 0d 76 7e <48> 8b bf f0 04 00 00 6a 01 49 89 f0 49 89 f4 52 48 8d 4d dc 48 c7 [ 1739.963567] RSP: 0018:be2506798db8 EFLAGS: 00010216 [ 1739.969961] RAX: RBX: 05ea RCX: 0002 [ 1739.978453] RDX: 05ea RSI: 9f6fb733c0c0 RDI: [ 1739.986957] RBP: be2506798de0 R08: R09: 9f733306ff00 [ 1739.995423] R10: 05ea R11: 0100 R12: 9f727b2c0740 [ 1740.003871] R13: 9f724b0e6010 R14: 400a838d R15: [ 1740.012330] FS: () GS:9f733fa0() knlGS: [ 1740.021848] CS: 0010 DS: ES: CR0: 80050033 [ 1740.028757] CR2: 04f0 CR3: 0002c740a001 CR4: 000606e0 [ 1740.037209] Call Trace: [ 1740.040425] [ 1740.043154] ixgbe_process_skb_fields+0x55/0x260 [ixgbe] [ 1740.049577] ixgbe_poll+0x52b/0x12c0 [ixgbe] [ 1740.054809] napi_poll+0x96/0x1b0 [ 1740.058985] net_rx_action+0xb8/0x1c0 [ 1740.063575] __do_softirq+0xd0/0x2a1 [ 1740.068055] asm_call_irq_on_stack+0x12/0x20 [ 1740.073345] [ 1740.076223] do_softirq_own_stack+0x3d/0x50 [ 1740.081402] irq_exit_rcu+0x95/0xd0 [ 1740.085829] common_interrupt+0x7c/0x150 [ 1740.090730] asm_common_interrupt+0x1e/0x40 [ 1740.095941] RIP: 0010:cpuidle_enter_state+0xb4/0x3f0 [ 1740.102049] Code: 65 8b 3d 3f fb c6 58 e8 4a 5d 74 ff 48 89 45 d0 66 66 66 66 90 31 ff e8 fa 68 74 ff 80 7d c7 00 0f 85 d3 01 00 00 fb 66 66 90 <66> 66 90 45 85 e4 0f 88 df 01 00 00 49 63 d4 48 8d 04 52 48 8d 0c [ 1740.124194] RSP: 0018:be250634fe48 EFLAGS: 0246 [ 1740.130699] RAX: 9f733fa2c6c0 RBX: de14bfa00f00 RCX: 001f [ 1740.139315] RDX: RSI: 373a RDI: [ 1740.147943] RBP: be250634fe88 R08: 01951980e894 R09: 2840a000 [ 1740.156580] R10: 02b9 R11: 9f733fa2b364 R12: 0005 [ 1740.165266] R13: a856adc0 R14: 0005 R15: [ 1740.173911] ? cpuidle_enter_state+0xa6/0x3f0 [ 1740.179470] cpuidle_enter+0x2e/0x40 [ 1740.184136] cpuidle_idle_call+0x145/0x200 [ 1740.189359] do_idle+0x7a/0xe0 [ 1740.193426] cpu_startup_entry+0x20/0x30 [ 1740.198466] start_secondary+0xe6/0x100 [ 1740.203425] secondary_startup_64+0xb6/0xc0 [ 1740.208779] Modules linked in: igb_uio(OE) ice(OE) i40e(OE) ixgbe(OE) dell_rbu vxlan ip6_udp_tunnel udp_tunnel ip6table_filter ip6table_raw ip6_tables mpt3sas raid_class scsi_transport_sas mptctl mptbase xt_conntrack iptable_filter xt_tcpudp xt_CT nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_raw bpfilter intel_rapl_msr intel_rapl_common sb_edac iTCO_wdt intel_pmc_bxt iTCO_vendor_support x86_pkg_temp_thermal mgag200 intel_powerclamp drm_kms_helper cec rc_core coretemp drm kvm_intel i2c_algo_bit fb_sys_fops syscopyarea kvm sysfillrect sysimgblt rapl intel_cstate joydev pcspkr input_leds mei_me mei ipmi_si acpi_power_meter evbug ipmi_devintf lpc_ich ipmi_msghandler mac_hid ip_tables x_tables dm_multipath crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel uas crypto_simd cryptd glue_helper xfrm_algo usb_storage megaraid_sas dca tg3 wmi hid_generic usbkbd usbmo
Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic
On Wed, Feb 24, 2021 at 2:28 AM Fujinaka, Todd wrote: > What version of Ubuntu is this? It's going to take me a bit to try to find > the kernel from the release. > Ubuntu 20.10 > > Todd Fujinaka > Software Application Engineer > Data Center Group > Intel Corporation > todd.fujin...@intel.com > > -Original Message- > From: Dmitry Kravkov > Sent: Sunday, February 21, 2021 11:43 PM > To: e1000-de...@lists.sf.net > Subject: [E1000-devel] ixgbe NULL pointer dereference on > ubuntu-5.8.0-25-generic > > Hi All > > I'm hitting the following bug during unload inbox driver and insmod'ing > 5.9.4 (also happens with 5.10.2): > > [ 1739.889642] BUG: kernel NULL pointer dereference, address: > 04f0 > [ 1739.897969] #PF: supervisor read access in kernel mode [ 1739.904155] > #PF: error_code(0x) - not-present page [ 1739.910327] PGD 0 P4D 0 [ > 1739.913648] Oops: [#1] SMP PTI [ 1739.917985] CPU: 16 PID: 0 Comm: > swapper/16 Kdump: loaded Tainted: G > OE 5.8.0-25-generic #26-Ubuntu > [ 1739.929943] Hardware name: /, BIOS 2.2.2 01/16/2014 [ 1739.936043] > RIP: 0010:eth_get_headlen+0x26/0xb0 [ 1739.941625] Code: 00 00 00 00 66 66 > 66 66 90 55 48 89 e5 41 54 53 89 d3 > 48 83 ec 18 65 48 8b 04 25 28 00 00 00 48 89 45 e8 31 c0 83 fa 0d 76 7e > <48> 8b bf f0 04 00 00 6a 01 49 89 f0 49 89 f4 52 48 8d 4d dc 48 c7 [ > 1739.963567] RSP: 0018:be2506798db8 EFLAGS: 00010216 [ 1739.969961] > RAX: RBX: 05ea RCX: > 0002 > [ 1739.978453] RDX: 05ea RSI: 9f6fb733c0c0 RDI: > > [ 1739.986957] RBP: be2506798de0 R08: R09: > 9f733306ff00 > [ 1739.995423] R10: 05ea R11: 0100 R12: > 9f727b2c0740 > [ 1740.003871] R13: 9f724b0e6010 R14: 400a838d R15: > > [ 1740.012330] FS: () GS:9f733fa0() > knlGS: > [ 1740.021848] CS: 0010 DS: ES: CR0: 80050033 [ > 1740.028757] CR2: 04f0 CR3: 0002c740a001 CR4: > 000606e0 > [ 1740.037209] Call Trace: > [ 1740.040425] > [ 1740.043154] ixgbe_process_skb_fields+0x55/0x260 [ixgbe] [ > 1740.049577] ixgbe_poll+0x52b/0x12c0 [ixgbe] [ 1740.054809] > napi_poll+0x96/0x1b0 [ 1740.058985] net_rx_action+0xb8/0x1c0 [ > 1740.063575] __do_softirq+0xd0/0x2a1 [ 1740.068055] > asm_call_irq_on_stack+0x12/0x20 [ 1740.073345] [ 1740.076223] > do_softirq_own_stack+0x3d/0x50 [ 1740.081402] irq_exit_rcu+0x95/0xd0 [ > 1740.085829] common_interrupt+0x7c/0x150 [ 1740.090730] > asm_common_interrupt+0x1e/0x40 [ 1740.095941] RIP: > 0010:cpuidle_enter_state+0xb4/0x3f0 > [ 1740.102049] Code: 65 8b 3d 3f fb c6 58 e8 4a 5d 74 ff 48 89 45 d0 66 66 > 66 66 90 31 ff e8 fa 68 74 ff 80 7d c7 00 0f 85 d3 01 00 00 fb 66 66 90 > <66> 66 90 45 85 e4 0f 88 df 01 00 00 49 63 d4 48 8d 04 52 48 8d 0c [ > 1740.124194] RSP: 0018:be250634fe48 EFLAGS: 0246 [ 1740.130699] > RAX: 9f733fa2c6c0 RBX: de14bfa00f00 RCX: > 001f > [ 1740.139315] RDX: RSI: 373a RDI: > > [ 1740.147943] RBP: be250634fe88 R08: 01951980e894 R09: > 2840a000 > [ 1740.156580] R10: 02b9 R11: 9f733fa2b364 R12: > 0005 > [ 1740.165266] R13: a856adc0 R14: 0005 R15: > > [ 1740.173911] ? cpuidle_enter_state+0xa6/0x3f0 [ 1740.179470] > cpuidle_enter+0x2e/0x40 [ 1740.184136] cpuidle_idle_call+0x145/0x200 [ > 1740.189359] do_idle+0x7a/0xe0 [ 1740.193426] cpu_startup_entry+0x20/0x30 > [ 1740.198466] start_secondary+0xe6/0x100 [ 1740.203425] > secondary_startup_64+0xb6/0xc0 [ 1740.208779] Modules linked in: > igb_uio(OE) ice(OE) i40e(OE) ixgbe(OE) dell_rbu vxlan ip6_udp_tunnel > udp_tunnel ip6table_filter ip6table_raw ip6_tables mpt3sas raid_class > scsi_transport_sas mptctl mptbase xt_conntrack iptable_filter xt_tcpudp > xt_CT nf_conntrack nf_defrag_ipv6 > nf_defrag_ipv4 iptable_raw bpfilter intel_rapl_msr intel_rapl_common > sb_edac iTCO_wdt intel_pmc_bxt iTCO_vendor_support x86_pkg_temp_thermal > mgag200 intel_powerclamp drm_kms_helper cec rc_core coretemp drm kvm_intel > i2c_algo_bit fb_sys_fops syscopyarea kvm sysfillrect sysimgblt rapl > intel_cstate joydev pcspkr input_leds mei_me mei ipmi_si acpi_power_meter > evbug ipmi_devintf lpc_ich ipmi_msghandler mac_hid ip_tables x_tables > dm_multipath crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel > uas crypto_simd cryptd glue_helper xfrm_algo usb_storage megaraid_sas dca > tg3 wmi hid_generic usbkbd usbmouse usbhid hid btrfs blake2b_generic > libcrc32c xor raid6_pq sunrpc dm_mirror dm_region_hash dm_log be2iscsi > bnx2i cnic [ 1740.208816] uio cxgb4i cxgb4 cxgb3i cxgb3 mdio libcxgbi > libcxgb qla4xxx iscsi_boot_sysfs iscsi_tcp libiscsi_tcp libiscsi > scsi_transport_iscsi > autofs4 [last unloaded: igb_uio] > [ 1740.331702] CR2: 04f0 > > > Any chance that skb->dev is set to
Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic
What version of Ubuntu is this? It's going to take me a bit to try to find the kernel from the release. Todd Fujinaka Software Application Engineer Data Center Group Intel Corporation todd.fujin...@intel.com -Original Message- From: Dmitry Kravkov Sent: Sunday, February 21, 2021 11:43 PM To: e1000-de...@lists.sf.net Subject: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic Hi All I'm hitting the following bug during unload inbox driver and insmod'ing 5.9.4 (also happens with 5.10.2): [ 1739.889642] BUG: kernel NULL pointer dereference, address: 04f0 [ 1739.897969] #PF: supervisor read access in kernel mode [ 1739.904155] #PF: error_code(0x) - not-present page [ 1739.910327] PGD 0 P4D 0 [ 1739.913648] Oops: [#1] SMP PTI [ 1739.917985] CPU: 16 PID: 0 Comm: swapper/16 Kdump: loaded Tainted: G OE 5.8.0-25-generic #26-Ubuntu [ 1739.929943] Hardware name: /, BIOS 2.2.2 01/16/2014 [ 1739.936043] RIP: 0010:eth_get_headlen+0x26/0xb0 [ 1739.941625] Code: 00 00 00 00 66 66 66 66 90 55 48 89 e5 41 54 53 89 d3 48 83 ec 18 65 48 8b 04 25 28 00 00 00 48 89 45 e8 31 c0 83 fa 0d 76 7e <48> 8b bf f0 04 00 00 6a 01 49 89 f0 49 89 f4 52 48 8d 4d dc 48 c7 [ 1739.963567] RSP: 0018:be2506798db8 EFLAGS: 00010216 [ 1739.969961] RAX: RBX: 05ea RCX: 0002 [ 1739.978453] RDX: 05ea RSI: 9f6fb733c0c0 RDI: [ 1739.986957] RBP: be2506798de0 R08: R09: 9f733306ff00 [ 1739.995423] R10: 05ea R11: 0100 R12: 9f727b2c0740 [ 1740.003871] R13: 9f724b0e6010 R14: 400a838d R15: [ 1740.012330] FS: () GS:9f733fa0() knlGS: [ 1740.021848] CS: 0010 DS: ES: CR0: 80050033 [ 1740.028757] CR2: 04f0 CR3: 0002c740a001 CR4: 000606e0 [ 1740.037209] Call Trace: [ 1740.040425] [ 1740.043154] ixgbe_process_skb_fields+0x55/0x260 [ixgbe] [ 1740.049577] ixgbe_poll+0x52b/0x12c0 [ixgbe] [ 1740.054809] napi_poll+0x96/0x1b0 [ 1740.058985] net_rx_action+0xb8/0x1c0 [ 1740.063575] __do_softirq+0xd0/0x2a1 [ 1740.068055] asm_call_irq_on_stack+0x12/0x20 [ 1740.073345] [ 1740.076223] do_softirq_own_stack+0x3d/0x50 [ 1740.081402] irq_exit_rcu+0x95/0xd0 [ 1740.085829] common_interrupt+0x7c/0x150 [ 1740.090730] asm_common_interrupt+0x1e/0x40 [ 1740.095941] RIP: 0010:cpuidle_enter_state+0xb4/0x3f0 [ 1740.102049] Code: 65 8b 3d 3f fb c6 58 e8 4a 5d 74 ff 48 89 45 d0 66 66 66 66 90 31 ff e8 fa 68 74 ff 80 7d c7 00 0f 85 d3 01 00 00 fb 66 66 90 <66> 66 90 45 85 e4 0f 88 df 01 00 00 49 63 d4 48 8d 04 52 48 8d 0c [ 1740.124194] RSP: 0018:be250634fe48 EFLAGS: 0246 [ 1740.130699] RAX: 9f733fa2c6c0 RBX: de14bfa00f00 RCX: 001f [ 1740.139315] RDX: RSI: 373a RDI: [ 1740.147943] RBP: be250634fe88 R08: 01951980e894 R09: 2840a000 [ 1740.156580] R10: 02b9 R11: 9f733fa2b364 R12: 0005 [ 1740.165266] R13: a856adc0 R14: 0005 R15: [ 1740.173911] ? cpuidle_enter_state+0xa6/0x3f0 [ 1740.179470] cpuidle_enter+0x2e/0x40 [ 1740.184136] cpuidle_idle_call+0x145/0x200 [ 1740.189359] do_idle+0x7a/0xe0 [ 1740.193426] cpu_startup_entry+0x20/0x30 [ 1740.198466] start_secondary+0xe6/0x100 [ 1740.203425] secondary_startup_64+0xb6/0xc0 [ 1740.208779] Modules linked in: igb_uio(OE) ice(OE) i40e(OE) ixgbe(OE) dell_rbu vxlan ip6_udp_tunnel udp_tunnel ip6table_filter ip6table_raw ip6_tables mpt3sas raid_class scsi_transport_sas mptctl mptbase xt_conntrack iptable_filter xt_tcpudp xt_CT nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_raw bpfilter intel_rapl_msr intel_rapl_common sb_edac iTCO_wdt intel_pmc_bxt iTCO_vendor_support x86_pkg_temp_thermal mgag200 intel_powerclamp drm_kms_helper cec rc_core coretemp drm kvm_intel i2c_algo_bit fb_sys_fops syscopyarea kvm sysfillrect sysimgblt rapl intel_cstate joydev pcspkr input_leds mei_me mei ipmi_si acpi_power_meter evbug ipmi_devintf lpc_ich ipmi_msghandler mac_hid ip_tables x_tables dm_multipath crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel uas crypto_simd cryptd glue_helper xfrm_algo usb_storage megaraid_sas dca tg3 wmi hid_generic usbkbd usbmouse usbhid hid btrfs blake2b_generic libcrc32c xor raid6_pq sunrpc dm_mirror dm_region_hash dm_log be2iscsi bnx2i cnic [ 1740.208816] uio cxgb4i cxgb4 cxgb3i cxgb3 mdio libcxgbi libcxgb qla4xxx iscsi_boot_sysfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi autofs4 [last unloaded: igb_uio] [ 1740.331702] CR2: 04f0 Any chance that skb->dev is set to zero in ixgbe_set_rsc_gso_size ? I noticed that in kernel code ixgbe_set_rsc_gso_size() calls skb_headlen(skb) and not eth_get_headlen(skb->dev, skb->data, skb_headlen(skb)); -- Thanks, Dmitry