Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic

2021-05-27 Thread Fujinaka, Todd
We are unable to reproduce. I think Ubuntu might be moving kernels around on us 
because 'apt install linux-generic-hwe-20.04-edge' gave us 5.8.0-53-generic.

I'm going to call this closed unless you're still seeing this with the latest 
kernels.

Todd Fujinaka
Software Application Engineer
Data Center Group
Intel Corporation
todd.fujin...@intel.com


Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic

2021-03-01 Thread Fujinaka, Todd
OK, I’ll file an internal ticket.

Todd Fujinaka
Software Application Engineer
Data Center Group
Intel Corporation
todd.fujin...@intel.com


Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic

2021-03-01 Thread Dmitry Kravkov
Also happens on 20.04 LTS with the hwe-5.8.0-44-generic kernel,
installed using 'apt install linux-generic-hwe-20.04-edge'



Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic

2021-02-25 Thread Dmitry Kravkov
Will do so


Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic

2021-02-24 Thread Fujinaka, Todd
We only support the LTS variants. Can you try one of those?

Todd Fujinaka
Software Application Engineer
Data Center Group
Intel Corporation
todd.fujin...@intel.com


Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic

2021-02-24 Thread Dmitry Kravkov
On Wed, Feb 24, 2021 at 2:28 AM Fujinaka, Todd wrote:

> What version of Ubuntu is this? It's going to take me a bit to try to find
> the kernel from the release.
>
Ubuntu 20.10


Re: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic

2021-02-23 Thread Fujinaka, Todd
What version of Ubuntu is this? It's going to take me a bit to try to find the 
kernel from the release.

Todd Fujinaka
Software Application Engineer
Data Center Group
Intel Corporation
todd.fujin...@intel.com

-----Original Message-----
From: Dmitry Kravkov  
Sent: Sunday, February 21, 2021 11:43 PM
To: e1000-de...@lists.sf.net
Subject: [E1000-devel] ixgbe NULL pointer dereference on ubuntu-5.8.0-25-generic

Hi All

I'm hitting the following bug when unloading the inbox driver and insmod'ing
5.9.4 (it also happens with 5.10.2):

[ 1739.889642] BUG: kernel NULL pointer dereference, address: 04f0
[ 1739.897969] #PF: supervisor read access in kernel mode
[ 1739.904155] #PF: error_code(0x) - not-present page
[ 1739.910327] PGD 0 P4D 0
[ 1739.913648] Oops:  [#1] SMP PTI
[ 1739.917985] CPU: 16 PID: 0 Comm: swapper/16 Kdump: loaded Tainted: G  OE 5.8.0-25-generic #26-Ubuntu
[ 1739.929943] Hardware name:  /, BIOS 2.2.2 01/16/2014
[ 1739.936043] RIP: 0010:eth_get_headlen+0x26/0xb0
[ 1739.941625] Code: 00 00 00 00 66 66 66 66 90 55 48 89 e5 41 54 53 89 d3 48 83 ec 18 65 48 8b 04 25 28 00 00 00 48 89 45 e8 31 c0 83 fa 0d 76 7e <48> 8b bf f0 04 00 00 6a 01 49 89 f0 49 89 f4 52 48 8d 4d dc 48 c7
[ 1739.963567] RSP: 0018:be2506798db8 EFLAGS: 00010216
[ 1739.969961] RAX:  RBX: 05ea RCX: 0002
[ 1739.978453] RDX: 05ea RSI: 9f6fb733c0c0 RDI:
[ 1739.986957] RBP: be2506798de0 R08:  R09: 9f733306ff00
[ 1739.995423] R10: 05ea R11: 0100 R12: 9f727b2c0740
[ 1740.003871] R13: 9f724b0e6010 R14: 400a838d R15:
[ 1740.012330] FS:  () GS:9f733fa0() knlGS:
[ 1740.021848] CS:  0010 DS:  ES:  CR0: 80050033
[ 1740.028757] CR2: 04f0 CR3: 0002c740a001 CR4: 000606e0
[ 1740.037209] Call Trace:
[ 1740.040425]  <IRQ>
[ 1740.043154]  ixgbe_process_skb_fields+0x55/0x260 [ixgbe]
[ 1740.049577]  ixgbe_poll+0x52b/0x12c0 [ixgbe]
[ 1740.054809]  napi_poll+0x96/0x1b0
[ 1740.058985]  net_rx_action+0xb8/0x1c0
[ 1740.063575]  __do_softirq+0xd0/0x2a1
[ 1740.068055]  asm_call_irq_on_stack+0x12/0x20
[ 1740.073345]  </IRQ>
[ 1740.076223]  do_softirq_own_stack+0x3d/0x50
[ 1740.081402]  irq_exit_rcu+0x95/0xd0
[ 1740.085829]  common_interrupt+0x7c/0x150
[ 1740.090730]  asm_common_interrupt+0x1e/0x40
[ 1740.095941] RIP: 0010:cpuidle_enter_state+0xb4/0x3f0
[ 1740.102049] Code: 65 8b 3d 3f fb c6 58 e8 4a 5d 74 ff 48 89 45 d0 66 66 66 66 90 31 ff e8 fa 68 74 ff 80 7d c7 00 0f 85 d3 01 00 00 fb 66 66 90 <66> 66 90 45 85 e4 0f 88 df 01 00 00 49 63 d4 48 8d 04 52 48 8d 0c
[ 1740.124194] RSP: 0018:be250634fe48 EFLAGS: 0246
[ 1740.130699] RAX: 9f733fa2c6c0 RBX: de14bfa00f00 RCX: 001f
[ 1740.139315] RDX:  RSI: 373a RDI:
[ 1740.147943] RBP: be250634fe88 R08: 01951980e894 R09: 2840a000
[ 1740.156580] R10: 02b9 R11: 9f733fa2b364 R12: 0005
[ 1740.165266] R13: a856adc0 R14: 0005 R15:
[ 1740.173911]  ? cpuidle_enter_state+0xa6/0x3f0
[ 1740.179470]  cpuidle_enter+0x2e/0x40
[ 1740.184136]  cpuidle_idle_call+0x145/0x200
[ 1740.189359]  do_idle+0x7a/0xe0
[ 1740.193426]  cpu_startup_entry+0x20/0x30
[ 1740.198466]  start_secondary+0xe6/0x100
[ 1740.203425]  secondary_startup_64+0xb6/0xc0
[ 1740.208779] Modules linked in: igb_uio(OE) ice(OE) i40e(OE) ixgbe(OE)
  dell_rbu vxlan ip6_udp_tunnel udp_tunnel ip6table_filter ip6table_raw
  ip6_tables mpt3sas raid_class scsi_transport_sas mptctl mptbase
  xt_conntrack iptable_filter xt_tcpudp xt_CT nf_conntrack nf_defrag_ipv6
  nf_defrag_ipv4 iptable_raw bpfilter intel_rapl_msr intel_rapl_common
  sb_edac iTCO_wdt intel_pmc_bxt iTCO_vendor_support x86_pkg_temp_thermal
  mgag200 intel_powerclamp drm_kms_helper cec rc_core coretemp drm kvm_intel
  i2c_algo_bit fb_sys_fops syscopyarea kvm sysfillrect sysimgblt rapl
  intel_cstate joydev pcspkr input_leds mei_me mei ipmi_si acpi_power_meter
  evbug ipmi_devintf lpc_ich ipmi_msghandler mac_hid ip_tables x_tables
  dm_multipath crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel
  uas crypto_simd cryptd glue_helper xfrm_algo usb_storage megaraid_sas dca
  tg3 wmi hid_generic usbkbd usbmouse usbhid hid btrfs blake2b_generic
  libcrc32c xor raid6_pq sunrpc dm_mirror dm_region_hash dm_log be2iscsi
  bnx2i cnic
[ 1740.208816]  uio cxgb4i cxgb4 cxgb3i cxgb3 mdio libcxgbi libcxgb qla4xxx
  iscsi_boot_sysfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
  autofs4 [last unloaded: igb_uio]
[ 1740.331702] CR2: 04f0


Any chance that skb->dev is NULL in ixgbe_set_rsc_gso_size()?

I noticed that in the kernel code, ixgbe_set_rsc_gso_size() calls
skb_headlen(skb) rather than
eth_get_headlen(skb->dev, skb->data, skb_headlen(skb)).


--
Thanks,
Dmitry
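
One way to check the skb->dev question against the trace: in the first "Code:" line, the byte marked <48> starts the faulting instruction. The Python sketch below decodes just that instruction. The helper name decode_mov_load is made up for illustration, and it handles only this one encoding (REX.W MOV with a 32-bit displacement and no SIB byte). The result is a load from offset 0x4f0 relative to RDI, which on x86-64 normally carries the first function argument. Together with CR2 = 04f0, that is consistent with eth_get_headlen() being called with a NULL first argument, although it does not show where the NULL came from.

```python
# Decode the faulting instruction from the oops "Code:" line.
# Assumption: only the form seen in this trace is supported, i.e.
# REX.W + 8B /r with mod=10 (32-bit displacement) and no SIB byte.

REGS = ["rax", "rcx", "rdx", "rbx", "rsp", "rbp", "rsi", "rdi"]


def decode_mov_load(code: bytes):
    """Return (dest_reg, base_reg, displacement) for MOV r64, [base + disp32]."""
    rex, opcode, modrm = code[0], code[1], code[2]
    if rex != 0x48 or opcode != 0x8B:
        raise ValueError("not a REX.W MOV r64, r/m64 instruction")
    mod = modrm >> 6          # addressing mode
    reg = (modrm >> 3) & 7    # destination register
    rm = modrm & 7            # base register
    if mod != 0b10 or rm == 0b100:
        raise ValueError("only [base + disp32] without SIB is handled")
    disp = int.from_bytes(code[3:7], "little", signed=True)
    return REGS[reg], REGS[rm], disp


# Bytes starting at the <48> marker in the first "Code:" line of the trace.
faulting = bytes.fromhex("48 8b bf f0 04 00 00")
dest, base, disp = decode_mov_load(faulting)
print(f"mov {hex(disp)}(%{base}),%{dest}")

# With a NULL base pointer, the faulting address is just the displacement.
fault_address = 0 + disp
print(hex(fault_address))
```

The decoded offset matches the address reported in the BUG line and in CR2, so the base register was zero at the time of the read.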