[Kernel-packages] [Bug 1931106] Re: bnxt_en NIC driver crashes IO_PAGE_FAULT

2021-09-12 Thread Launchpad Bug Tracker
[Expired for linux (Ubuntu) because there has been no activity for 60
days.]

** Changed in: linux (Ubuntu)
   Status: Incomplete => Expired

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1931106

Title:
  bnxt_en NIC driver crashes IO_PAGE_FAULT

Status in linux package in Ubuntu:
  Expired

Bug description:
  Hi all,

  We received a bunch of new servers with a Supermicro H12SSL-NT
  mainboard that has an embedded Broadcom BCM57416 NIC.

  On all those servers we observe crashes of the NIC driver (bnxt_en)
  from time to time. We're not able to manually reproduce this issue, it
  just occurs at some point. Also our monitoring does not show any
  irregularities(high traffic flow or sth. like this).

  Syslog: https://pastebin.com/yDAyjHvF

  All servers are running with up-to-date packages:
  $ lsb_release -rd
  Description: Ubuntu 20.04.2 LTS
  Release: 20.04
  $ uname -r
  5.4.0-73-generic 

  It also happens on older kernel versions (tested 5.4.0-66) as well as
  the HWE kernel (tested 5.8.0-55).

  
  Thanks in advance.
  ~ Roman

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1931106] Re: bnxt_en NIC driver crashes IO_PAGE_FAULT

2021-07-14 Thread Kai-Heng Feng
One possible workaround is to make the device use identity mapping, but
that requires change the kernel source code.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1931106

Title:
  bnxt_en NIC driver crashes IO_PAGE_FAULT

Status in linux package in Ubuntu:
  Incomplete

Bug description:
  Hi all,

  We received a bunch of new servers with a Supermicro H12SSL-NT
  mainboard that has an embedded Broadcom BCM57416 NIC.

  On all those servers we observe crashes of the NIC driver (bnxt_en)
  from time to time. We're not able to manually reproduce this issue, it
  just occurs at some point. Also our monitoring does not show any
  irregularities(high traffic flow or sth. like this).

  Syslog: https://pastebin.com/yDAyjHvF

  All servers are running with up-to-date packages:
  $ lsb_release -rd
  Description: Ubuntu 20.04.2 LTS
  Release: 20.04
  $ uname -r
  5.4.0-73-generic 

  It also happens on older kernel versions (tested 5.4.0-66) as well as
  the HWE kernel (tested 5.8.0-55).

  
  Thanks in advance.
  ~ Roman

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1931106] Re: bnxt_en NIC driver crashes IO_PAGE_FAULT

2021-07-14 Thread aft2d
Update:
I've contacted the mail addresses from the Kai-Heng's post.
Michael Chan (from Broadcom) replied that they've seen similar issues on other 
AMD systems and that they were working with AMD to resolve this.
The plan was to establish contact between me and AMD, unfortunately this never 
happened.  The attempt to contact AMD via the official way (tech support) 
failed because I could not answer AMD's questions without feedback from 
Broadcom, who then also did not reply anymore.

Workaround:
Luckily, with the information that came out of the conversation with Broadcom, 
I was able to troubleshoot a bit myself since I knew at least somewhat where to 
look.
It appears that by setting Advanced -> NB Configuration -> IOMMU to "disabled" 
(default is "Auto") in Supermicro BIOS the problem does not occur anymore.

Since then the whole topic is "stuck".

It's just a workaround and not really a fix, but at least servers
running stable now for me. Since I don't know where the actual problem
is (whether in AMD hardware, bios, kernel, or whatever) so I can't say
if this bug report can be marked as closed or not.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1931106

Title:
  bnxt_en NIC driver crashes IO_PAGE_FAULT

Status in linux package in Ubuntu:
  Incomplete

Bug description:
  Hi all,

  We received a bunch of new servers with a Supermicro H12SSL-NT
  mainboard that has an embedded Broadcom BCM57416 NIC.

  On all those servers we observe crashes of the NIC driver (bnxt_en)
  from time to time. We're not able to manually reproduce this issue, it
  just occurs at some point. Also our monitoring does not show any
  irregularities(high traffic flow or sth. like this).

  Syslog: https://pastebin.com/yDAyjHvF

  All servers are running with up-to-date packages:
  $ lsb_release -rd
  Description: Ubuntu 20.04.2 LTS
  Release: 20.04
  $ uname -r
  5.4.0-73-generic 

  It also happens on older kernel versions (tested 5.4.0-66) as well as
  the HWE kernel (tested 5.8.0-55).

  
  Thanks in advance.
  ~ Roman

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1931106] Re: bnxt_en NIC driver crashes IO_PAGE_FAULT

2021-06-08 Thread Kai-Heng Feng
Can you please raise the issue to the following mail address:
$ scripts/get_maintainer.pl -f drivers/net/ethernet/broadcom/bnxt   
Michael Chan  (supporter:BROADCOM BNXT_EN 50 GIGABIT 
ETHERNET DRIVER)
"David S. Miller"  (maintainer:NETWORKING DRIVERS)
Jakub Kicinski  (maintainer:NETWORKING DRIVERS)
net...@vger.kernel.org (open list:BROADCOM BNXT_EN 50 GIGABIT ETHERNET DRIVER)
linux-ker...@vger.kernel.org (open list)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1931106

Title:
  bnxt_en NIC driver crashes IO_PAGE_FAULT

Status in linux package in Ubuntu:
  Incomplete

Bug description:
  Hi all,

  We received a bunch of new servers with a Supermicro H12SSL-NT
  mainboard that has an embedded Broadcom BCM57416 NIC.

  On all those servers we observe crashes of the NIC driver (bnxt_en)
  from time to time. We're not able to manually reproduce this issue, it
  just occurs at some point. Also our monitoring does not show any
  irregularities(high traffic flow or sth. like this).

  Syslog: https://pastebin.com/yDAyjHvF

  All servers are running with up-to-date packages:
  $ lsb_release -rd
  Description: Ubuntu 20.04.2 LTS
  Release: 20.04
  $ uname -r
  5.4.0-73-generic 

  It also happens on older kernel versions (tested 5.4.0-66) as well as
  the HWE kernel (tested 5.8.0-55).

  
  Thanks in advance.
  ~ Roman

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1931106] Re: bnxt_en NIC driver crashes IO_PAGE_FAULT

2021-06-07 Thread aft2d
Did that, and after ~1 hour after I put them back in production 4 out of
5 servers I've upgraded to v5.13-rc5 crashed and the last after one more
hour.

For comaparison, with the old kernel version it occured ~2/3 times a week.
 
New syslog: https://pastebin.com/GWqtVaA3

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1931106

Title:
  bnxt_en NIC driver crashes IO_PAGE_FAULT

Status in linux package in Ubuntu:
  Incomplete

Bug description:
  Hi all,

  We received a bunch of new servers with a Supermicro H12SSL-NT
  mainboard that has an embedded Broadcom BCM57416 NIC.

  On all those servers we observe crashes of the NIC driver (bnxt_en)
  from time to time. We're not able to manually reproduce this issue, it
  just occurs at some point. Also our monitoring does not show any
  irregularities(high traffic flow or sth. like this).

  Syslog: https://pastebin.com/yDAyjHvF

  All servers are running with up-to-date packages:
  $ lsb_release -rd
  Description: Ubuntu 20.04.2 LTS
  Release: 20.04
  $ uname -r
  5.4.0-73-generic 

  It also happens on older kernel versions (tested 5.4.0-66) as well as
  the HWE kernel (tested 5.8.0-55).

  
  Thanks in advance.
  ~ Roman

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1931106] Re: bnxt_en NIC driver crashes IO_PAGE_FAULT

2021-06-07 Thread Kai-Heng Feng
Please test latest mainline kernel:
https://kernel.ubuntu.com/~kernel-ppa/mainline/v5.13-rc5/amd64/

Headers are not needed.

** Changed in: linux (Ubuntu)
   Status: New => Incomplete

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1931106

Title:
  bnxt_en NIC driver crashes IO_PAGE_FAULT

Status in linux package in Ubuntu:
  Incomplete

Bug description:
  Hi all,

  We received a bunch of new servers with a Supermicro H12SSL-NT
  mainboard that has an embedded Broadcom BCM57416 NIC.

  On all those servers we observe crashes of the NIC driver (bnxt_en)
  from time to time. We're not able to manually reproduce this issue, it
  just occurs at some point. Also our monitoring does not show any
  irregularities(high traffic flow or sth. like this).

  Syslog: https://pastebin.com/yDAyjHvF

  All servers are running with up-to-date packages:
  $ lsb_release -rd
  Description: Ubuntu 20.04.2 LTS
  Release: 20.04
  $ uname -r
  5.4.0-73-generic 

  It also happens on older kernel versions (tested 5.4.0-66) as well as
  the HWE kernel (tested 5.8.0-55).

  
  Thanks in advance.
  ~ Roman

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1931106] Re: bnxt_en NIC driver crashes IO_PAGE_FAULT

2021-06-07 Thread aft2d
** Attachment added: "apport.linux-image-5.8.0-55-generic.cime34c6.apport"
   
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106/+attachment/5502930/+files/apport.linux-image-5.8.0-55-generic.cime34c6.apport

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1931106

Title:
  bnxt_en NIC driver crashes IO_PAGE_FAULT

Status in linux package in Ubuntu:
  New

Bug description:
  Hi all,

  We received a bunch of new servers with a Supermicro H12SSL-NT
  mainboard that has an embedded Broadcom BCM57416 NIC.

  On all those servers we observe crashes of the NIC driver (bnxt_en)
  from time to time. We're not able to manually reproduce this issue, it
  just occurs at some point. Also our monitoring does not show any
  irregularities(high traffic flow or sth. like this).

  Syslog: https://pastebin.com/yDAyjHvF

  All servers are running with up-to-date packages:
  $ lsb_release -rd
  Description: Ubuntu 20.04.2 LTS
  Release: 20.04
  $ uname -r
  5.4.0-73-generic 

  It also happens on older kernel versions (tested 5.4.0-66) as well as
  the HWE kernel (tested 5.8.0-55).

  
  Thanks in advance.
  ~ Roman

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1931106/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp