[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-04-09 Thread Abhishek Chauhan
The fix is also available on 535.171.04 available here - https://www.nvidia.com/Download/driverResults.aspx/223761/en-us/ -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-hwe-6.5 in Ubuntu. https://bugs.launchpad.net/bugs/2029934

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-04-02 Thread Abhishek Chauhan
Hi all, This should be fixed on the latest driver 550.67 - https://www.nvidia.com/Download/driverResults.aspx/223429/en-us/. Please help verify if this is resolved on your systems. Thanks! -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-02-07 Thread Mitchell Augustin
I identified a similar bug today when installing nvidia- fabricmanager-535 on a noble dev build for arm64 that may be related: https://bugs.launchpad.net/ubuntu/+source/fabric- manager-535/+bug/2052663 -- You received this bug notification because you are a member of Kernel Packages, which is

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-30 Thread Simon Fels
I gave this another spin today with 6.5.0-17-generic #17~22.04.1 and the LRM modules of the 535 driver (6.5.0-17.17~22.04.1+1 of linux-modules- nvidia-535-server-generic-hwe-22.04) on our Altra system with 2x L4 GPUs and the same problem exists as with the DKMS modules: [ 39.437849] watchdog:

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-26 Thread Dimitri John Ledkov
** Changed in: nvidia-graphics-drivers-525 (Ubuntu) Status: Confirmed => Incomplete ** Changed in: nvidia-graphics-drivers-525-server (Ubuntu) Status: Confirmed => Incomplete ** Changed in: linux-aws (Ubuntu) Status: Confirmed => Incomplete ** Also affects: linux-hwe-6.5

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-26 Thread Simon Fels
Verified that with linux-aws-edge 6.5.0.1012.12~22.04.1 the DKMS installation via $ sudo apt install -y nvidia-driver-535-server on an AWS g5g.xlarge goes through the driver comes up fine. Trying the same with linux-generic-hwe-22.04-edge 6.5.0-17-generic #17~22.04.1 on an Ampere Altra with 2x

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-26 Thread Francis Ginther
I can reproduce the failure on mantic with both the DKMS and LRM drivers. Specifically what I'm doing to install these are: for DKMS: sudo DEBIAN_FRONTEND=noninteractive apt-get install -y nvidia-driver-535-server for LRM: sudo DEBIAN_FRONTEND=noninteractive apt-get install -y

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users. ** Changed in: nvidia-graphics-drivers-535-server (Ubuntu) Status: New => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-aws in Ubuntu.

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users. ** Changed in: nvidia-graphics-drivers-535 (Ubuntu) Status: New => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-aws in Ubuntu.

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users. ** Changed in: nvidia-graphics-drivers-525-server (Ubuntu) Status: New => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-aws in Ubuntu.

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users. ** Changed in: nvidia-graphics-drivers-525 (Ubuntu) Status: New => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-aws in Ubuntu.

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users. ** Changed in: linux-aws (Ubuntu) Status: New => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-aws in Ubuntu.

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Dimitri John Ledkov
I am surprised that `ubuntu-drivers list` doesn't provide any drivers to install, when it really should. To install pre-built drivers I use $ sudo apt install linux-modules-nvidia-535-server-aws nvidia- headless-535-server Such that signed nvidia modules provided by Canonical are installed.

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Dimitri John Ledkov
I wonder if the bug is with trying to install self-built dkms modules, instead of pre-built ones, and how come ubuntu-drivers is not offering pre-built ones... -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-aws in Ubuntu.

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Dimitri John Ledkov
and everything seems to work fine. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-aws in Ubuntu. https://bugs.launchpad.net/bugs/2029934 Title: arm64 AWS host hangs during modprobe nvidia on lunar and mantic Status in linux-aws

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Simon Fels
Trying the same with the linux-nvidia-hwe-22.04-edge kernel from proposed linux-image-6.5.0-1011-nvidia wit the same NVIDIA driver (535.154.05-0ubuntu0.22.04.1 of nvidia-utils-535-server) and loading kernel driver and running nvidia-smi works fine without problems. -- You received this bug

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Simon Fels
Verified that the issue does not exist with 535.154.05-0ubuntu0.22.04.1 of nvidia-utils-535-server on 6.2.0-1017-aws or 6.2.0-1018-aws of linux- aws. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-aws in Ubuntu.

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-25 Thread Simon Fels
I can reproduce the the same with the latest 535.154.05-0ubuntu0.22.04.1 on jammy with the 6.5 HWE kernel on an arm64 machine. The same happens with the -server driver 535.154.05-0ubuntu0.22.04.1. Reproducing is pretty simple: 1. Boot plain Ubuntu 24.04 with either HWE already installed or

[Kernel-packages] [Bug 2029934] Re: arm64 AWS host hangs during modprobe nvidia on lunar and mantic

2024-01-23 Thread Dimitri John Ledkov
since then, we had multiple glibc srus; kernel sru's and most recently new release of 535-server. can i request for this to be retested again? -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-aws in Ubuntu.