Public bug reported:
SRU Justification:
[Impact]
Google is experiencing an issue with kernel and NVIDIA driver updates
where they are updated into a state in which either the kernel/module or
the kernel/user versions aren't lining up. This causes a hard failure in
some of the NVIDIA tools, for example a mismatch produced the following
kernel output:
NVRM: API mismatch: the client has the version 570.158.01, but
NVRM: this kernel module has the version 570.133.20. Please
NVRM: make sure that this kernel module and all NVIDIA driver
NVRM: components have the same version.
[Fix]
As a stop-gap, Google has begun to place holds on certain packages, but
this opens them up to other undesirable states. They are requesting that
we do something in our packaging to prevent the mismatch.
We are investigating updating the package control file in an attempt to
prevent the user space packages from going out of sync with the nvidia-
kernel-common-570-server driver package.
[Test Plan]
An internal Salesforce case documents the sequence of events that can
lead to the version mismatch case. We will follow those steps to verify
any proposed solution.
[Where problems could occur]
There is concern that adjusting the control file this way could lead to
a situation where the system is rendered unable to add or remove
packages.
** Affects: linux-gcp (Ubuntu)
Importance: Undecided
Status: Invalid
** Affects: linux-gcp (Ubuntu Jammy)
Importance: Undecided
Assignee: Tim Whisonant (tswhison)
Status: In Progress
** Affects: linux-gcp (Ubuntu Noble)
Importance: Undecided
Assignee: Tim Whisonant (tswhison)
Status: In Progress
** Package changed: linux-hwe-6.14 (Ubuntu) => linux-gcp (Ubuntu)
** Also affects: linux-gcp (Ubuntu Jammy)
Importance: Undecided
Status: New
** Also affects: linux-gcp (Ubuntu Noble)
Importance: Undecided
Status: New
** Changed in: linux-gcp (Ubuntu)
Status: New => Invalid
** Changed in: linux-gcp (Ubuntu Jammy)
Assignee: (unassigned) => Tim Whisonant (tswhison)
** Changed in: linux-gcp (Ubuntu Noble)
Assignee: (unassigned) => Tim Whisonant (tswhison)
** Changed in: linux-gcp (Ubuntu Jammy)
Status: New => In Progress
** Changed in: linux-gcp (Ubuntu Noble)
Status: New => In Progress
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2121633
Title:
NVIDIA user space packages not held when drivers are pinned
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-gcp/+bug/2121633/+subscriptions
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs