Also, could there be a BIOS parameter that could be changed to help
here?
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1934620
Title:
NVIDIA A100-80GB GPU fails to initialize in
Hey Sujith, so far no real update from nVidia other than the fact that
not just the GPUs were having issues:
Jun 21 06:14:47 R750XAS kernel: [ 59.183873] pnp 00:00: Plug and Play ACPI device, IDs PNP0b00 (active)
Jun 21 06:14:47 R750XAS kernel: [ 59.183896] pnp 00:01: disabling [mem
closing the kernel task as there's really nothing for us to do here,
unfortunately. Still waiting on nVidia to respond about the logs and if
they have any advice (I asked on Monday again and they're going to check
with the internal team they sent the logs to again).
** Changed in: linux
@Jeff - I have sent them via email to you and Michael. Please check.
--
@Sujith - I'm not seeing any updated logs... where were they uploaded?
--
** Attachment removed: "nvidia-bug-report log"
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1934620/+attachment/5592914/+files/nvidia-bug-report.log.gz
--
Uploaded the requested logs.
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1934620
Title:
NVIDIA A100-80GB GPU fails to initialize in R750xa, BAR Address setup
failed if SR-IOV is
** Attachment added: "nvidia-bug-report log"
https://bugs.launchpad.net/dellserver/+bug/1934620/+attachment/5592914/+files/nvidia-bug-report.log.gz
--
Team is working on getting the required GPU setup.
--
Hi Sujith - can you update this bug?
--
Hi Sujith, have you been able to get the nvidia bug report log for them
to help debug this issue?
--
** Changed in: linux (Ubuntu)
Status: Confirmed => Incomplete
--
Sujith,
Can you get a nvidia-bug-report.log
--
Hi Jeff,
Original repro machine had ~1TB memory.
Previously attached log snip:
Memory: 1055972180K/1073051196K available (14339K kernel code, 2400K rwdata, 5008K rodata, 2736K init, 4964K bss, 17079016K reserved, 0K cma-reserved)
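For reference, the "~1TB" figure can be checked against the quoted log line. The snippet below is only a rough sketch: it assumes the "K" suffix in the kernel's "Memory:" line means KiB (1024 bytes), and the helper name parse_memory_line is made up for illustration. It converts the figures to GiB and confirms that total minus available equals the reported reserved amount.

```python
import re

# Kernel boot line quoted in the message above, joined onto one line.
LINE = ("Memory: 1055972180K/1073051196K available (14339K kernel code, "
        "2400K rwdata, 5008K rodata, 2736K init, 4964K bss, "
        "17079016K reserved, 0K cma-reserved)")

# Assumption: "K" in this line means KiB, so 1 GiB = 1024 * 1024 K.
KIB_PER_GIB = 1024 * 1024


def parse_memory_line(line):
    """Return (available, total, reserved) in KiB from a 'Memory:' line.

    Hypothetical helper for illustration, not part of any tool.
    """
    sizes = re.search(r"Memory: (\d+)K/(\d+)K available", line)
    # "K reserved" (with a space) skips the "0K cma-reserved" field.
    resv = re.search(r"(\d+)K reserved", line)
    if sizes is None or resv is None:
        raise ValueError("not a recognised 'Memory:' line")
    return int(sizes.group(1)), int(sizes.group(2)), int(resv.group(1))


available, total, reserved = parse_memory_line(LINE)
print(f"total:     {total / KIB_PER_GIB:.1f} GiB")
print(f"available: {available / KIB_PER_GIB:.1f} GiB")
print(f"reserved:  {reserved / KIB_PER_GIB:.1f} GiB")
```

Under that assumption the total comes to roughly 1023 GiB, which is consistent with the "~1TB" description.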
--
Hi Sujith, can you tell us what server this is, and how much RAM is in
it? Just wondering.
--
** Changed in: linux (Ubuntu)
Status: Won't Fix => Confirmed
--
** Changed in: linux (Ubuntu)
Status: Incomplete => Won't Fix
** Changed in: dellserver
Status: New => Won't Fix
--
** Description changed:
- Installing NVidia A100-80GB GPUs in R750XA fails to initialize BAR memory
space for GPUs.
+ Installing NVidia A100-80GB GPUs in R750XA fails to initialize BAR memory
space for GPUs.
Multiple error messages in kernel.log file.
+
+ Summary:
+ * Dell prefers to set
** Information type changed from Private to Public
--