Public bug reported:

This patch series adds comprehensive CXL (Compute Express Link) support
to the nvidia-6.17 kernel, including:

1. CXL Type-2 device support - Enables accelerator devices (like GPUs
and SmartNICs) to use CXL for coherent memory access

2. CXL RAS (Reliability, Availability, Serviceability) error handling -
Implements PCIe Port Protocol error handling and logging for CXL devices

3. Prerequisite CXL driver updates - Cherry-picked commits from Linux
v6.18 that are required dependencies


Key Features Added:

    CXL Type-2 accelerator device registration and memory management
    CXL region creation by Type-2 drivers
    DPA (Device Physical Address) allocation interface for accelerators
    HPA (Host Physical Address) free space enumeration
    CXL protocol error detection, forwarding, and recovery
    RAS register mapping for CXL Endpoints and Switch Ports

Justification

CXL Type-2 device support is critical for next-generation NVIDIA
accelerators and data center workloads:


    Enables coherent memory sharing between CPUs and accelerators
    Supports firmware-provisioned CXL regions for accelerator memory
    Provides proper error handling and reporting for CXL fabric errors
    Required for upcoming NVIDIA hardware with CXL capabilities


Patch Breakdown (80 commits total):

Category                          Count  Source
Revert old CXL reset               1     OOT (cleanup)
v6.18 CXL driver prerequisites    28     Upstream (cherry-picked from torvalds/linux v6.18)
Terry Bowman's CXL RAS series     25     Upstream (RESEND v13)
Alejandro Lucero's Type-2 series  25     Upstream (v22)
CXL Config update                  1     OOT (build config)


Lore Links:
Terry Bowman's CXL RAS series (RESEND v13):
https://lore.kernel.org/linux-cxl/[email protected]/

Alejandro Lucero's CXL Type-2 series (v22):
https://lore.kernel.org/linux-cxl/[email protected]/

Notes

CONFIG_CXL_BUS and CONFIG_CXL_PCI are changed from tristate to bool by
the Type-2 patches (an intentional design change for built-in CXL
support), so they can no longer be built as modules.

Kernel config annotations have been updated in
debian.nvidia-6.17/config/annotations to reflect this change.
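
One way to confirm the tristate-to-bool change on a built kernel is to
check that both symbols are set to y rather than m. The following is a
minimal sketch and not part of the patch series: cxl_config_state and
all_builtin are hypothetical helper names, and it assumes the kernel
config is available as plain text (on Ubuntu this is typically
/boot/config-$(uname -r)).

```python
import re

# Hypothetical helpers for illustration; not part of the patch series.
DEFAULT_SYMBOLS = ("CONFIG_CXL_BUS", "CONFIG_CXL_PCI")


def cxl_config_state(config_text, symbols=DEFAULT_SYMBOLS):
    """Return {symbol: value} parsed from kernel .config-style text.

    A symbol that is absent or marked "is not set" is reported as 'n'.
    """
    state = {sym: "n" for sym in symbols}
    for line in config_text.splitlines():
        match = re.match(r"^(CONFIG_\w+)=(.*)$", line.strip())
        if match and match.group(1) in state:
            state[match.group(1)] = match.group(2)
    return state


def all_builtin(state):
    """True only if every checked symbol is built in (=y), not modular (=m)."""
    return all(value == "y" for value in state.values())
```

Passing the contents of the installed kernel's config file to
cxl_config_state and then to all_builtin should return True on a kernel
built with this series, assuming both options are enabled as built-in.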

** Affects: linux-nvidia-6.17 (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2143032

Title:
  Add CXL Type-2 device support and CXL RAS error handling

