Public bug reported:

[ Impact ]

During the keynote presentation at Microsoft Ignite, 2024, Microsoft
announced the private preview of a new series of AI-oriented Azure
virtual machines based on the NVIDIA GB200 superchip:
https://techcommunity.microsoft.com/blog/azurehighperformancecomputingblog/microsoft-
adopts-nvidia-blackwell-to-power-the-next-frontier-of-ai-
supercomputin/4303541 . As part of Microsoft’s and Canonical’s joint
effort to ensure that the latest Ubuntu LTS release supports the most
salient features of this new virtual machine series, Canonical would
like to back-port all post-24.04 changes to the Microsoft Azure Network
Adapter (MANA) provider in rdma-core from ubuntu/devel to ubuntu/noble.

[ Test Plan ]

```
# Server terminal

az group create --name mana --location westeurope

az network vnet create --resource-group mana --name mana-vnet --address-
prefix 10.0.0.0/16 --subnet-name mana-subnet --subnet-prefix 10.0.0.0/24

az network nsg create --name mana-nsg --resource-group mana

az network nsg rule create --name mana-nsg-rule --nsg-name mana-nsg
--resource-group mana --access Allow --destination-port-ranges 22
--protocol Tcp --priority 100

az network public-ip create --resource-group mana --name mana-public-ip

az network nic create --resource-group mana --name mana-nic --vnet-name
mana-vnet --subnet mana-subnet --accelerated-networking true --network-
security-group mana-nsg --public-ip-address mana-public-ip

path_to_ssh_public_key=...

az vm create --resource-group mana --name mana-virtual-machine --image
canonical:ubuntu-24_04-lts:server-arm64:latest --size Standard_D2ps_v6
--admin-username ubuntu --ssh-key-values $path_to_ssh_public_key --nics
mana-nic

machine_ip_address=$(az network public-ip show --name mana-public-ip
--resource-group mana --query ipAddress | tr -d '"')

path_to_ssh_private_key=...

ssh -i $path_to_ssh_private_key ubuntu@$machine_ip_address

## Enable proposed archive:
https://wiki.ubuntu.com/Testing/EnableProposed

## rdma-core is the source of the rdmacm-utils package
sudo apt-add-repository -y universe && sudo apt install -y linux-azure-nvidia 
rdmacm-utils

installed_kernel_version=$(uname -r)

sudo apt remove -y linux-image-$installed_kernel_version 
linux-modules-$installed_kernel_version # DO NOT abort kernel removal when p
rompted

sudo reboot

ssh -i $path_to_ssh_private_key ubuntu@$machine_ip_address

rping -s -C 10 -v

# Client terminal

az network nic show --name mana-nic --resource-group mana --query
'ipConfigurations[0].privateIPAddress' | tr -d '"' # Note for use in
rping command below

path_to_ssh_private_key=...

machine_ip_address=$(az network public-ip show --name mana-public-ip
--resource-group mana --query ipAddress | tr -d '"')

ssh -i $path_to_ssh_private_key ubuntu@$machine_ip_address

rping -c -a $machine_private_ip_address -C 10 -v # Use MANA server private IP 
address obtained above
```

[ Where problems could occur ]

A regression could adversely impact remote direct memory access (RDMA)
through one or more non-MANA providers (kernel drivers), potentially
even preventing RDMA altogether.

[ Other Info ]

The second point under
https://documentation.ubuntu.com/sru/en/latest/reference/requirements/#other-
safe-cases describes the changes that Canonical is seeking to back-port
to ubuntu/noble. As such, the changes, while representing new features,
appear to qualify for SRU.

The patches associated with the SRU request only touch the following
files, all of which are MANA-specific:

providers/mana/*
kernel-headers/rdma/mana-abi.h

Furthermore, the patches encapsulate all the changes to those files
since the Ubuntu 24.04 release; there are no residual changes to those
files that the patches do not cover.

** Affects: rdma-core (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2100089

Title:
  rdma-core in latest Ubuntu LTS does not support Microsoft Azure
  Network Adapter

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/rdma-core/+bug/2100089/+subscriptions


-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to