Public bug reported:

[ Impact ]

During the keynote presentation at Microsoft Ignite, 2024, Microsoft announced the private preview of a new series of AI-oriented Azure virtual machines based on the NVIDIA GB200 superchip:

https://techcommunity.microsoft.com/blog/azurehighperformancecomputingblog/microsoft-adopts-nvidia-blackwell-to-power-the-next-frontier-of-ai-supercomputin/4303541

As part of Microsoft’s and Canonical’s joint effort to ensure that the latest Ubuntu LTS release supports the most salient features of this new virtual machine series, Canonical would like to back-port all post-24.04 changes to the Microsoft Azure Network Adapter (MANA) provider in rdma-core from ubuntu/devel to ubuntu/noble.

[ Test Plan ]

```
# Server terminal
az group create --name mana --location westeurope
az network vnet create --resource-group mana --name mana-vnet --address-prefix 10.0.0.0/16 --subnet-name mana-subnet --subnet-prefix 10.0.0.0/24
az network nsg create --name mana-nsg --resource-group mana
az network nsg rule create --name mana-nsg-rule --nsg-name mana-nsg --resource-group mana --access Allow --destination-port-ranges 22 --protocol Tcp --priority 100
az network public-ip create --resource-group mana --name mana-public-ip
az network nic create --resource-group mana --name mana-nic --vnet-name mana-vnet --subnet mana-subnet --accelerated-networking true --network-security-group mana-nsg --public-ip-address mana-public-ip
path_to_ssh_public_key=...
az vm create --resource-group mana --name mana-virtual-machine --image canonical:ubuntu-24_04-lts:server-arm64:latest --size Standard_D2ps_v6 --admin-username ubuntu --ssh-key-values $path_to_ssh_public_key --nics mana-nic
machine_ip_address=$(az network public-ip show --name mana-public-ip --resource-group mana --query ipAddress | tr -d '"')
path_to_ssh_private_key=...
ssh -i $path_to_ssh_private_key ubuntu@$machine_ip_address
## Enable proposed archive: https://wiki.ubuntu.com/Testing/EnableProposed
## rdma-core is the source of the rdmacm-utils package
sudo apt-add-repository -y universe && sudo apt install -y linux-azure-nvidia rdmacm-utils
installed_kernel_version=$(uname -r)
sudo apt remove -y linux-image-$installed_kernel_version linux-modules-$installed_kernel_version # DO NOT abort kernel removal when prompted
sudo reboot
ssh -i $path_to_ssh_private_key ubuntu@$machine_ip_address
rping -s -C 10 -v

# Client terminal
az network nic show --name mana-nic --resource-group mana --query 'ipConfigurations[0].privateIPAddress' | tr -d '"' # Note for use in rping command below
path_to_ssh_private_key=...
machine_ip_address=$(az network public-ip show --name mana-public-ip --resource-group mana --query ipAddress | tr -d '"')
ssh -i $path_to_ssh_private_key ubuntu@$machine_ip_address
rping -c -a $machine_private_ip_address -C 10 -v # Use MANA server private IP address obtained above
```

[ Where problems could occur ]

A regression could adversely impact remote direct memory access (RDMA) through one or more non-MANA providers (kernel drivers), potentially even preventing RDMA altogether.

[ Other Info ]

The second point under https://documentation.ubuntu.com/sru/en/latest/reference/requirements/#other-safe-cases describes the changes that Canonical is seeking to back-port to ubuntu/noble. As such, the changes, while representing new features, appear to qualify for SRU.

The patches associated with the SRU request only touch the following files, all of which are MANA-specific:

  providers/mana/*
  kernel-headers/rdma/mana-abi.h

Furthermore, the patches encapsulate all the changes to those files since the Ubuntu 24.04 release; there are no residual changes to those files that the patches do not cover.
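
A reviewer could spot-check the file-scope claim mechanically. The following is a minimal sketch, not part of the SRU itself: `non_mana_paths` is a hypothetical helper name, and it assumes the list of changed upstream paths is supplied on standard input, one per line. It prints every path that falls outside the two MANA-specific locations listed above and returns a non-zero status if it finds any.

```shell
# Hypothetical helper (not part of rdma-core or the SRU tooling).
# Reads file paths on stdin, one per line, and prints each path that is
# NOT MANA-specific. Returns 0 if every path is MANA-specific, 1 otherwise.
non_mana_paths() {
    rc=0
    while IFS= read -r path; do
        case "$path" in
            # The two locations named in this bug report.
            providers/mana/*|kernel-headers/rdma/mana-abi.h) ;;
            # Ignore blank lines.
            "") ;;
            # Anything else is outside the stated scope.
            *) printf '%s\n' "$path"; rc=1 ;;
        esac
    done
    return "$rc"
}

# Possible use inside an rdma-core checkout. Both variables are placeholders
# for the base and back-ported revisions, which this report does not name:
#   git diff --name-only "$base_ref" "$backport_ref" | non_mana_paths
```

Empty output and a zero exit status would be consistent with the statement that only MANA-specific files are touched. Packaging files such as those under debian/ would be reported by this sketch and would need to be judged separately.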

** Affects: rdma-core (Ubuntu)
     Importance: Undecided
         Status: New

https://bugs.launchpad.net/bugs/2100089

Title:
  rdma-core in latest Ubuntu LTS does not support Microsoft Azure Network Adapter
