[Bug 2055222] Re: ucx library fails with Genoa CPUs and InfiniBand

2024-04-04 Thread Quesar
This bug report includes the solution. Can someone please acknowledge and respond to it? This is an easy fix at this point but it has been ignored for over a month already. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu.

[Bug 2055222] Re: ucx library fails with Genoa CPUs and InfiniBand

2024-03-29 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users. ** Changed in: ucx (Ubuntu) Status: New => Confirmed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2055222 Title: ucx

[Bug 2055222] Re: ucx library fails with Genoa CPUs and InfiniBand

2024-03-18 Thread Quesar
I reproduced this on a Sapphire Rapids cluster now too, and the same patches fixed it. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2055222 Title: ucx library fails with Genoa CPUs and InfiniBand

[Bug 2055222] Re: ucx library fails with Genoa CPUs and InfiniBand

2024-03-07 Thread Quesar
Can these patches be added to the ucx package please? This issue is affecting all Genoa clusters with Infiniband. Here's the type of error it causes: root@rschhpc210:~# ucx_perftest [1698428074.879303] [rschhpc210:13557:0] perftest.c:899 UCX WARN CPU affinity is not set (bound to 384 cpus).