** Description changed:

- Our NCCL software uses the sysfs to populate the attached topo.xml file.
- Several of the entries should report "32 GT/s", but they're saying
- "Unknown speed" instead. For instance:
+ [Impact]
  
- <pci busid="0000:0c:00.0" class="0x020700" vendor="0x15b3"
- device="0x1021" subsystem_vendor="0x15b3" subsystem_device="0x0082"
- link_speed="Unknown speed" link_width="16">
+ NVIDIA Collective Communication Library software uses sysfs to report
+ performance statistics.  Users have reported entries showing "Unknown
+ speed" when they should be reporting "32 GT/s".
  
- The 5.4 kernel is missing the following commit:
- https://lore.kernel.org/all/1581937984-40353-2-git-send-email-yangyic...@hisilicon.com/
+ Example:
+ "<pci busid="0000:0c:00.0" class="0x020700" vendor="0x15b3" device="0x1021" subsystem_vendor="0x15b3" subsystem_device="0x0082" link_speed="Unknown speed" link_width="16">"
+ 
+ PCIe 5.0 which supports 32 GT/s is available in the 5.4 kernel, but the
+ patches for properly reporting speeds in sysfs are missing.  The
+ following upstream patches add the reporting capability.
+ 
+ https://lore.kernel.org/linux-pci/20200229030706.17835-1-helg...@kernel.org/
+ 
+ 
+ [Test Plan]
+ 
+ Testing these speeds requires special hardware. A Test kernel with these
+ patches applied was provided to the customer and they confirmed the
+ proper numbers are reported.
+ 
+ 
+ [Where problems could occur]
+ 
+ Changes are for reporting info so chance of problems should be low.  If
+ a problem did occur it would be with sysfs or pcie driver.

** Changed in: linux (Ubuntu Focal)
       Status: Incomplete => In Progress

** Changed in: linux (Ubuntu)
       Status: Incomplete => In Progress

** Changed in: linux (Ubuntu)
   Importance: Undecided => High

** Changed in: linux (Ubuntu Focal)
   Importance: Undecided => High

** Description changed:

+ SRU Justification
+ 
  [Impact]
  
  NVIDIA Collective Communication Library software uses sysfs to report
  performance statistics.  Users have reported entries showing "Unknown
  speed" when they should be reporting "32 GT/s".
  
  Example:
  "<pci busid="0000:0c:00.0" class="0x020700" vendor="0x15b3" device="0x1021" subsystem_vendor="0x15b3" subsystem_device="0x0082" link_speed="Unknown speed" link_width="16">"
  
  PCIe 5.0 which supports 32 GT/s is available in the 5.4 kernel, but the
  patches for properly reporting speeds in sysfs are missing.  The
  following upstream patches add the reporting capability.
  
  https://lore.kernel.org/linux-pci/20200229030706.17835-1-helg...@kernel.org/
  
- 
  [Test Plan]
  
  Testing these speeds requires special hardware. A Test kernel with these
  patches applied was provided to the customer and they confirmed the
  proper numbers are reported.
  
- 
  [Where problems could occur]
  
  Changes are for reporting info so chance of problems should be low.  If
  a problem did occur it would be with sysfs or pcie driver.

** Description changed:

  SRU Justification
  
  [Impact]
  
  NVIDIA Collective Communication Library software uses sysfs to report
  performance statistics.  Users have reported entries showing "Unknown
  speed" when they should be reporting "32 GT/s".
  
  Example:
  "<pci busid="0000:0c:00.0" class="0x020700" vendor="0x15b3" device="0x1021" subsystem_vendor="0x15b3" subsystem_device="0x0082" link_speed="Unknown speed" link_width="16">"
  
- PCIe 5.0 which supports 32 GT/s is available in the 5.4 kernel, but the
+ PCIe 5.0 supports 32 GT/s and is available in the 5.4 kernel, but the
  patches for properly reporting speeds in sysfs are missing.  The
  following upstream patches add the reporting capability.
  
  https://lore.kernel.org/linux-pci/20200229030706.17835-1-helg...@kernel.org/
  
  [Test Plan]
  
  Testing these speeds requires special hardware. A Test kernel with these
  patches applied was provided to the customer and they confirmed the
  proper numbers are reported.
  
  [Where problems could occur]
  
  Changes are for reporting info so chance of problems should be low.  If
  a problem did occur it would be with sysfs or pcie driver.

** Changed in: linux (Ubuntu)
   Importance: High => Medium

** Changed in: linux (Ubuntu Focal)
   Importance: High => Medium

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1970798

Title:
  32 GT/s PCI link speeds reporting "Unknown speed" in sysfs

Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Focal:
  In Progress

Bug description:
  SRU Justification

  [Impact]

  The NVIDIA Collective Communications Library (NCCL) uses sysfs to
  populate its topology file (topo.xml).  Users have reported link_speed
  entries showing "Unknown speed" where "32 GT/s" is expected.

  Example:
  "<pci busid="0000:0c:00.0" class="0x020700" vendor="0x15b3" device="0x1021" subsystem_vendor="0x15b3" subsystem_device="0x0082" link_speed="Unknown speed" link_width="16">"

  PCIe 5.0 supports 32 GT/s and is available in the 5.4 kernel, but the
  patches for properly reporting speeds in sysfs are missing.  The
  following upstream patches add the reporting capability.

  https://lore.kernel.org/linux-pci/20200229030706.17835-1-helg...@kernel.org/
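
  As a rough illustration of the symptom (not the kernel code itself),
  the sketch below decodes the Current Link Speed field of the PCIe Link
  Status register with two lookup tables.  The encoding 5 = 32 GT/s
  comes from the PCIe specification; the table names, the function name
  and the exact strings are assumptions made for this example.  A table
  that stops at 16 GT/s falls through to "Unknown speed" on a Gen5
  link, which matches the reported output.

```python
# Encodings of the "Current Link Speed" field (Link Status register, bits 3:0).
# Table names and strings are illustrative assumptions, not kernel identifiers.
SPEEDS_WITHOUT_GEN5 = {
    1: "2.5 GT/s",
    2: "5 GT/s",
    3: "8 GT/s",
    4: "16 GT/s",
}

# Same table with the 32 GT/s encoding added.
SPEEDS_WITH_GEN5 = {**SPEEDS_WITHOUT_GEN5, 5: "32 GT/s"}


def link_speed_string(lnksta, table):
    """Return the speed string for a raw Link Status value.

    Only bits 3:0 are used; any encoding missing from `table` is
    reported as "Unknown speed".
    """
    code = lnksta & 0xF
    return table.get(code, "Unknown speed")
```

  With a Link Status value of 0x0105 (speed code 5, width x16), the
  first table yields "Unknown speed" and the second yields "32 GT/s".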

  [Test Plan]

  Testing these speeds requires special hardware.  A test kernel with
  these patches applied was provided to the customer, and they confirmed
  that the proper numbers are reported.
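
  A check along these lines could be run on the affected hardware before
  and after installing the test kernel.  This is a minimal sketch, not
  the procedure the customer used: it assumes the standard
  current_link_speed and max_link_speed attributes under
  /sys/bus/pci/devices, and the function name is hypothetical.

```python
from pathlib import Path


def find_unknown_link_speeds(sysfs_root="/sys/bus/pci/devices"):
    """List (busid, attribute, value) for link speeds reported as unknown.

    Devices without PCIe link attributes (or with unreadable ones) are
    skipped silently.
    """
    results = []
    for dev in sorted(Path(sysfs_root).glob("*")):
        for attr in ("current_link_speed", "max_link_speed"):
            try:
                value = (dev / attr).read_text().strip()
            except OSError:
                continue  # attribute missing or not readable
            if value.startswith("Unknown"):
                results.append((dev.name, attr, value))
    return results
```

  An empty list from find_unknown_link_speeds() on a machine with
  32 GT/s links would be consistent with the fix being in place.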

  [Where problems could occur]

  The changes only affect how link speeds are reported, so the chance of
  problems should be low.  If a problem did occur, it would be in sysfs
  or the PCIe driver.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1970798/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
