Public bug reported:

[Feature Description]
Starting with Ice Lake (ICL/ICX), the "rep movsb" copy instruction is
faster for short strings.

From the optimization guide:

3.7.6.1 Fast Short REP MOVSB
Beginning with processors based on Ice Lake Client microarchitecture,
REP MOVSB performance of short operations is enhanced. The enhancement
applies to string lengths between 1 and 128 bytes long. Support for
fast-short REP MOVSB is enumerated by the CPUID feature flag: CPUID
(EAX=7H, ECX=0H).EDX.FAST_SHORT_REP_MOVSB[bit 4] = 1. There is no change
in the REP STOS performance.

Linux could check for this CPUID bit and make the flag "fsrm" appear
in /proc/cpuinfo flags.

#define X86_FEATURE_FSRM (18*32+ 4) /* Fast short rep movsb */
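
As an illustration of the check described above, the CPUID bit can also be
probed from user space. The following is only a minimal sketch, assuming
GCC or Clang and their <cpuid.h> helper __get_cpuid_count(); the function
name cpu_has_fsrm() is hypothetical and is not kernel code. In the
(18*32 + 4) encoding, 18 is the feature word that holds CPUID leaf 7,
subleaf 0, register EDX, and 4 is the bit within that word.

```c
#if defined(__x86_64__) || defined(__i386__)
#include <cpuid.h>   /* GCC/Clang helper: __get_cpuid_count() */
#endif

/*
 * Hypothetical user-space helper (not a kernel API).
 * Returns 1 if CPUID.(EAX=7, ECX=0):EDX[bit 4] (FSRM) is set, else 0.
 */
static int cpu_has_fsrm(void)
{
#if defined(__x86_64__) || defined(__i386__)
    unsigned int eax = 0, ebx = 0, ecx = 0, edx = 0;

    /* __get_cpuid_count() returns 0 when leaf 7 is not supported. */
    if (!__get_cpuid_count(7, 0, &eax, &ebx, &ecx, &edx))
        return 0;

    return (int)((edx >> 4) & 1u);
#else
    /* Not an x86 build: the feature cannot be present. */
    return 0;
#endif
}
```

A caller could print the result next to the "fsrm" entry in
/proc/cpuinfo, once the kernel exposes it, to cross-check the two.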

At some future point copy functions could be modified to check for this
feature and have some alternate copy loop for short strings on CPUs that
do NOT have FSRM (benchmarking needed to see if this actually helps).
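
The dispatch idea in the paragraph above could look roughly like the
sketch below. It is a user-space illustration only, under several
assumptions: the names copy_rep_movsb(), copy_byte_loop(),
copy_short_aware() and SHORT_COPY_MAX are hypothetical, the 128-byte
threshold is simply taken from the range quoted from the optimization
guide, and the plain byte loop stands in for whatever alternate copy loop
benchmarking might favor. The caller supplies has_fsrm, for example from
a CPUID check done once at startup.

```c
#include <stddef.h>

#define SHORT_COPY_MAX 128   /* assumption: upper bound quoted in the guide */

/* Copy n bytes with REP MOVSB (GCC/Clang inline asm on x86 only). */
static void *copy_rep_movsb(void *dst, const void *src, size_t n)
{
    void *ret = dst;
#if defined(__x86_64__) || defined(__i386__)
    /* RDI = destination, RSI = source, RCX = count; DF is clear per the ABI. */
    __asm__ volatile("rep movsb"
                     : "+D"(dst), "+S"(src), "+c"(n)
                     :
                     : "memory");
#else
    /* Non-x86 build: fall back to a byte loop so the sketch still works. */
    unsigned char *d = dst;
    const unsigned char *s = src;
    while (n--)
        *d++ = *s++;
#endif
    return ret;
}

/* Placeholder alternate loop for short copies on CPUs without FSRM. */
static void *copy_byte_loop(void *dst, const void *src, size_t n)
{
    unsigned char *d = dst;
    const unsigned char *s = src;

    while (n--)
        *d++ = *s++;
    return dst;
}

/* Use the alternate loop only for short copies when FSRM is absent. */
static void *copy_short_aware(void *dst, const void *src, size_t n,
                              int has_fsrm)
{
    if (!has_fsrm && n <= SHORT_COPY_MAX)
        return copy_byte_loop(dst, src, n);
    return copy_rep_movsb(dst, src, n);
}
```

Whether the alternate path is actually faster on pre-FSRM parts is exactly
what the benchmarking mentioned above would need to establish.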

Target Kernel: 5.6
Target Release: 20.10

[HW/SW Information]
IceLake Server

[Business Justification]
performance

** Affects: intel
     Importance: Undecided
         Status: New

** Affects: ubuntu
     Importance: Undecided
         Status: New


** Tags: intel-kernel-20.10

** Also affects: ubuntu
   Importance: Undecided
       Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1883630

Title:
  [ICX]ICL/ICX Fast Short Rep Movs
