On Wed, 19 May 2021 15:14:38 +0000
"Van Haaren, Harry" <harry.van.haa...@intel.com> wrote:

> > -----Original Message-----
> > From: users <users-boun...@dpdk.org> On Behalf Of Filip Janiszewski
> > Sent: Wednesday, May 19, 2021 2:10 PM
> > To: users@dpdk.org
> > Subject: [dpdk-users] Performance of rte_eth_stats_get
> > 
> > Hi,
> > 
> > Is it safe to call rte_eth_stats_get while capturing from the port?
> > 
> > I'm mostly concerned about performance, if rte_eth_stats_get will in any
> > way impact the port performance, in the application I plan to call the
> > function from a thread that is not directly involved in the capture,
> > there's another worker responsible for rx bursting, but I wonder if the
> > NIC might get upset if I call it too frequently (say 10 times per
> > second) and potentially cause some performance issues.
> > 
> > The question is really Nic agnostic, but if the Nic vendor is actually
> > relevant then I'm running Intel 700 series nic and Mellanox ConnectX-4/5.  
> 
> To understand what really goes on when getting stats, it might help to list 
> the
> steps involved in getting statistics from the NIC HW.
> 
> 1) CPU sends an MMIO read (Memory Mapped I/O, aka, sometimes referred
> to as a "pci read") to the NIC.
> 2) The PCI bus has to handle extra TLPs (pci transactions) to satisfy read
> 3) NIC has to send a reply based on accessing its internal counters
> 4) CPU gets a result from the PCI read.
> 
> Notice how elegantly this whole process is abstracted from SW? In code, 
> reading
> a stat value is just dereferencing a pointer that is mapped to the NIC HW 
> address.
> In practice from a CPU performance point of view, doing an MMIO-read is one of
> the slowest things you can do. You say the stats-reads are occurring from a 
> thread
> that is not handling rx/datapath, so perhaps the CPU cycle cost itself isn't 
> a concern.
> 
> Do note however, that when reading a full set of extended stats from the NIC, 
> there
> could be many 10's to 100's of MMIO reads (depending on the statistics 
> requested,
> and how the PMD itself is implemented to handle stats updates).
> 
> The PCI bus does become more busy with reads to the NIC HW when doing lots of
> statistic updates, so there is some more contention/activity to be expected 
> there.
> The PCM tool can be very useful to see MMIO traffic, you could measure how 
> many
> extra PCI transactions are occurring due to reading stats every X ms?
> https://github.com/opcm/pcm
> 
> I can recommend measuring pkt latency/jitter as a histogram, as then outliers 
> in performance
> can be identified. If you specifically want to identify if these are due 
> stats reads, compare
> with a "no stats reads" latency/jitter histogram, and graphically see the 
> impact.
> In the end if it doesn't affect packet latency/jitter, then it has no impact 
> right?
> 
> Ultimately, I can't give a generic answer - best steps are to measure 
> carefully and find out!
> 
> > Thanks  
> 
> Hope the above helps and doesn't add confusion :)  Regards, -Harry

Many drivers require transactions with the firmware via mailbox.
And that transaction needs a spin wait for the shared area.

Reply via email to