> From: Bruce Richardson [mailto:bruce.richard...@intel.com] > Sent: Thursday, 1 May 2025 09.48 > > On Thu, May 01, 2025 at 09:05:32AM +0200, Morten Brørup wrote: > > > From: Stephen Hemminger [mailto:step...@networkplumber.org] > > > Sent: Wednesday, 30 April 2025 21.45 > > > > > > On Wed, 30 Apr 2025 20:40:52 +0200 > > > Morten Brørup <m...@smartsharesystems.com> wrote: > > > > > > > There are only two thread priorities in the enum > rte_thread_priority: > > > Normal and Real-time Critical. > > > > > > > > I would like to poll ethdev counters, collect garbage and perform > > > other jitter non-sensitive tasks in a control thread with lower > > > priority than my ordinary control threads, so it will be preempted > by > > > any work ready for my ordinary control threads. > > > > > > > > Which DPDK API am I supposed to use to assign this below-normal > > > priority to my "background" control thread? > > > > > > > > Or: Aren't we missing a priority like Linux' SCHED_BATCH? > > > > > > Short answer: if your application is running on Linux, only ever > use > > > Normal. > > > DPDK applications usually never sleep and this will starve the OS > and > > > cause instability. > > > > I was asking for the opposite of Critical priority. > > > > For the sake of discussion, imagine a (registered or unregistered) > non-EAL thread doing something like this: > > loop { > > poll_counters(); // 1 ms execution time > > sleep(99 ms); > > } > > > > With normal scheduling priority, it will rack up a lot of scheduling > credits during sleep(), so it might not be preempted by other threads > while executing poll_counters(). > > > > But if some other thread (on the same CPU core) changes state from > Sleeping to Runnable, I want it to preempt the counter polling thread. > > This other thread could be a control plane application, e.g. a DNS > Server, which shouldn't suffer up to 1 ms scheduling lag if it becomes > Runnable the instant the counter polling thread started executing > poll_counters(). > > > > So I'm looking for a DPDK API to apply a "low priority" scheduling > policy, like SCHED_BATCH, to the counter polling thread. > > > > Does this need to be done in DPDK?
No, not really. > Unless you need to target Windows, would > using the standard Unix/Posix scheduling/pthread APIs directly not be > best, > rather than having us try to wrap all such things inside DPDK APIs? It probably would. That's how we do it today, anyway. :-) > I worry > about scope creep for such things, with us ending up wrapping a whole > bunch > of scheduling stuff into DPDK that we should not need to do. I'm mainly asking for academic reasons. I think the scope of my question was included into DPDK when it introduced the rte_thread_priority with RTE_THREAD_PRIORITY_NORMAL and RTE_THREAD_PRIORITY_REALTIME_CRITICAL. I know this is mainly related to the control plane, and thus not the most relevant thing for DPDK. But I think we need to offer something. Not only for applications, but drivers might want to run separate low-priority threads for background tasks, such as garbage collection, counter polling, or a link state machine. IMHO, the kernel scheduler is a much better choice than DPDK's non-preemptive "Service Cores" scheduler for many purposes.