[lustre-discuss] kernel threads for rpcs in flight

Anna Fuchs via lustre-discuss Sun, 28 Apr 2024 15:55:25 -0700

Hello everyone.

The setting |max_rpcs_in_flight| affects, among other things, how manythreads can be spawned simultaneously for processing the RPCs, right?In tests where the network is clearly a bottleneck, this setting hasalmost no effect - the network cannot keep up with processing the data,there is not so much to do in parallel.With a faster network, the stats show higher CPU utilization ondifferent cores (at least on the client).

What is the exact mechanism by which it is decided that a kernel threadis spawned for processing a bulk? Is there an RPC queue with timings orsomething similar?Is it in any way predictable or calculable how many threads a specificworkload will require (spawn if possible) given the data rates from thenetwork and storage devices?

With |max_||rpcs_in_flight = 1|, multiple cores are loaded, presumablyalternately, but the statistics are too inaccurate to capture this.The distribution of threads to cores is regulated by the Linux kernel,right? Does anyone have experience with what happens when all CPUs areunder full load with the application or something else?Do the Lustre threads suffer? Is there a prioritization of the Lustrethreads over other tasks?

Are there readily available statistics or tools for this scenario?

Thanks a lot
Anna
--
Anna Fuchs
Universität Hamburg
Department of Computer Science
Research Group Scientific Computing

Bundesstraße 45a
D-20146 Hamburg

_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

[lustre-discuss] kernel threads for rpcs in flight

Reply via email to