Hello!
On 3/2/21 4:34 PM, Douglas Fischer wrote:
This is very good news!
I know you said "This is a ballpark guess", but I confess that I was a
little scared by the proportion of extra CPU usage (30 -> 48 minutes, +60%).
This depends a lot on what kind of load we are talking about. Generally,
if you are a big route server, then 98% of CPU time is probably eaten by
complex filters. I would estimate that this may end up anywhere between
-10% and +10% due to other structural changes. The parallelization
overhead would be minimal.
However, if you are a big route reflector, then you're constantly just
recomputing the best route while accessing the same table. In that case
we may get to the +60% estimate. Long story short: the more work you do
with one route, the less relative overhead you get.
Remember that BIRD is currently extremely well optimized for
single-threaded execution and some parts still heavily depend on being
executed that way. We chose to first allow parallel execution of those
parts that parallelize well, at the cost of adding some overhead to
other parts.
The most critical part of this is route export (from tables to
protocols), which is currently done synchronously right after route
import. We decided to decouple it in the multithreaded code, which
involves having a route export queue. Hence more memory stores and
loads, more cache misses, etc.
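To give a rough idea of what such a decoupling looks like -- this is not
the actual BIRD code, just a generic sketch with made-up names
(export_item, export_queue, ...) -- the importing thread pushes each
update into a queue and the exporting thread picks it up later:

  /* Hypothetical sketch of a decoupled route export queue; made-up names,
   * not the real BIRD data structures. */
  #include <pthread.h>
  #include <stdlib.h>

  struct export_item {
    struct export_item *next;
    void *route;                  /* the route update to be exported */
  };

  struct export_queue {
    pthread_mutex_t lock;
    pthread_cond_t nonempty;
    struct export_item *head, *tail;
  };

  static void export_queue_init(struct export_queue *q)
  {
    pthread_mutex_init(&q->lock, NULL);
    pthread_cond_init(&q->nonempty, NULL);
    q->head = q->tail = NULL;
  }

  /* Called by the importing thread after the route is stored in the table. */
  static void export_enqueue(struct export_queue *q, void *route)
  {
    struct export_item *it = malloc(sizeof *it);
    it->route = route;
    it->next = NULL;

    pthread_mutex_lock(&q->lock);
    if (q->tail)
      q->tail->next = it;
    else
      q->head = it;
    q->tail = it;
    pthread_cond_signal(&q->nonempty);
    pthread_mutex_unlock(&q->lock);
  }

  /* Called by the exporting thread; blocks until an update is available. */
  static void *export_dequeue(struct export_queue *q)
  {
    pthread_mutex_lock(&q->lock);
    while (!q->head)
      pthread_cond_wait(&q->nonempty, &q->lock);

    struct export_item *it = q->head;
    q->head = it->next;
    if (!q->head)
      q->tail = NULL;
    pthread_mutex_unlock(&q->lock);

    void *route = it->route;
    free(it);
    return route;
  }

Every update now goes through that extra store, load and wakeup, which
is exactly where the additional cache misses come from.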
Well … maybe the +60% is too much, reconsidering that guess. Let's hope
it's overestimated. I'd be more concerned about the memory usage. There
are some estimates of worst-case peak memory usage that can be even
+100% (for a short time). If we run into these problems in the real
world, we'd definitely have to implement algorithms to limit these
peaks, as swapping to disk is not desirable here at all. Anyway, this is
not today's problem; we first need to get to code that at least builds
and runs without spitting out one core file after another.
I also know that you said that the code is still "currently not
releasable", but I'm curious to know a little more about how this
multi-threading was handled.
Basically, one thread per receiving socket and one thread per exporting
channel, with some exceptions. One lock per protocol instance, one lock
per table. You can lock only one table and one protocol instance at a
time; the protocol goes first.
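A very rough sketch of that lock ordering -- made-up types, not the real
BIRD locking code:

  #include <pthread.h>

  struct proto  { pthread_mutex_t lock; /* ... protocol state ... */ };
  struct rtable { pthread_mutex_t lock; /* ... routing table ... */ };

  static void process_route(struct proto *p, struct rtable *t)
  {
    pthread_mutex_lock(&p->lock);    /* the protocol instance goes first */
    pthread_mutex_lock(&t->lock);    /* then (at most) one table */

    /* ... import or export the route here ... */

    pthread_mutex_unlock(&t->lock);  /* release in reverse order */
    pthread_mutex_unlock(&p->lock);
  }

Keeping the order fixed (and holding at most one of each) is what rules
out lock-ordering deadlocks between threads.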
We'll publish more documentation; it's still WIP. For now, I'm just
answering a question to say "yes, we're going multithreaded and we're
actively working on it".
Just to illustrate:
Single-core CPU usage in BGP is known to be a problem for many engines
and vendors.
One of the vendors developed a "creative" way to distribute this load
across multiple cores.
As I understood it, they made a kind of CPU affinity per BGP peer:
each peer has its own BGP process, and that process is "semi-tied" to a
core.
And they created a mechanism to redistribute these affinities from time
to time, based on the number of BGP messages per second exchanged on
each peer.
If this turns out to be a problem, we'll consider it. For now, it just
seems that the most critical part is the route itself being propagated
through BIRD -- it should stay in one thread as long as possible, and
the threads should keep their CPUs (on a well-behaved system) unless
moved for a good reason.
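Just for illustration, the per-peer affinity mechanism you describe
boils down to something like this on Linux -- a sketch only, and again,
this is not what BIRD does:

  #define _GNU_SOURCE
  #include <pthread.h>
  #include <sched.h>

  /* Hypothetical helper: tie the thread handling one BGP peer to one core.
   * A rebalancer could call this periodically based on per-peer message rates. */
  static int pin_peer_thread(pthread_t peer_thread, int core)
  {
    cpu_set_t set;
    CPU_ZERO(&set);
    CPU_SET(core, &set);
    return pthread_setaffinity_np(peer_thread, sizeof(set), &set);
  }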
Maria
On Tue, Mar 2, 2021 at 10:13, Maria Matejka <[email protected]> wrote:
Hi!
On 3/1/21 1:26 PM, Marcelo Balbinot wrote:
>
> Hi, I already asked this question at some point,
> but I am curious about how it has evolved.
> About multi-thread support (multi-core CPU use):
> is this still a possibility?
Yes, it is. Be prepared that this will also raise memory usage (current
estimates are about >+10% memory) and overall CPU usage (compared to
single-threaded execution) due to the needed synchronization and buffers.
This means that if you now consume 20G of memory and 30 minutes of
single-core time to converge the main table on a rather big node, you're
going to consume, let's say, >22G of memory and 3 minutes of 16-core CPU
(summing to 48 minutes of CPU time). This is a ballpark guess, do not
take it too seriously. It may be better, it may be worse.
Anyway, there is some code (currently not releasable) that will get to a
preview release soon. We'll highly appreciate testing from any users
around. Stay tuned!
Maria
--
Douglas Fernando Fischer
Control and Automation Engineer