Lets talk about fibers

Liran Zvibel via Digitalmars-d Wed, 03 Jun 2015 11:36:31 -0700

Hi,

We discussed (not) moving fibers between threads at DConf last week, and later it was discussed in the announce group. I think this matter is important enough to get a thread of its own.

Software fibers/coroutines were created to make asynchronous programming using a Reactor (or another "event loop i/o scheduler") more seamless.

For those unaware of the Reactor pattern, I advise reading [http://en.wikipedia.org/wiki/Reactor_pattern ; http://www.dre.vanderbilt.edu/~schmidt/PDF/reactor-siemens.pdf ], and for some perspective on how other languages have addressed this I recommend watching Guido van Rossum's talk about asyncio and Python: https://www.youtube.com/watch?v=aurOB4qYuFM

The Reactor pattern is a long-standing, widely accepted way to achieve low-latency async IO operations, and it fortunately became famous thanks to the Web and the C10k requirement/problem. Using the Reactor is the most efficient way to leverage current CPU architectures to perform lots of IO, for many reasons outside of this scope.

Another very important quality of a reactor-based approach is that, since all event handlers just serialize on a single IO scheduler ("the reactor") on each thread, if designed correctly programmers don't have to think about concurrency or care about code races.

Another thing to note: when using the reactor pattern you have to make sure that no event handler ever blocks. Since this is a non-preemptive model, once an event handler blocks, the other event handlers cannot run, which starves them and the clients on the other side of the network. Reactor implementations usually detect and report when an event handler takes too long before giving up control (the threshold depends on the application, but should be in the usec range on current hw).
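To make the starvation point concrete, here is a minimal sketch in Python (chosen only for brevity). The `Reactor` class, its `budget_seconds` threshold and the handler names are all made up for illustration and do not come from any real library. It shows a single-threaded loop that runs handlers one at a time and records the ones that held control for too long.

```python
import time
from collections import deque


class Reactor:
    """Toy single-threaded event loop (hypothetical, for illustration only)."""

    def __init__(self, budget_seconds):
        self.budget_seconds = budget_seconds  # assumed per-handler time budget
        self.ready = deque()                  # handlers waiting to run
        self.slow = []                        # (name, elapsed) of over-budget handlers

    def register(self, name, handler):
        self.ready.append((name, handler))

    def run(self):
        while self.ready:
            name, handler = self.ready.popleft()
            start = time.perf_counter()
            handler()  # non-preemptive: nothing else runs until this returns
            elapsed = time.perf_counter() - start
            if elapsed > self.budget_seconds:
                self.slow.append((name, elapsed))


def demo():
    reactor = Reactor(budget_seconds=0.02)
    order = []

    def blocking_handler():
        time.sleep(0.1)  # stands in for a blocking call inside a handler
        order.append("blocking")

    reactor.register("fast", lambda: order.append("fast"))
    reactor.register("blocking", blocking_handler)
    reactor.register("starved", lambda: order.append("starved"))
    reactor.run()
    return order, [name for name, _ in reactor.slow]
```

In this sketch the "starved" handler cannot start until the blocking one returns, and only the blocking handler ends up in the slow list. The budget here is in milliseconds only to keep the demo stable; it is not a recommendation.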

The downside of the reactor pattern is (or used to be) that the programmer has to manually keep the state/context of how the event handler worked. Since each "logical" operation was composed of many i/o transactions (some NW protocol to keep track of, maybe accessing a networked DB for some data, reading/writing local/remote files, etc.), the reactor would also keep a context for each callback and IO event, and the programmer had to update the context, keep registering new event handlers manually for all the extra I/O transactions, and in many cases change the callback registration.

This downside means that it's more difficult to program for a Reactor model, but since programmers don't have to think about races and concurrency issues (and then debug them...), in our experience it is still more efficient to program than general-purpose threads if you care about correctness/coherency.

One way to mitigate this complexity was the Proactor pattern -- implementing higher-level async IO services over the reactor, thus sparing the programmer a lot of the low-level context headaches.
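The manual bookkeeping described above can be sketched roughly as follows, again in Python. `FakeReactor`, `async_read`, `RequestContext` and the "socket"/"db" sources are hypothetical stand-ins, and the "I/O" completes immediately with canned data. The point is only that one logical operation is split across callbacks, with its state carried in a hand-written context object.

```python
from collections import deque


class FakeReactor:
    """Stand-in for an I/O scheduler: 'completes' each request on a later loop turn."""

    def __init__(self):
        self.pending = deque()

    def async_read(self, source, callback, context):
        # A real reactor would wait for readiness; here the data is canned.
        self.pending.append((callback, context, f"data-from-{source}"))

    def run(self):
        while self.pending:
            callback, context, data = self.pending.popleft()
            callback(context, data)


class RequestContext:
    """State the programmer has to carry by hand between callbacks."""

    def __init__(self, reactor, request_id):
        self.reactor = reactor
        self.request_id = request_id
        self.header = None
        self.result = None


def on_header(ctx, data):
    ctx.header = data
    # Manually register the next step of the same logical operation.
    ctx.reactor.async_read("db", on_db_reply, ctx)


def on_db_reply(ctx, data):
    ctx.result = (ctx.header, data)


def handle_request(reactor, request_id):
    ctx = RequestContext(reactor, request_id)
    reactor.async_read("socket", on_header, ctx)
    return ctx
```

Even with just two I/O steps the logic is spread over three functions and a context class, and every additional step means another callback and another field.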


Up until now I did not say anything about Fibers/coroutines.

What Fibers bring to the table is the ability to program within the reactor model without having to manually keep a context that is separate from the program logic, and without the requirement to manually register and re-register callbacks for different IO events.

D's Fibers allowed us to create an async IO library with support for network/file/disk operations and higher-level conditions (waiters, barriers, etc.) that allows the programmer to write code as if it runs in its own thread (almost: sometimes fibers are explicitly "spawned", i.e. added to the reactor, and fiber conditions are slightly different from spawning and joining threads) without paying the huge correctness/coherence and performance penalties of the threading model.
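As an illustration of the difference, here is a small sketch that uses Python generators as a stand-in for fibers (this is not D's Fiber API, and `FiberReactor`, `spawn` and the "socket"/"db" sources are invented for the example). The logical operation reads top to bottom, and its state lives in ordinary local variables instead of a separate context object.

```python
from collections import deque


class FiberReactor:
    """Toy scheduler: each 'fiber' is a generator that yields an I/O request."""

    def __init__(self):
        self.ready = deque()  # (generator, value to send in on resume)

    def spawn(self, fiber):
        self.ready.append((fiber, None))

    def run(self):
        while self.ready:
            fiber, value = self.ready.popleft()
            try:
                source = fiber.send(value)  # run the fiber until its next yield
            except StopIteration:
                continue  # this fiber has finished
            # Pretend the I/O completed; resume the fiber with its data later.
            self.ready.append((fiber, f"data-from-{source}"))


def handle_request(results, request_id):
    # Reads like blocking code, but each yield hands control back to the reactor.
    header = yield "socket"
    reply = yield "db"
    results[request_id] = (header, reply)
```

Spawning two such fibers and running the reactor interleaves them on one thread, with no callbacks to register by hand.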

There are two main reasons why it does not make sense to move fibers between threads:

1. You'll start having concurrency issues. Let's assume we have a main fiber that received some request, and it spawns 3 fibers looking into different DBs to get some info and update an array with the data. The array will probably be on the stack of the first fiber. If fibers don't move between threads, there is nothing to worry about (as expected by the model). If you start moving fibers across threads, you now have to guard this array to make sure it stays coherent. This is a simple example, but it shows that you're "losing" one of the biggest selling points of the whole reactor-based model.

2. Fibers and reactor-based IO work well (read: make sense) when you have lots of concurrent, very small transactions (similar to the Web C10k problem or a storage machine). In this case, if one of the threads has more capacity than the rest, the IO scheduler ("reactor") will just make sure to spawn the new fibers that accept new transactions in that thread. If you don't have a situation where balancing can be done by placing new requests in the right place, then you should probably not use the reactor model, but a different one that suits your application better.

Currently we can spawn another reactor to take more load, but the load is balanced statically at a system-wide level. On previous projects we had several reactors running on different threads and providing very different functionality (with different handlers, naturally).

We never got to a situation where moving a fiber between threads made any sense.
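The array example from the first point can be sketched like this, with Python generators again standing in for fibers. The scheduler `run_all` and the DB names are hypothetical. All fibers run cooperatively on one thread, so the three children write into a list that is local to the main fiber without any lock.

```python
from collections import deque


def run_all(root):
    """Round-robin over generator-based fibers on ONE thread until all finish.

    A fiber may yield a list of new fibers to spawn, or None to just give up control.
    """
    ready = deque([root])
    while ready:
        fiber = ready.popleft()
        try:
            spawned = next(fiber)
        except StopIteration:
            continue
        ready.extend(spawned or [])
        ready.append(fiber)


def query_db(db_name, slot, info):
    yield  # pretend to wait for the DB reply
    info[slot] = f"reply-from-{db_name}"  # plain write: no lock needed here


def main_fiber(out):
    info = [None, None, None]  # lives in the main fiber's own frame
    names = ["users", "orders", "billing"]  # made-up DB names
    yield [query_db(name, i, info) for i, name in enumerate(names)]
    while None in info:  # wait cooperatively for the children
        yield
    out.extend(info)
```

The writes are safe only because a fiber can be interrupted solely at a yield. If the children could run on other threads, `info` would need a lock or some other guard.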

As we see, there is nothing to gain and lots to lose by moving fibers between threads.

Now, if we want to make sure fibers are well supported in D, there are several other things we should do:

1. Implement a good async IO library that supports fiber-based programming. I don't know Vibe.d very well (i.e. at all); maybe we (Weka.IO) can help review it and suggest ways to make it into a general async IO library (we have over 15 years of experience developing with the reactor model in many environments).

2. Add better compiler support. The one problem with fibers is that upon creation you have to know the stack size for that fiber. Different functions will create different stack depths. It is very convenient to use the stack to hold all objects (recall Walter's first-day talk, for example), and it can be used as a very convenient way to "garbage collect" all resources added during the run of that fiber, but currently we don't leverage it to the max since we don't have a good way to know/limit the amount of memory used this way.

If the compiler were able to analyze stack usage by functions (recursively) and give us hints regarding the upper bounds of stack usage, we would be able to use the stack more aggressively and utilize memory much better.

Also -- I think such static analysis would be a big selling point for D for systems like ours.
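To give a feel for the kind of analysis meant in the second point, here is a very rough sketch in Python. It assumes the compiler could export a per-function frame size and a call graph; the function names and byte counts below are invented, and nothing here describes what any D compiler actually does. The upper bound for a function is its own frame plus the deepest bound among its callees, and recursion is reported as having no static bound.

```python
def max_stack_usage(func, frame_size, calls, _visiting=None):
    """Upper bound on stack bytes used by `func` and everything it calls.

    frame_size: dict mapping function name -> bytes of its own frame (assumed input)
    calls:      dict mapping function name -> names of functions it may call
    """
    visiting = set() if _visiting is None else _visiting
    if func in visiting:
        raise ValueError(f"recursion through {func!r}: no static bound")
    visiting.add(func)
    deepest = max(
        (max_stack_usage(callee, frame_size, calls, visiting)
         for callee in calls.get(func, ())),
        default=0,
    )
    visiting.remove(func)
    return frame_size[func] + deepest


# Hypothetical numbers, purely for illustration.
FRAME_SIZE = {"handle_request": 128, "parse": 256, "query": 512, "log": 64}
CALLS = {"handle_request": ["parse", "query"], "parse": ["log"], "query": ["log"]}
```

With these made-up inputs, `max_stack_usage("handle_request", FRAME_SIZE, CALLS)` returns 704 (128 + 512 + 64). A fiber running only this handler could then be created with a stack just above that bound instead of a generous guess.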

I think now everything is written down, and we can move the discussion here.


Liran.
