Hi Rusties,
Allow me to present the status of our ongoing quest to rewrite the task
scheduler, along with the major work items remaining. The results so far
are encouraging, but there is a very large amount of work left,
particularly regarding I/O. In addition to myself, we'll have two interns
working on these areas this summer, but we could still use more help.
This is an especially good opportunity to influence the way I/O works in
Rust. I'm hoping that we will cut over to the new scheduler in June, but
expect that crucial I/O-related work will continue for most of the year.
At the moment we have a multithreaded task scheduler that integrates
non-blocking TCP built on top of libuv. So far it uses a very basic
scheduling strategy that employs several contended locks, but most of
the components of the full algorithm are in place, just waiting to be
filled in. We expect that once we're done the entire scheduler will be
lock-free. Besides the aforementioned locking, it also allocates far too
much at the moment, but that is a limitation of the current
implementation, not of the design. As far
as the scheduler goes I have not run into any major surprises and still
expect it to be significantly more efficient than the current one. The
biggest concern about scheduling is that our requirements force our
scheduler to do more synchronization than specified by the work stealing
algorithm alone. Whereas the literature describes work stealing as only
synchronizing on the work stealing deque, we also have message passing
between schedulers and a mechanism to put individual schedulers that
can't find work to sleep and wake them later, both of which require
further synchronization. This expense can be mitigated in some cases,
but not all.
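
To make the extra synchronization concrete, here is a rough sketch of one
scheduler 'turn'. Every name is invented for illustration; it only shows
where synchronization beyond the deque enters the picture:

    // All names invented for illustration, not the real code.
    struct Task;
    enum Message { RunTask(Task), Shutdown }

    struct Scheduler {
        message_queue: Vec<Message>, // really a shared MPSC queue (synchronized)
        work_queue: Vec<Task>,       // really a work stealing deque (synchronized)
    }

    impl Scheduler {
        fn run_one_turn(&mut self) {
            // 1. Message passing between schedulers: synchronization beyond
            //    what the work stealing literature accounts for.
            if let Some(msg) = self.message_queue.pop() {
                match msg {
                    Message::RunTask(task) => return self.resume(task),
                    Message::Shutdown => return,
                }
            }
            // 2. The work stealing deque itself (later, a 'thief' phase that
            //    tries other schedulers' deques goes here too).
            if let Some(task) = self.work_queue.pop() {
                return self.resume(task);
            }
            // 3. No work anywhere: register on the shared sleeper list and
            //    sleep until another scheduler wakes us, a third source of
            //    synchronization.
        }

        fn resume(&mut self, _task: Task) { /* context switch elided */ }
    }

    fn main() {
        let mut sched = Scheduler { message_queue: Vec::new(), work_queue: Vec::new() };
        sched.run_one_turn();
    }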
As I've been working on the scheduler I have begun separating the task
and its services from the coroutine task scheduler with the intent that
we can have Rust tasks that are not green threads but instead regular
threads with no userspace scheduling overhead at all. This has ripple
effects throughout the standard library, particularly with the
concurrency primitives, and I don't expect this to reach feature parity
with green thread tasks for a long time, but removing the green thread
requirement lets us make an even stronger case for Rust being a true
'systems language'.
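
Roughly, the direction is that everything a task needs goes behind an
interface that can be backed either by the coroutine scheduler or by
plain OS threads. A sketch, with all names invented (this is not the
real interface):

    // Invented names; a sketch of the separation, not the real interface.
    trait Runtime {
        fn spawn(&self, f: Box<dyn FnOnce() + Send>);
        fn yield_now(&self);
    }

    // A native task is just an OS thread: no userspace scheduling at all.
    struct NativeRuntime;

    impl Runtime for NativeRuntime {
        fn spawn(&self, f: Box<dyn FnOnce() + Send>) {
            std::thread::spawn(f);
        }
        fn yield_now(&self) {
            std::thread::yield_now();
        }
    }

    // A green implementation of the same trait would hand `f` to the
    // coroutine scheduler instead.

    fn main() {
        let rt = NativeRuntime;
        rt.spawn(Box::new(|| println!("hello from a native task")));
        rt.yield_now();
        std::thread::sleep(std::time::Duration::from_millis(50)); // let it finish
    }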
Most of my work on the I/O stack has been in specifying the main I/O
traits and building up the multi-layer interface between the
public-facing I/O library and libuv. I think I've sufficiently proven
the strategy of using the scheduler to convert async I/O to sync I/O,
but there's a whole lot more to implement and there are a number of
outstanding design problems to solve. We've previously discussed here
how I/O should do [error handling]. The feedback in that thread was
great, but it is not yet reflected in the current implementation. I
have, however, introduced a `read_error` condition specifically for the
`read` method and all extensions that build upon it, though it is not
yet fleshed out.
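
The async-to-sync conversion itself is easier to show than to describe.
Here is a toy model with OS threads and channels standing in for the
real pieces: the 'event loop' thread plays the role of libuv running on
the scheduler, and where this model parks a thread, the real runtime
deschedules a green task:

    use std::sync::mpsc::{channel, Sender};
    use std::thread;

    fn main() {
        // Stand-in for libuv: receive read requests, complete them
        // asynchronously, and wake the requester through its reply channel.
        let (io_tx, io_rx) = channel::<Sender<Vec<u8>>>();
        thread::spawn(move || {
            for reply in io_rx {
                reply.send(b"bytes from the network".to_vec()).unwrap();
            }
        });

        // From the task's point of view this is a synchronous read: submit
        // the request, then block until the completion arrives.
        let (reply_tx, reply_rx) = channel();
        io_tx.send(reply_tx).unwrap();
        let data = reply_rx.recv().unwrap(); // the "task" blocks here
        assert_eq!(data, b"bytes from the network".to_vec());
    }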
What worries me the most about the entire endeavour is 'select'. We have
great need for some facility to wait on multiple types of events
(particularly I/O and ports) simultaneously, but the requirements can be
rather complex (detailed below). I am not sure that the old Unix
'select' function (as we used in pipes) is the best abstraction for
this, and I feel we need to do further research on this topic. I would like to
start prototyping something here soon.
I've previously done two experiments with microbenchmarks of TCP [read
performance] and single-threaded [scheduling performance] and claimed
that the results were encouraging. Of course things will change a lot as
we implement multi-threading and move on to better benchmarks. I'm
maintaining a selection of comparative [benchmarks] in an external repo
that are currently a bit out of date.
I don't know that I recommend using the new scheduler yet for purposes
other than scheduler development, but it can be turned on by setting the
RUST_NEWRT environment variable. At the moment this will set up a
single-threaded scheduler only but I'll soon convert this to a
multi-threaded scheduler. For simple programs you shouldn't see any
difference in execution, but some library features are still busted.
Last I checked, 95% of the run-pass tests succeeded with RUST_NEWRT set.
The [main issue] for the entire scheduler rewrite is #4419. It contains
a description of the design and links to other related topics.
[error handling]:
https://mail.mozilla.org/pipermail/rust-dev/2013-April/003746.html
[read performance]:
https://github.com/mozilla/rust/pull/6313#issuecomment-17577510
[scheduling performance]:
https://mail.mozilla.org/pipermail/rust-dev/2013-May/004127.html
[benchmarks]: https://github.com/brson/rust-sched-bench
[main issue]: https://github.com/mozilla/rust/issues/4419
The remainder of this email describes the most significant remaining
work items.
## Add remaining implementations of I/O traits
core::rt::io defines several traits for synchronous I/O, including
Reader and Writer. We have a non-blocking TCP implementation in
core::rt::io::net::tcp, but that's it. We need non-blocking
implementations for files, UDP, and Unix pipes, as well as blocking
implementations of the same, based not on uv but on plain file
descriptors and sockets.
https://github.com/mozilla/rust/issues/4248
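
To give a flavor of the descriptor-based blocking implementations, here
is a sketch with the trait simplified and the libc bindings assumed as a
dependency; none of this is the in-tree code:

    use std::os::unix::io::RawFd;

    // Simplified stand-in for the core::rt::io Reader trait.
    trait Reader {
        fn read(&mut self, buf: &mut [u8]) -> Option<usize>;
    }

    // A blocking implementation over a plain file descriptor: no uv, no
    // scheduler, just the read(2) syscall.
    struct FileDesc {
        fd: RawFd,
    }

    impl Reader for FileDesc {
        fn read(&mut self, buf: &mut [u8]) -> Option<usize> {
            let n = unsafe {
                libc::read(self.fd, buf.as_mut_ptr() as *mut libc::c_void, buf.len())
            };
            // 0 is EOF and -1 is an error; the real design would raise the
            // read_error condition instead of folding errors into None.
            if n > 0 { Some(n as usize) } else { None }
        }
    }

    fn main() {
        let mut stdin = FileDesc { fd: 0 }; // fd 0 is standard input
        let mut buf = [0u8; 64];
        if let Some(n) = stdin.read(&mut buf) {
            println!("read {} bytes", n);
        }
    }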
## Design string encoding and decoding for Reader/Writer traits
How do we deal with string encoding in I/O? The existing implementation
uses extension methods on Readers and Writers, but this is not
sufficient because it doesn't maintain any state. We need a better
understanding of the requirements here, but the solution might involve
new decorator types.
https://github.com/mozilla/rust/issues/6164
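
For instance, one shape this could take is a decorator that owns the
decode state, so a multi-byte sequence split across two reads still
comes out right. A sketch, using the standard library's Read as a
stand-in for our Reader; all of the names are hypothetical:

    use std::io::Read;

    // Hypothetical decorator: wraps any byte reader and carries the bytes of
    // an incomplete UTF-8 sequence over to the next call.
    struct Utf8Reader<R> {
        inner: R,
        partial: Vec<u8>,
    }

    impl<R: Read> Utf8Reader<R> {
        fn read_str(&mut self) -> std::io::Result<String> {
            let mut bytes = std::mem::take(&mut self.partial);
            let mut chunk = [0u8; 4096];
            let n = self.inner.read(&mut chunk)?;
            bytes.extend_from_slice(&chunk[..n]);
            match std::str::from_utf8(&bytes) {
                Ok(s) => Ok(s.to_string()),
                Err(e) => {
                    // Keep the incomplete trailing sequence as state for the
                    // next call. (Real code must also distinguish a truncated
                    // sequence from genuinely invalid bytes.)
                    let valid = e.valid_up_to();
                    self.partial = bytes[valid..].to_vec();
                    Ok(std::str::from_utf8(&bytes[..valid]).unwrap().to_string())
                }
            }
        }
    }

    fn main() {
        // Simulate a previous read that ended mid-'é' (0xC3 0xA9):
        let mut r = Utf8Reader { inner: &b"\xA9 and more"[..], partial: vec![0xC3] };
        assert_eq!(r.read_str().unwrap(), "é and more");
    }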
## Design and implement some solution for select / async events
We need a way to efficiently wait on multiple types of events at once,
including port receives, I/O reads, socket accepts, and timers. This has
some very complicated requirements to satisfy, as detailed in the linked
issue, and I'm not sure what the right abstractions are here. This is
super important and the biggest risk to the whole effort. If anybody has
opinions about this topic I would love to hear them.
https://github.com/mozilla/rust/issues/6842
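
To make the problem concrete, here is one of the shapes I've been
turning over, as a toy model with threads and channels: every event
source posts a tag to a single wakeup queue when it fires, and 'select'
is just a receive on that queue. Every name here is hypothetical; this
is a direction to prototype, not a design:

    use std::sync::mpsc::{channel, Sender};
    use std::thread;
    use std::time::Duration;

    #[derive(Debug)]
    enum Event { PortReady, IoReady, TimerFired }

    fn main() {
        let (wake_tx, wake_rx) = channel::<Event>();

        // A "timer" event source.
        let tx: Sender<Event> = wake_tx.clone();
        thread::spawn(move || {
            thread::sleep(Duration::from_millis(10));
            let _ = tx.send(Event::TimerFired);
        });

        // An "I/O" event source.
        thread::spawn(move || {
            let _ = wake_tx.send(Event::IoReady);
        });

        // Wait on multiple event types at once: the first to fire wins.
        match wake_rx.recv().unwrap() {
            Event::TimerFired => println!("timer fired first"),
            Event::IoReady => println!("I/O completed first"),
            Event::PortReady => println!("a port has a message"),
        }
    }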
## Make I/O threadsafe
I/O types must perform I/O on the scheduler on which they were created,
but they are also sendable. This means that when we perform I/O we must
check that we are on the correct scheduler, and if not then reschedule
the running task. This complexity also infects 'select' and could
conceivably lead to some untenable situations at runtime that can do
nothing but `fail!`.
https://github.com/mozilla/rust/issues/6843
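
A sketch of the check, with the task and scheduler machinery stubbed
out; the thread-local id and reschedule_to are stand-ins for the real
handles:

    use std::cell::Cell;

    thread_local! {
        // Stand-in for "which scheduler is running the current task".
        static CURRENT_SCHED: Cell<usize> = Cell::new(0);
    }

    struct TcpStream {
        home_sched: usize, // the scheduler whose event loop owns the uv handle
    }

    impl TcpStream {
        fn read(&mut self, _buf: &mut [u8]) -> Option<usize> {
            let here = CURRENT_SCHED.with(|c| c.get());
            if here != self.home_sched {
                // A sendable handle can wake up on the wrong scheduler; the
                // running task must migrate home before touching uv.
                reschedule_to(self.home_sched);
            }
            // ...now safely on the home scheduler: perform the uv read...
            Some(0)
        }
    }

    fn reschedule_to(sched: usize) {
        // Placeholder: the real version would send the running task to
        // `sched`'s message queue and context-switch away, and would have
        // to fail! if no scheduler could take the task.
        CURRENT_SCHED.with(|c| c.set(sched));
    }

    fn main() {
        let mut stream = TcpStream { home_sched: 1 };
        let mut buf = [0u8; 16];
        let _ = stream.read(&mut buf);
    }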
## stdin/out/err
We need to provide non-blocking access to the global resources
stdin/stdout/stderr. Currently I'm thinking these will be Readers and
Writers backed by ports, with some protocol for obtaining exclusive access.
https://github.com/mozilla/rust/issues/6846
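
A toy model of that idea, with threads and channels standing in for
tasks and ports, and the exclusive-access protocol elided; all names
hypothetical:

    use std::io::Write;
    use std::sync::mpsc::{channel, Sender};
    use std::thread;

    // Writes from many tasks are forwarded over a port to a single owner of
    // the real stdout, so output doesn't interleave mid-write.
    struct PortWriter {
        port: Sender<Vec<u8>>,
    }

    impl PortWriter {
        fn write(&self, buf: &[u8]) {
            let _ = self.port.send(buf.to_vec());
        }
    }

    fn main() {
        let (tx, rx) = channel::<Vec<u8>>();

        // The sole owner of the underlying stdout.
        let owner = thread::spawn(move || {
            let mut out = std::io::stdout();
            for msg in rx {
                out.write_all(&msg).unwrap();
            }
        });

        let w = PortWriter { port: tx };
        w.write(b"hello from a task\n");
        drop(w); // dropping the last sender lets the owner exit
        owner.join().unwrap();
    }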
## Port existing core::io users to core::rt::io::native
In preparation for removing core::io we need to start porting existing
users to the blocking implementations (which don't yet exist) of the new
I/O API. This will involve identifying and porting missing features and
completing various other I/O tasks.
https://github.com/mozilla/rust/issues/6850
## Lock-free data structures
There are several concurrent data structures used in the scheduler that
are currently implemented with locks and, because they are heavily
contended, need to be reimplemented without them. The easiest of these are the
MessageQueue and the SleeperList. MessageQueue is a multiple-producer,
single-consumer unbounded queue used for sending messages between
schedulers. SleeperList is a multiple-producer, multiple-consumer
bounded stack used to track which schedulers are 'asleep'.
https://github.com/mozilla/rust/issues/6837
https://github.com/mozilla/rust/issues/6838
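
The MessageQueue case is well-trodden; I'm thinking of something along
the lines of Dmitry Vyukov's non-intrusive MPSC queue, where producers
synchronize on a single atomic swap and the consumer walks the list
privately. A sketch (illustrative only; freeing the remaining nodes on
drop is elided):

    use std::cell::UnsafeCell;
    use std::ptr;
    use std::sync::atomic::{AtomicPtr, Ordering};

    struct Node<T> {
        next: AtomicPtr<Node<T>>,
        value: Option<T>,
    }

    pub struct MessageQueue<T> {
        head: AtomicPtr<Node<T>>,       // shared: producers swap this
        tail: UnsafeCell<*mut Node<T>>, // private to the single consumer
    }

    unsafe impl<T: Send> Send for MessageQueue<T> {}
    unsafe impl<T: Send> Sync for MessageQueue<T> {}

    impl<T> MessageQueue<T> {
        pub fn new() -> MessageQueue<T> {
            // A permanent stub node means producers never touch `tail`.
            let stub = Box::into_raw(Box::new(Node {
                next: AtomicPtr::new(ptr::null_mut()),
                value: None,
            }));
            MessageQueue { head: AtomicPtr::new(stub), tail: UnsafeCell::new(stub) }
        }

        // Any scheduler may push: publish with one atomic swap, then link.
        pub fn push(&self, value: T) {
            let node = Box::into_raw(Box::new(Node {
                next: AtomicPtr::new(ptr::null_mut()),
                value: Some(value),
            }));
            let prev = self.head.swap(node, Ordering::AcqRel);
            unsafe { (*prev).next.store(node, Ordering::Release) }
        }

        // Only the owning scheduler may pop (single consumer).
        pub unsafe fn pop(&self) -> Option<T> {
            let tail = *self.tail.get();
            let next = (*tail).next.load(Ordering::Acquire);
            if next.is_null() {
                return None; // empty, or a push is mid-flight
            }
            // `next` becomes the new stub; free the old one.
            *self.tail.get() = next;
            drop(Box::from_raw(tail));
            (*next).value.take()
        }
    }

    fn main() {
        let q = MessageQueue::new();
        q.push("wake up");
        unsafe { assert_eq!(q.pop(), Some("wake up")) }
    }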
## Work stealing
Multithreading is currently not implemented using work stealing, but
instead using a shared work queue. Adding work stealing will require
converting WorkQueue to a deque and adding the 'thief' phase of the work
stealing algorithm to the scheduler. Locating work queues to steal from
will involve creating further lock-free data structures. Some ideas are
outlined on the issue tracker.
James Miller has an implementation of a lock-free deque that we can use
for this. Multiple people are interested in this topic, so let's make
sure we coordinate.
https://github.com/mozilla/rust/issues/3095
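
To show the shape of the thief phase, here's a sketch written against
the crossbeam-deque crate as a stand-in for James's deque (assumed as a
dependency; victim selection is simplified to a linear scan rather than
anything clever):

    use crossbeam_deque::{Steal, Stealer, Worker};

    // One attempt to find work: local deque first, then steal from victims.
    fn find_work<T>(local: &Worker<T>, victims: &[Stealer<T>]) -> Option<T> {
        // Fast path: our own deque, popped from the private end.
        if let Some(task) = local.pop() {
            return Some(task);
        }
        // Thief phase: try each victim's deque from the public end.
        for victim in victims {
            loop {
                match victim.steal() {
                    Steal::Success(task) => return Some(task),
                    Steal::Empty => break,
                    Steal::Retry => continue, // lost a race; try again
                }
            }
        }
        None // nothing anywhere: a candidate for the sleeper list
    }

    fn main() {
        let local = Worker::new_lifo();
        let other = Worker::new_lifo();
        other.push("task from another scheduler");
        let victims = vec![other.stealer()];
        assert_eq!(find_work(&local, &victims), Some("task from another scheduler"));
    }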
## Implement stack growth
We need to make the new tasks support segmented stacks. For the most
part this will involve copying lots of fiddly bits from the previous
implementation, but I want to make the caching story simpler this time,
with each scheduler having a single stack pool instead of some stacks
originating in the scheduler and some in the task. This will be easier
once fast_ffi is finished, but it will
likely require adding a new attribute to LLVM to suppress the segmented
stack function prologue.
https://github.com/mozilla/rust/issues/6844
## Remove the old scheduler
We can probably get this done relatively soon. There are some features
that still need to be implemented, and some unimplemented features that
we can just drop, at least temporarily (pipes select). This can be done
even before finishing I/O, since the blocking core::io will continue
working fine.
https://github.com/mozilla/rust/issues/6587
## Implement a simple HTTP client/server library
I really want to be able to demonstrate a fast and convenient HTTP library.
https://github.com/mozilla/rust/issues/6167
If you've read this far then thanks for your time. I'm giving you a
virtual high five!
-Brian