WSGI and Async [was: Pyramid 2 ideas]

Alice Bevan–McGregor Thu, 24 Mar 2011 00:05:54 -0700

[Cross posted between the source of the discussion and the Web-SIG foradditional discussion there.]


On 2011-03-15 14:54:18 -0700, Mike Orr said:

There has been an ongoing discussion between the WSGI developers andTwisted about how to be more compatible. The upshot is thatasynchronous servers need some kind of token in the output stream thatmeans "I'm not ready; come back later." Other middleware would have topass this token through unchanged. And of course, the application wouldhave to use non-blocking libraries such as non-blocking databaseexecutors. I'm not sure if ordinary file access is "blocking" enough torequire that too.

Not just Twisted; have a gander at the Web-SIG mailing list forDecember and January.[1]

Unfortunately the amount of interconnectedness (and thus complexity)needed for a working solution takes the concept of async completely outof the domain of a low-level specification like WSGI.

An "I'm not ready; come back later" token, which in marrow.server.httpis already implemented—yield None from your body iterator—would, as anexample, add an immediate (or slightly delayed) callback in thereactor[2] which will then poll the application for real data. That'snot async; that's no better than AJAX polling! (And is unidirectionalto boot.)

Non-blocking libraries… how do they determine how to be non-blocking?Socket and file operations, which can be easily made non-blockingthrough the use of select/epoll/kqueue/libevent/libev, have thedistinction of being handled (and likely already used) at the WSGIserver's reactor level. How would a third-party library interface toan existing reactor in an agnostic way? I'm fairly confident that itjust wouldn't be feasible.

If non-blocking libraries implemented their own async reactors… howwould you coordinate the mess of having, potentially, half a dozenreactors?

Futures objects take some of the headache away, allowing for the WSGIapplication to ask for some work to be performed by a third-partylibrary, returning a Future, then being suspended pending the result ofthe Future. Futures are easy to detect (through duck typing) and canbe easily ignored (and passed along) by middleware.

Still, Futures are usually bound to an Executor (reactor), and thatexecutor instance would need to be passed to the third-party librariessomehow. marrow.server.http provides a 'wsgi.executor' environmentvariable which is, usually, a thread pool worker, which still doesn'tquite qualify for async status.

A PEP extending Futures for use with true async models would be a greatstart, and could likely be combined with a simple extension to WSGI toadd the appropriate environment variables. Alex Grönholm, GrahamDumpleton, everyone on the (excellent, if lacking in bloody combat ;)WSGI panel at PyCon, and I seem to all agree that async has no part incore WSGI. There would simply be no way to get a consensus on a singleAPI with so many disparate implementations already in the wild.

The upshot has been that Twisted runs WSGI applications in a threadanyway because it can't be sure they won't block.

As does marrow.server.http if requested to do so. Extremely small orefficient applications can choose not to.

And there hasn't been enough interest from WSGI developers to actuallypursue using it with asynchronous servers.

I've been interested, as has my partner in crime. We've actuallyfiddled around with futures-based core IO reactors, different return /yield styles for WSGI applications, and all sorts of crazy things, andalways came to the same conclusion. :(

I think Python has a future object now which standardizes Twisted'sDeferred and the equivalent in other asynchronous servers. So that's astart.

The core Futures implementation (concurrent.futures; core in Python 3.2with a portable back-port maintained, I believe, by Alex) utilizes athread pool or process pool, has referential limitations (i.e. don'tpass the executor to a future running in a process pool… deadlocks arebad), and I simply have no idea at this time how difficult it would beto create a true async reactor under that model.

The end result of all of this is that async support should be its ownPEP, extending WSGI (333, now 3333) and potentially extending Futures,PEP 3148[3], to create an acceptable generalized API for asyncinterfaces, not just worker pools. I've abandoned the idea for my ownWSGI 2 WIP, PEP 444[3], which marrow.server.http is the referenceimplementation (and idea sandbox) of/for.


        — Alice.

P.s. If anyone has information I don't have, or simply can't rememberat midnight after a very long day, feel free to correct me! :)


[1] http://mail.python.org/pipermail/web-sig/

[2] I know, 'reactor' isn't exactly an accurate term. I just can'tremember the right one right now.


[3] http://www.python.org/dev/peps/pep-3148/

[4] http://bit.ly/fRyMJ2


--
You received this message because you are subscribed to the Google Groups 
"pylons-devel" group.
To post to this group, send email to pylons-devel@googlegroups.com.
To unsubscribe from this group, send email to 
pylons-devel+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/pylons-devel?hl=en.

WSGI and Async [was: Pyramid 2 ideas]

Reply via email to