09.01.2011 19:03, P.J. Eby wrote:
At 06:06 AM 1/9/2011 +0200, Alex Grönholm wrote:
A new feature here is that the application itself yields a (status,
headers) tuple and then chunks of the body (or futures).
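For concreteness, a trivial application under this proposal might look
something like the following sketch (names are illustrative only): the
generator yields the (status, headers) tuple first and then the body chunks.

    def hello_app(environ):
        yield ('200 OK', [('Content-Type', 'text/plain'),
                          ('Content-Length', '13')])
        yield b'Hello, world!'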
Hm. I'm not sure if I like that. The typical app developer really
shouldn't be yielding multiple body strings in the first place. I
much prefer that the canonical example of a WSGI app just return a
list with a single bytestring -- preferably in a single statement for
the entire return operation, whether it's a yield or a return.
Uh, so don't yield multiple body strings then? How is that so difficult?
IOW, I want it to look like the normal way to do things is to just
return the whole response at once, and use the additional difficulty of
creating a second iterator to discourage people from writing iterated
bodies when they should just write everything to a BytesIO and be done
with it.
I fail to understand why a second iterator is necessary when we can get
away with just one.
Also, it makes middleware simpler: the last line can just yield the
result of calling the app, or a modified version, i.e.:
    yield app(environ)
or:
    s, h, b = app(environ)
    # ... modify or replace s, h, b
    yield s, h, b
Asynchronous applications may not be ready to send the status line as
the first thing coming out of the generator. Consider an app that
receives a file. The first thing coming out of the app is a future. The
app needs to receive the entire file before it can determine what status
line to send. Maybe there was an I/O error writing the file, so it needs
to send a 500 response instead of a 200. This is not possible with a body
iterator, and if we are already iterating the application generator, I
really don't understand why the body needs to be an iterator as well.
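To make that concrete, here is a rough sketch of such a file-receiving app
under the single-generator protocol, assuming the async_read() method
discussed further down (which returns a future) and a server that resumes
the generator with each future's result:

    def receive_file(environ):
        remaining = int(environ.get('CONTENT_LENGTH') or 0)
        try:
            with open('/tmp/upload', 'wb') as out:
                while remaining:
                    # futures come out of the generator before any status line
                    chunk = yield environ['wsgi.input'].async_read(
                        min(remaining, 65536))
                    out.write(chunk)
                    remaining -= len(chunk)
        except IOError:
            yield ('500 Internal Server Error', [('Content-Type', 'text/plain')])
            yield b'upload failed'
        else:
            yield ('200 OK', [('Content-Type', 'text/plain')])
            yield b'upload stored'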
In your approach, the above samples have to be rewritten as:
    return app(environ)
or:
    result = app(environ)
    s, h = yield result
    # ... modify or replace s, h
    yield s, h
    for data in result:
        # modify b as we go
        yield data
Only that last bit doesn't actually work, because you have to be able
to send future results back *into* the result. Try actually making
some code that runs on this protocol and yields to futures during the
body iteration.
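The distinction matters because a plain for loop can only call next() on the
generator; a driver has to use send() to push each future's result back in.
A minimal (blocking) sketch of such a driver, assuming the futures are
concurrent.futures.Future objects -- a real async server would register a
callback instead of calling .result():

    from concurrent.futures import Future

    def drive(gen, handle_chunk):
        # Trampoline: push each future's result back *into* the generator,
        # which a plain "for data in gen" loop has no way to do.
        try:
            value = next(gen)
            while True:
                if isinstance(value, Future):
                    value = gen.send(value.result())  # blocks; illustration only
                else:
                    handle_chunk(value)
                    value = next(gen)
        except StopIteration:
            pass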
Did you miss the gist posted by myself (and improved by Alice)?
Really, this modified protocol can't work with a full async API the
way my coroutine-based version does, AND the middleware is much more
complicated. In my version, your do-nothing middleware looks like this:
class NullMiddleware(object):
    def __init__(self, app):
        self.app = app

    def __call__(self, environ):
        # ACTION: pre-application environ mangling
        s, h, body = yield self.app(environ)
        # modify or replace s, h, body here
        yield s, h, body
If you want to actually process the body in some way, it looks like:
class NullMiddleware(object):
    def __init__(self, app):
        self.app = app

    def __call__(self, environ):
        # ACTION: pre-application environ mangling
        s, h, body = yield self.app(environ)
        # modify or replace s, h, body here
        yield s, h, self.process(body)

    def process(self, body_iter):
        while True:
            chunk = yield body_iter
            if chunk is None:
                break
            # process/modify chunk here
            yield chunk
And that's still a lot simpler than your sketch.
Personally, I would write both of the above as:
def null_middleware(app):
    def wrapped(environ):
        # ACTION: pre-application environ mangling
        s, h, body = yield app(environ)
        # modify or replace s, h, body here
        yield s, h, process(body)

    def process(body_iter):
        while True:
            chunk = yield body_iter
            if chunk is None:
                break
            # process/modify chunk here
            yield chunk

    return wrapped
But that's just personal taste. Even as a class, it's much easier to
write. The above middleware pattern works with the sketches I gave on
the PEAK wiki, and I've now updated the wiki to include an example app
and middleware for clarity.
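For illustration only (this is a sketch, not the wiki's code), a trivial
application in this coroutine style could be written as a generator that
yields a single (status, headers, body) triple, with the body as a separate
iterator:

    def example_app(environ):
        def body():
            yield b'Hello, world!'
        yield ('200 OK',
               [('Content-Type', 'text/plain'), ('Content-Length', '13')],
               body())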
Really, the only hole in this approach is dealing with applications
that block. The elephant in the room here is that while it's easy to
write these example applications so they don't block, in practice
people read files and do database queries and whatnot in their
requests, and those APIs are generally synchronous. So, unless they
somehow fold their entire application into a future, it doesn't work.
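One rough sketch of such a fold is to push the blocking call into a thread
pool via the new concurrent.futures module (PEP 3148) and yield the
resulting future; run_query below is just a stand-in for whatever blocking
API the application actually uses:

    from concurrent.futures import ThreadPoolExecutor

    _pool = ThreadPoolExecutor(max_workers=4)  # pool size is an arbitrary choice

    def db_app(environ):
        # run_query stands in for any blocking call (file read, DB query, ...)
        future = _pool.submit(run_query, environ.get('QUERY_STRING', ''))
        rows = yield future  # the server resumes us with the call's result
        body = repr(rows).encode('utf-8')
        yield ('200 OK', [('Content-Type', 'text/plain')], [body])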
I liked the idea of having a separate async_read() method in
wsgi.input, which would set the underlying socket in nonblocking mode
and return a future. The event loop would watch the socket and read
data into a buffer and trigger the callback when the given amount of
data has been read. Conversely, .read() would set the socket in
blocking mode. What kinds of problems would this cause?
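As a rough sketch of that mechanism (the names are invented: add_reader()
and remove_reader() stand in for whatever registration API the event loop
actually exposes, and the future is a concurrent.futures.Future):

    from concurrent.futures import Future

    class AsyncInput(object):
        def __init__(self, sock, loop):
            self.sock = sock
            self.loop = loop

        def async_read(self, size):
            # Nonblocking path: hand back a future that the event loop
            # resolves once 'size' bytes (or EOF) have been buffered.
            future = Future()
            buf = bytearray()
            self.sock.setblocking(False)

            def on_readable():
                data = self.sock.recv(size - len(buf))
                buf.extend(data)
                if not data or len(buf) >= size:  # EOF or enough data
                    self.loop.remove_reader(self.sock)
                    future.set_result(bytes(buf))

            self.loop.add_reader(self.sock, on_readable)
            return future

        def read(self, size):
            # Blocking path, as described above.
            self.sock.setblocking(True)
            return self.sock.recv(size)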
That you could never *call* the .read() method outside of a future, or
else you would block the server, thereby obliterating the point of
having the async API in the first place.
Outside of the application/middleware you mean? I hope there isn't any
more confusion left about what a future is. The fact is that you cannot
use synchronous API calls directly from an async app no matter what.
Some workaround is always necessary.