On May 24, 2010, at 5:16 AM, Glyph Lefkowitz wrote:


On May 23, 2010, at 2:37 AM, Brian Quinlan wrote:

On May 23, 2010, at 2:44 PM, Glyph Lefkowitz wrote:


On May 22, 2010, at 8:47 PM, Brian Quinlan wrote:

Jesse, the designated pronouncer for this PEP, has decided to keep discussion open for a few more days.

So fire away!

As you wish!

I retract my request ;-)

May you get what you wish for, may you find what you are seeking :).

The PEP should be consistent in its usage of terminology about callables. It alternately calls them "callables", "functions", and "functions or methods". It would be nice to clean this up and be consistent about what can be called where. I personally like "callables".

Did you find the terminology confusing? If not, then I propose not changing it.

Yes, actually. Whenever I see references to the multiprocessing module, I picture a giant "HERE BE (serialization) DRAGONS" sign. When I saw that some things were documented as being "functions", I thought that maybe there was an intended restriction like "these can only be top-level functions, so they're easy for different executors to locate and serialize". I didn't realize that the intent was "arbitrary callables" until I carefully re-read the document and noticed that the terminology was inconsistent.

ProcessPoolExecutor has the same serialization perils that multiprocessing does. My original plan was to link to the multiprocessing docs to explain them, but I couldn't find them documented there.
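
For example (an untested sketch using the released package; the square helper is just for illustration), the callable and its arguments have to survive a round-trip through pickle:

    from futures import ProcessPoolExecutor

    def square(x):  # a module-level function pickles fine
        return x * x

    executor = ProcessPoolExecutor()
    print(list(executor.map(square, [1, 2, 3])))  # [1, 4, 9]
    # executor.map(lambda x: x * x, [1, 2, 3]) would fail:
    # lambdas (and other non-module-level callables) can't be pickled.
    executor.shutdown()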

But changing it in the user docs is probably a good idea. I like "callables" too.

Great. Still, users will inevitably find the PEP and use it as documentation too.

The execution context of callable code is not made clear. Implicitly, submit() or map() would run the code in threads or processes as defined by the executor, but that's not spelled out clearly.

Any response to this bit?  Did I miss something in the PEP?

Yes, the execution context is Executor-dependent. The sections on ProcessPoolExecutor and ThreadPoolExecutor spell this out, I think.

More relevant to my own interests, the execution context of the callables passed to add_done_callback and remove_done_callback is left almost completely to the imagination. If I'm reading the sample implementation correctly, <http://code.google.com/p/pythonfutures/source/browse/branches/feedback/python3/futures/process.py#241>, it looks like in the multiprocessing implementation, the done callbacks are invoked in an arbitrary local thread. The fact that they are passed the future itself *sort* of implies that this is the case, but the multiprocessing module plays fast and loose with object identity all over the place, so it would be good to be explicit and say that it's *not* a pickled copy of the future sitting in some arbitrary process (or even on some arbitrary machine).

The callbacks will always be called in a thread other than the main thread in the process that created the executor. Is that a strong enough contract?

Sure. Really, almost any contract would work; it just needs to be spelled out. It might be nice to know whether the thread invoking the callbacks is a daemon thread or not, but I suppose that's not strictly necessary.

Your concern is that the thread will be killed when the interpreter exits? It won't be.

This is really minor, I know, but why does it say "NOTE: This method can be used to create adapters from Futures to Twisted Deferreds"? First of all, what's the deal with "NOTE"; it's the only "NOTE" in the whole PEP, and it doesn't seem to add anything. This sentence would read exactly the same if that word were deleted. Without more clarity on the required execution context of the callbacks, this claim might not actually be true anyway; Deferred callbacks can only be invoked in the main reactor thread in Twisted. But even if it is perfectly possible, why leave so much of the adapter implementation up to the imagination? If it's important enough to mention, why not have a reference to such an adapter in the reference Futures implementation, since it *should* be fairly trivial to write?

I'm a bit surprised that this doesn't allow for better interoperability with Deferreds given this discussion:

<discussion snipped>

I did not communicate that well. As implemented, it's quite possible to write a translation layer that turns a Future into a Deferred. What I meant by that comment was that the specification in the PEP was too loose to be sure that such a layer would work with arbitrary executors.

For what it's worth, the Deferred translator would look like this, if you want to include it in the PEP (untested though, you may want to run it first):

    from twisted.internet.defer import Deferred
    from twisted.internet import reactor

    def future2deferred(future):
        d = Deferred()
        def invoke_deferred():
            # Runs in the reactor thread; the future is already done, so
            # result() returns (or raises) without blocking.
            try:
                result = future.result()
            except:
                # errback() with no argument wraps the currently active
                # exception in a Failure.
                d.errback()
            else:
                d.callback(result)
        def done_callback(same_future):
            # Called in the executor's callback thread; Deferreds may only
            # be fired in the reactor thread, so bounce over to it.
            reactor.callFromThread(invoke_deferred)
        future.add_done_callback(done_callback)
        return d

This does beg the question of what the traceback will look like in that except: block, though. I guess the multi-threaded executor will use Python 3 exception chaining, so Deferred should be able to show a sane traceback in case of an error, but what about exceptions in other processes?
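
(For reference, implicit chaining in Python 3 keeps the original exception around across a re-raise; a generic illustration, nothing executor-specific:)

    try:
        try:
            1 / 0
        except ZeroDivisionError:
            raise RuntimeError("callable failed")  # implicitly chains
    except RuntimeError as e:
        print(repr(e.__context__))  # the original ZeroDivisionError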

I suggest keeping add_done_callback, implementing it with a list so that callbacks are always invoked in the order they're added, and getting rid of remove_done_callback.
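
Something like this hypothetical bookkeeping (the names are made up) is all I mean:

    class Future(object):
        def __init__(self):
            self._done_callbacks = []  # a list, not a set

        def add_done_callback(self, fn):
            self._done_callbacks.append(fn)

        def _invoke_callbacks(self):
            # Called once by the executor when the future completes;
            # callbacks run strictly in the order they were added.
            for fn in self._done_callbacks:
                fn(self)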

Sounds good to me!

Great! :-)

futures._base.Executor isn't exposed publicly, but it needs to be. The PEP kinda makes it sound like it is ("Executor is an abstract class..."). Plus, a third-party library wanting to implement an executor of its own shouldn't have to copy and paste the implementation of Executor.map.
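
For example, with Executor exposed publicly, a trivial third-party executor could be written without any copying (an untested sketch; SynchronousExecutor is a made-up name):

    from futures import Executor, Future

    class SynchronousExecutor(Executor):
        """Runs each callable immediately in the calling thread."""
        def submit(self, fn, *args, **kwargs):
            future = Future()
            if future.set_running_or_notify_cancel():
                try:
                    future.set_result(fn(*args, **kwargs))
                except BaseException as e:
                    future.set_exception(e)
            return future
        # map() is inherited from Executor instead of copy-and-pasted.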

That was a bug that I've fixed. Thanks!

Double-great!

One minor suggestion on the "internal future methods" bit - something I wish we'd done with Deferreds was to put 'callback()' and 'addCallbacks()' on separate objects, so that it was very explicit whether you were on the emitting side of a Deferred or the consuming side. That seems to be the case with these internal methods - they are not so much "internal" as they are for the producer of the Future (whether a unit test or executor), so you might want to put them on a different object that's easy for the thing creating a Future() to get at, but hard for any subsequent application code to fiddle with by accident. Off the top of my head, I suggest naming it "Invoker()". A good way to do this would be to have an Invoker class which can't be instantiated (raises an exception from __init__ or somesuch), then a Future.create() method which returns an Invoker, which itself has a '.future' attribute.

No reaction to this part? I think in a couple of years you'll wish you had done this, when you start bumping into application code that calls "set_result" :).

My reactions are mixed ;-)

Your proposal is to add a level of indirection to make it harder for people to call implementation methods. The downside is that it makes it a bit harder to write tests and executors. I also can't see a big problem with letting people call set_result in client code, though it is documented as being only for Executor implementations and tests.

On the implementation side, I don't see why an Invoker needs a reference to the future. Each Invoker could own one Future. A reference to the Invoker is kept by the Executor, and its future is returned to the client, i.e.:

# Future here is the reference implementation's class; under this
# proposal its set_* mutators would move onto Invoker.
from futures import Future

class Invoker(object):
  def __init__(self):
    """Should only be called by Executor implementations."""
    self.future = Future()

  def set_running_or_notify_cancel(self):
    # Messes with self.future's internals
    ...

  def set_result(self, result):
    # Messes with self.future's internals
    ...

  def set_exception(self, exception):
    # Messes with self.future's internals
    ...

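e.g. inside a hypothetical executor (SomeExecutor and _work_queue are made-up names):

class SomeExecutor(Executor):
  def submit(self, fn, *args, **kwargs):
    invoker = Invoker()
    # A worker thread later calls invoker's set_* methods.
    self._work_queue.put((invoker, fn, args, kwargs))
    return invoker.future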

Cheers,
Brian


Finally, why isn't this just a module on PyPI? It doesn't seem like there's any particular benefit to making this a stdlib module and going through the whole PEP process - except maybe to prompt feedback like this :).

We've already had this discussion before. Could you explain why this module should *not* be in the stdlib, e.g. does it have significantly less utility than other stdlib modules? Is it significantly higher risk? etc.

You've convinced me, mainly because I noticed later on in the discussion that it *has* been released on PyPI for several months and does have a fair number of downloads. It doesn't have quite the popularity I'd personally like to see for stdlib modules, but it's not like you didn't try, and you do (sort of) have a point about small modules having a hard time gaining adoption. I'm sorry that this, in my opinion the least interesting of my points, is what has seen the most discussion so far.

I'd appreciate it if you could do a release to PyPI with the bugfixes you mentioned here, to make sure that the released version is consistent with what eventually gets into Python.

Oh, and one final nitpick: <http://www.rfc-editor.org/rfc/rfc2606.txt> says you really should not put real domain names into your "web crawl example", especially not "some-made-up-domain.com"; RFC 2606 reserves example.com/.net/.org for exactly this purpose.
