Hi, something that is not clear to me (and I cannot find clear answers to this), is whether the send thread of an executor A is able to directly move a tuple to the input queue of another executor B (within the same worker of course) or whether all output tuples have to go through the shared transfer queue and then copied to input queues by the worker send thread. In such a case, wouldn't such a thread be a bottleneck? A worker would be able to go as fast as this thread is able to move tuples around, isn't it?
Thanks!
