inactive streams

Vlad Rozov Sat, 01 Apr 2017 08:13:19 -0700

All,

Currently Apex assumes that an operator can emit on any defined outputport and all streams defined by a DAG are active. I'd like to propose anability for an operator to open and close output ports. By default allports defined by an operator will be open. In the case an operator forany reason decides that it will not emit tuples on the output port, itmay close it. This will make the stream inactive and the applicationmaster may undeploy the downstream (for that input stream) operators. Ifthis leads to containers that don't have any active operators, thosecontainers may be undeployed as well leading to better cluster resourceutilization and better Apex elasticity. Later, the operator may be in astate where it needs to emit tuples on the closed port. In this case, itneeds to re-open the port and wait till the stream becomes active againbefore emitting tuples on that port. Making inactive stream activeagain, requires the application master to re-allocate containers andre-deploy the downstream operators.

It should be also possible for an application designer to mark streamsas inactive when an application starts. This will allow the applicationmaster avoid reserving all containers when the application starts.Later, the port can be open and inactive stream become active.


Thank you,

Vlad

open/close ports and active/inactive streams

Reply via email to