GitHub user prateekm opened a pull request: https://github.com/apache/samza/pull/194
SAMZA-1221, SAMZA-1101: Internal cleanup for High-Level API implementation. SAMZA-1221: Separated the OperatorSpec and MessageStream DAGs so that they're now duals of each other. Users interact with and construct the MessageStream DAG; we create and use the OperatorSpec DAG internally. Moved common OperatorSpec functionality (getId, getOpCode, getOpName etc.) to the OperatorSpec abstract base class. Added a new JoinOperatorSpec and PartialJoinOperatorImpls which are created from JoinOperatorSpec in OperatorGraphImpl. Added a new InputOperatorSpec and InputOperatorImpl (previously RootOperatorImpl). InputOperatorSpec is created when StreamGraph#getInputStream is called. SAMZA-1101: Added a new OutputOperatorSpec and OutputOperatorImpl for partitionBy and sendTo. These are Separate from SinkOperatorSpec for and SinkOperatorImpl for sink. We don't need to create a sinkFn for partitionBy and sendTo anymore. Updated most unit tests to use the new classes and avoid reflection. You can merge this pull request into a Git repository by running: $ git pull https://github.com/prateekm/samza internal-cleanup Alternatively you can review and apply these changes as the patch at: https://github.com/apache/samza/pull/194.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #194 ---- commit 58ea09f21d3810b05acdea4c55cf254885e0e324 Author: Prateek Maheshwari <pmahe...@linkedin.com> Date: 2017-05-17T22:58:28Z Internal cleanup for High-Level API implementation. Separated the OperatorSpec and MessageStream DAGs so that they're now duals of each other. Users interact with and construct the MessageStream DAG; we create and use the OperatorSpec DAG internally. Moved common OperatorSpec functionality (getId, getOpCode, getOpName etc.) to the OperatorSpec abstract base class. New JoinOperatorSpec and PartialJoinOperatorImpls created from JoinOperatorSpec in OperatorGraphImpl. New InputOperatorSpec and InputOperatorImpl (previously RootOperatorImpl) created when StreamGraph#getInputStream is called. New OutputOperatorSpec and OutputOperatorImpl for partionBy and sendTo. These are Separate from SinkOperatorSpec for and SinkOperatorImpl for sink. Don't need to create a sinkFn for partitionBy and sendTo anymore. Updated most unit tests to use the new classes and avoid reflection. ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---