GitHub user vjagadish1989 opened a pull request:
https://github.com/apache/samza/pull/211
SAMZA-1271: Guarantee predictable, deterministic order for operator
initialization and finalization
Currently, the order of initialization of operators in the Samza high level
API is not deterministic. The non-determinism arises from two primary causes:
- No fixed order of iteration for all subscribed `OperatorSpec`s for a
given `MessageStream`
- No fixed order of iteration for all the `OperatorImpl`s in the
`OperatorImplGraph`
We aim to provide the following 2 guarantees in this patch:
For any 2 operators A, B in the graph, if B consumes the output of A:
- A is initialized before B is initialized
- A is finalized only after B is finalized
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/vjagadish1989/samza deterministic_order
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/samza/pull/211.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #211
----
commit 95dde24595702748ac4d448fb528a3c3bb8c43da
Author: vjagadish1989 <[email protected]>
Date: 2017-06-02T07:29:03Z
Guarantee deterministic order for operator initialization and finalization
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---