[ 
https://issues.apache.org/jira/browse/SAMZA-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15768557#comment-15768557
 ] 

Navina Ramesh commented on SAMZA-1064:
--------------------------------------

We had a round of discussions based on this initial design. Here are some notes:
* Action Items (or things not yet addressed in the design):
** Split config from JobModel message in the Coordination Stream
** Job Model distribution mechanism should be defined clearly
** Add a note about Yarn: In yarn , the leader starts the processors. Hence, it 
can tell the container what version of job model to pick up from the CS. 

* Design document should call out the following:
** rolling bounce is not supported in this design
** what defines the lifetime of a Samza job attempt?
** Debounce time should be larger than sessionTimeout (debounceTime >> 
sessionTimeout)
** JobModel (and its version) should be treated as a resource that can be 
updated by only one processor. Hence, we should use conditional updates to 
fence off JobModel resource
** Figure out how to clean up the incomplete and/ old barriers

* Questions to resolve:
** Is coordination stream a suitable option for job model distribution? What 
kind of guarantees do we want wrt jobmodel distribution (atomicity/durability?)
** Should JM be written as a single message to the coordinator stream ? or 
multiple small container model messages? (1 msg / multipart message in 
Coordinator Stream). If it is the latter, how do we guarantee atomicity of 
writes? Do we need marker messages to indicate the  start/stop of the batch of 
job model messages?
** Do we need job attempts in the tree? If yes, who creates the persistent 
subtrees 
** Apart from performance, what is the benefit of having a /leader znode as a 
lock? Does it really provide fencing for the leader processor?

> Standalone Samza with Zookeeper for Coordination
> ------------------------------------------------
>
>                 Key: SAMZA-1064
>                 URL: https://issues.apache.org/jira/browse/SAMZA-1064
>             Project: Samza
>          Issue Type: New Feature
>            Reporter: Navina Ramesh
>         Attachments: Samza Standalone-0.md, Samza Standalone-0.pdf, 
> image_0.png, image_1.png
>
>
> In this use-case, we propose using Zookeeper for coordinating processor 
> liveness and task distribution in a Samza job. Additionally, it opens up the 
> possibility of allowing a flexible number of participating processors (no 
> fixed container count).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to