[ 
https://issues.apache.org/jira/browse/SAMZA-676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14570127#comment-14570127
 ] 

Yan Fang commented on SAMZA-676:
--------------------------------

First attempt RB: https://reviews.apache.org/r/34974/

1. basically uses an approach similar to the one [~twbecker] mentioned: the 
same SSP is assigned to multiple taskInstances. In addition, the taskInstance 
is modified so that it can skip already-processed messages. This is useful 
because, when users relocate containers, different taskInstances may have 
different offsets for the broadcast stream. When an SSP has multiple offsets, 
the oldest one is used.

2. also rewrote the Grouper classes in Java and added the global streams when 
grouping.

3. touched SAMZA-569 -- added an OffsetComparator for SystemAdmin

4. fixed SAMZA-699 -- changed to LinkedHashSet to preserve the order

5. modified the offsetManager to preserve the relation between 
taskName-SSPs-offsets.
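
To make the offset handling in point 1 concrete, here is a minimal sketch of 
the idea, not the code in the RB. It assumes offsets are numeric strings (as 
with Kafka) and that a task's checkpoint holds its last processed offset. The 
class name, the method names, and the comparator are hypothetical and only 
stand in for whatever the patch actually uses.

```java
import java.util.Collection;
import java.util.Comparator;

// Hypothetical illustration only; none of these names come from the patch.
final class BroadcastOffsetSketch {

    // Assumption: offsets are numeric strings, so they can be ordered as longs.
    static final Comparator<String> NUMERIC_OFFSETS =
        Comparator.comparingLong(Long::parseLong);

    // When several tasks hold different offsets for the same SSP,
    // pick the oldest (smallest) one so that no task misses a message.
    static String oldestOffset(Collection<String> offsets, Comparator<String> cmp) {
        return offsets.stream()
            .min(cmp)
            .orElseThrow(() -> new IllegalArgumentException("no offsets for SSP"));
    }

    // A task skips a message if its own last processed offset is already
    // at or past the message's offset. A null checkpoint means nothing
    // has been processed yet, so nothing is skipped.
    static boolean shouldSkip(String messageOffset, String lastProcessed,
                              Comparator<String> cmp) {
        return lastProcessed != null && cmp.compare(messageOffset, lastProcessed) <= 0;
    }
}
```

For example, if one task last processed offset 5 and another last processed 
offset 9 on the same broadcast SSP, the oldest offset is 5. The second task 
then skips messages up to and including offset 9, while the first task 
processes everything after offset 5.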

Looking forward to your feedback! Thank you.

> Implement Broadcast Stream
> --------------------------
>
>                 Key: SAMZA-676
>                 URL: https://issues.apache.org/jira/browse/SAMZA-676
>             Project: Samza
>          Issue Type: Improvement
>          Components: container
>            Reporter: Yan Fang
>            Assignee: Yan Fang
>         Attachments: BroadcastStreamDesign.md, BroadcastStreamDesign.pdf, 
> BroadcastStreamDesign_1.md, BroadcastStreamDesign_1.pdf
>
>
> There has been a lot of discussion in SAMZA-353 about assigning the same SSP 
> to multiple taskNames. This ticket is a subset of that discussion and focuses 
> only on the broadcast stream implementation.
> The goal is to assign one SSP to all the taskNames.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)