Jakob Homan created SAMZA-355:
---------------------------------

             Summary: Provide a SystemStreamPartitionGrouper that groups into N 
sets
                 Key: SAMZA-355
                 URL: https://issues.apache.org/jira/browse/SAMZA-355
             Project: Samza
          Issue Type: Task
          Components: container
    Affects Versions: 0.8.0
            Reporter: Jakob Homan
         Attachments: SAMZA-355.patch

As part of SAMZA-123, it was proposed to provide an SSPGrouper that would hash 
the SSPs into a fixed number of tasknames.  This would provide similar, 
not-at-all-grouped functionality as the GroupByPartitionGrouper but with only n 
TaskInstances created rather than one for each SSP.  If that n is tied to the 
number of containers, this is a conceptually simple way of processing lots of 
SSPs that need no co-grouping.  

As long as the n is not changed, the SSPs would be guaranteed to hash to the 
same TaskName with each run.

There was some concern over if this is the correct approach to take, so it was 
agreed to post the patch separately and make it available to the community and 
for further discussion.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to