[ 
https://issues.apache.org/jira/browse/SAMZA-1293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013281#comment-16013281
 ] 

ASF GitHub Bot commented on SAMZA-1293:
---------------------------------------

Github user lindong28 closed the pull request at:

    https://github.com/apache/samza/pull/193


> Enable partition expansion of input streams
> -------------------------------------------
>
>                 Key: SAMZA-1293
>                 URL: https://issues.apache.org/jira/browse/SAMZA-1293
>             Project: Samza
>          Issue Type: New Feature
>            Reporter: Dong Lin
>
> Right now, Samza does not allow the number of partitions of an input stream 
> to increase after a stateful job is created. This causes problems when Kafka 
> is used as the input system, because we need to expand the partitions of an 
> existing topic as its byte-in rate increases over time, in order to limit 
> the size of the largest partition in Kafka. A Kafka broker may have 
> performance issues if a given partition is too large.
> This patch provides a solution for increasing the number of partitions of the 
> input streams of a stateful Samza job while still ensuring the correctness of 
> the Samza job's output. The solution should work when Kafka is used as the 
> input system. We expect it to work similarly with other input systems as well. 
> The motivation for increasing the number of partitions of a Kafka topic is to 
> 1) improve the performance of the Kafka broker and 2) increase the throughput 
> of the Kafka consumer in the Samza container.
> See SEP-5 
> (https://cwiki.apache.org/confluence/display/SAMZA/SEP-5%3A+Enable+partition+expansion+of+input+streams)
>  for the design and the interface change of this patch.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
