Hi Seung, You can better refer to the section Stream Groupings in the following link attached below
https://storm.apache.org/documentation/Concepts.html It will get you better understanding of the tuple distribution in Storm, for clear understanding here is the pictorial representation of the same, https://blog.safaribooksonline.com/wp-content/uploads/2013/06/Grouping.png Cheers! On Mon, Jun 8, 2015 at 10:47 AM, Dima Dragan <[email protected]> wrote: > Hi, Seungtack! > > Distribution of messages will be depends only from grouping (in case of > "shuffe grouping", Tuples are randomly distributed across the all bolt's > tasks in a way such that each bolt is guaranteed to get an equal number of > tuples. > > Best regards, > Dmytro Dragan > On Jun 8, 2015 07:12, "Seungtack Baek" <[email protected]> > wrote: > >> Hi, >> >> I have read from the documentation that if you have more spout tasks than >> kafka partition, the excessive tasks will remain idle for entire lifecycle >> of the topology. >> >> Now, Let's consider 4 spout tasks, 32 bolt tasks (of one class) in 4 >> workers (in 4 nodes) and 2 partitions in kafka. Then 2 tasks will be >> assigned to each partitions in kafka and the other 2 will remain idle. >> However, does that mean that only the bolts within the same worker will get >> the messages (assuming shuffle grouping)? Or, do the messages get emitted >> to whatever bolt taks available, regardless of which worker? >> >> Thanks, >> Baek >> >> >> *Seungtack Baek | Precocity, LLC* >> >> Tel/Direct: (972) 378-1030 | Mobile: (214) 477-5715 >> >> *[email protected] <[email protected]>* | >> www.precocityllc.com >> >> >> This is the end of this message. >> >> -- >> >> On Sun, Jun 7, 2015 at 10:12 PM, Seungtack Baek < >> [email protected]> wrote: >> >>> Hi, >>> >>> I have read from the documentation that if you have more spout tasks >>> than kafka partition, the excessive tasks will remain idle for entire >>> lifecycle of the topology. >>> >>> Now, Let's consider 4 spout tasks, 32 bolt tasks (of one class) in 4 >>> workers (in 4 nodes) and 2 partitions in kafka. Then 2 tasks will be >>> assigned to each partitions in kafka and the other 2 will remain idle. >>> However, does that mean that only the bolts within the same worker will get >>> the messages (assuming shuffle grouping)? Or, do the messages get emitted >>> to whatever bolt taks available, regardless of which worker? >>> >>> Thanks, >>> Baek >>> >>> >>> *Seungtack Baek | Precocity, LLC* >>> >>> Tel/Direct: (972) 378-1030 | Mobile: (214) 477-5715 >>> >>> *[email protected] <[email protected]>* | >>> www.precocityllc.com >>> >>> >>> This is the end of this message. >>> >>> -- >>> >> >>
