Hi all,
Taking Storm distributed processing framework as a reference: Following two problems we are facing. (1) Find the optimal time out of batch processing based on traffic load and provision to set timeout at runtime in storm cluster. (2) Computes the optimal degree of parallelism for each bolt based on execute latency of bolt. (Hint: find the bolt stats through nimbus service and increase the bolt’s executor by firing rebalancing command after fixed interval if load increases) Can any one help us in this? If you have any link on this please share us. regards Tarkeshwar On Fri, Apr 25, 2014 at 3:16 AM, Charles LeDoux <[email protected]>wrote: > Is it possible to join a trident stream with itself? > > My particular use case is that I want to take the cross product of all the > incoming tuples for a batch and then only keep the joined tuples containing > a known value. > > I believe the SQL for what I am trying to accomplish is: > > SELECT * FROM table AS t1 JOIN table AS t2 ON field1 WHERE t1.field2 = > "known value"; > > My intention was to do a self join on my stream and then run the now > joined stream through a filter. > > Thanks, > Charles > > -- > PhD Candidate; University Fellow > University of Louisiana at Lafayette > Center for Advanced Computer Studies > http://charlesledoux.com > >
