Re: Syncing multiple streams to compute final result from a bolt

2016-09-20 Thread Harsh Choudhary
My use case is that I have a JSON which contains an array. I need to split that array into multiple JSONs and do some computations on them. After that, the results from each JSON have to be used together in a further calculation to come up with the final result. *Cheers!* Harsh Choudhary / Software

Re: Syncing multiple streams to compute final result from a bolt

2016-09-20 Thread Harsh Choudhary
ool. > > That said, the more important question is: is Storm the right place to do > this? When you perform time window aggregation you are susceptible to tuple > timeouts and have to also deal with making sure your aggregation is > idempotent. > > On Sep 20, 2016 7:49 AM

Syncing multiple streams to compute final result from a bolt

2016-09-20 Thread Harsh Choudhary
Hi, I am thinking of doing the following. The spout subscribes to Kafka and gets JSONs. The spout emits the JSONs as individual tuples. Bolt-A subscribes to the spout. Bolt-A creates multiple JSONs from one JSON and emits them as multiple streams. Bolt-B receives these streams and does the computation
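The "collect every partial result, then compute the final one" step described in this thread can be sketched in plain Java, outside of Storm. This is only an illustration under assumptions that are not in the original messages: the class name PartialResultJoiner is hypothetical, each partial result is assumed to carry the id of its source JSON and the number of parts that JSON was split into, and a simple sum stands in for the real final calculation.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;

/** Hypothetical plain-Java sketch of the join step; no Storm classes are used. */
class PartialResultJoiner {
    // Partial results received so far, keyed by the id of the source JSON.
    private final Map<String, List<Double>> pending = new HashMap<>();

    /**
     * Records one partial result. Returns the combined result once all
     * expectedParts for sourceId have arrived, otherwise an empty Optional.
     * The sum is a placeholder for the real final calculation.
     */
    Optional<Double> add(String sourceId, int expectedParts, double partial) {
        List<Double> parts = pending.computeIfAbsent(sourceId, k -> new ArrayList<>());
        parts.add(partial);
        if (parts.size() < expectedParts) {
            return Optional.empty();
        }
        pending.remove(sourceId); // all parts seen: release the buffer
        double total = 0;
        for (double p : parts) {
            total += p;
        }
        return Optional.of(total);
    }

    /** Number of source JSONs still waiting for parts. */
    int pendingCount() {
        return pending.size();
    }

    public static void main(String[] args) {
        PartialResultJoiner joiner = new PartialResultJoiner();
        // Parts of two source JSONs arrive interleaved.
        System.out.println(joiner.add("json-1", 3, 1.0));
        System.out.println(joiner.add("json-2", 2, 10.0));
        System.out.println(joiner.add("json-1", 3, 2.0));
        System.out.println(joiner.add("json-2", 2, 5.0)); // last part of json-2
        System.out.println(joiner.add("json-1", 3, 3.0)); // last part of json-1
    }
}
```

In a topology, logic like this would presumably live in the bolt that receives the partial results, with a fields grouping on the source id so that all parts of one JSON reach the same task. The sketch does not deduplicate replayed parts or expire incomplete entries, both of which would matter once tuples can time out and be replayed.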

Re: Syncing multiple streams to compute final result from a bolt

2016-09-23 Thread Harsh Choudhary
Thanks for all the help. :) On Wed, Sep 21, 2016 at 11:56 AM, Harsh Choudhary <shry.ha...@gmail.com> wrote: > It is real-time. I get streaming JSONs from Kafka. > > > > > On Wed, Sep 21, 2016 at 4:15 AM, Ambud Sharma <asharma52...@gmail.com> > wrote

Re: Syncing multiple streams to compute final result from a bolt

2016-09-20 Thread Harsh Choudhary
> On Tue, Sep 20, 2016 at 9:20 AM, Harsh Choudhary <shry.ha...@gmail.com> > wrote: > >> Hi >> >> I am thinking of doing the following. >> >> Spout subscribed to Kafka and get JSONs. Spout emits the JSONs as >> individual tuples. >> >> Bo

Re: Syncing multiple streams to compute final result from a bolt

2016-09-21 Thread Harsh Choudhary
> > On Sep 20, 2016 9:39 AM, "Harsh Choudhary" <shry.ha...@gmail.com> wrote: > >> My use case is that I have a json which contains an array. I need to >> split that array into multiple jsons and do some computations on them. >> After that, results fr

Storm spout sends next tuple before completion of current

2016-11-06 Thread Harsh Choudhary
Hi, I have a bolt (SPLITTER) which receives data from a KafkaSpout. The SPLITTER bolt splits the data and emits it in multiple streams to another bolt (WRANGLER). The WRANGLER takes some time to process some of the data. So before it can emit the data to another stream, the spout sends the

Re: NullPointerException on startup

2016-11-18 Thread Harsh Choudhary
Hi, this happens when some code in a bolt or spout throws a NullPointerException. I suggest using the debugger in your IDE to find out where it is happening. You can try creating a LocalCluster and running it in the IDE to figure it out easily. It never happens because of Storm, so do not

Re: NullPointerException on startup

2016-11-18 Thread Harsh Choudhary
tialization. > > > > Thanks, > > Nick > > > > *From:* Harsh Choudhary [mailto:shry.ha...@gmail.com] > *Sent:* Friday, November 18, 2016 11:18 AM > *To:* user@storm.apache.org > *Subject:* Re: NullPointerException on startup > > > > Hi > > This

Re: NullPointerException on startup

2016-11-18 Thread Harsh Choudhary
the Kafka spout? *Cheers!* Harsh Choudhary On Sat, Nov 19, 2016 at 1:24 AM, Cuneo, Nicholas <ncu...@tycoint.com> wrote: > The spout is initialized during topology submission, so how would you > delay that? Kafka is already running for a long period of time. > > Thanks, > Nick

Re: [storm-kafka] where is stored Kafka Spout consummer's offset?

2017-03-28 Thread Harsh Choudhary
> > Best regards, > Alexandre Vermeerbergen > > > 2017-03-28 8:12 GMT+02:00 Harsh Choudhary <shry.ha...@gmail.com>: > >> The storm stores its offset in the Zookeeper, it is connected to. So, you >> won't find the offset information for the storm clients in the same

Re: [storm-kafka] where is stored Kafka Spout consummer's offset?

2017-03-28 Thread Harsh Choudhary
Zookeeper. *Cheers!* Harsh Choudhary / Software Engineer Blog / express.harshti.me

Re: Storm files and folders permissions on Linux

2017-03-30 Thread Harsh Choudhary
It depends on which user you are running Storm as. That user must own Storm's folders. So what you need is chown rather than chmod. *Cheers!* Harsh Choudhary On Thu, Mar 30, 2017 at 11:24 PM, I PVP <i...@hotmail.com> wrote: > What are the recommended files/folders permissions fo
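One way to verify the ownership advice above before starting Storm is a small standalone check. The following is a minimal sketch, not part of Storm: the class name StormDirOwnerCheck is hypothetical, the default path /opt/storm is taken from the follow-up message in this thread, and the default user is assumed to be whoever runs the check.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

/** Hypothetical helper: reports whether a directory is owned by a given user. */
class StormDirOwnerCheck {

    /** True if the file system reports `user` as the owner of `dir`. */
    static boolean isOwnedBy(Path dir, String user) throws IOException {
        return Files.getOwner(dir).getName().equals(user);
    }

    public static void main(String[] args) throws IOException {
        // Assumed defaults: the install path mentioned in the thread and the current user.
        Path dir = Paths.get(args.length > 0 ? args[0] : "/opt/storm");
        String user = args.length > 1 ? args[1] : System.getProperty("user.name");

        if (!Files.exists(dir)) {
            System.out.println(dir + " does not exist");
            return;
        }
        if (isOwnedBy(dir, user)) {
            System.out.println(dir + " is owned by " + user);
        } else {
            System.out.println(dir + " is owned by " + Files.getOwner(dir).getName()
                    + ", not " + user + "; chown may be needed");
        }
    }
}
```

The check looks only at the top-level directory. Files and subfolders created earlier by another user would need the same check, or a recursive chown as in the follow-up message.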

Re: Centralized logging for storm

2017-03-31 Thread Harsh Choudhary
also look up all the other logs near a timestamp. *Cheers!* Harsh Choudhary On Fri, Mar 31, 2017 at 1:16 PM, Shashank Prasad <shash...@machaao.com> wrote: > Hi folks, > > Storm is a great tool but the logs are all over the place. As you increase > your workers, your log files wil

Re: Storm files and folders permissions on Linux

2017-03-31 Thread Harsh Choudhary
Can you show the exact error you are getting? *Cheers!* Harsh Choudhary On Thu, Mar 30, 2017 at 11:39 PM, I PVP <i...@hotmail.com> wrote: > that is how it is being done as of now: > > sudo chown -R storm:storm /opt/storm > sudo chmod -R 700 /opt/storm > > but still