Did you find any solutions to this? I ran into exactly the same situation with 0.9.4. I have a testing kafka topic with around 10m tuples and the supervisor started to kill its worker (first time around 15minutes later, then I saw the same after 2minutes, or 6 minutes, although I have set supervisor.worker.start.timeout.secs and supervisor.worker.timeout.secs to 40 minutes each).
I then did another experiment, right after the topology was running, I killed all supervisors and the workers could finish all tuples without issues. On Thu, Apr 16, 2015 at 11:56 AM, Grant Overby (groverby) < [email protected]> wrote: > I’m not, and If I had to guess I’d say it’s likely something is going > wrong with the heartbeats, but how can I go about finding out? > > > > > From: Paul Poulosky <[email protected]> > Reply-To: "[email protected]" <[email protected]>, Paul Poulosky < > [email protected]> > Date: Thursday, April 16, 2015 at 2:38 PM > To: "[email protected]" <[email protected]> > Subject: Re: Supervisor repeatedly killing worker > > 10.0.1.5 >
