If your bolt hanged it will cause workers not to send heartbeats and supervisor.worker.timeout.secs trigger causing workers to be killed and restarted. Did you try using https://github.com/apache/storm/tree/master/external/storm-kafka -Harsha
On Wed, Nov 26, 2014, at 01:40 PM, Hefeng Yuan wrote: > Hello, > > I’m trying to us HolmesNL/kafka-spout, it worked pretty well for happy > path, however, when tuple fails (e.g.* _collector.fail(input) *gets > called in bolt), it seems like only retry 3 or 4 times, and then hang > there, until the *supervisor.worker.timeout.secs* reaches, and > topology got restarted. Just wondering where is this number of retried > controlled, and also, since the tuple already *fail*, why would it > still trigger *supervisor.worker.timeout.secs*? > > Thanks, Hefeng
