Hi Team,

I am facing an issue with one of our failover tests. Storm fails all the
messages post worker restarts.

Steps done,
0. 1 spout, 3 bolts, 5 ackers
1. Pre-load tibems with 50k messages
2. Start the topology
3. Let it run for brief time and the kill the worker where the spout is
executing (spout in our topology is a single instance)
4. The worker is brought up by the supervisor automatically

Observation/query,
When spout starts pumping in data again into the topology, storm starts
failing the messages even though they are successfully processed (I have
verified this as our last bolt pushes data to kafka and the incoming/kafka
data njmber matches). I have checked the tuple anchoring and that seems to
be fine as without the worker restarts the topology acks and processes
messages fine.

Any thing I should check again ?

Reply via email to