Hi Team, I am facing an issue with one of our failover tests. Storm fails all the messages post worker restarts.
Steps done, 0. 1 spout, 3 bolts, 5 ackers 1. Pre-load tibems with 50k messages 2. Start the topology 3. Let it run for brief time and the kill the worker where the spout is executing (spout in our topology is a single instance) 4. The worker is brought up by the supervisor automatically Observation/query, When spout starts pumping in data again into the topology, storm starts failing the messages even though they are successfully processed (I have verified this as our last bolt pushes data to kafka and the incoming/kafka data njmber matches). I have checked the tuple anchoring and that seems to be fine as without the worker restarts the topology acks and processes messages fine. Any thing I should check again ?
