Hi Sahil, Which Storm version are you using? Do you have logs for Nimbus, Supervisors and Workers? If so, can you post the errors?
Some extra documentation: https://storm.apache.org/releases/1.2.3/Daemon-Fault-Tolerance.html https://storm.apache.org/releases/1.2.3/nimbus-ha-design.html On Wed, 17 Jul 2024 at 05:40, Sahil Kamboj <sahilkamboj...@gmail.com> wrote: > Hi all > > Could somebody explain me how nimbus ha can be achieved? We followed > official apache storm docs and have all config for high availability but > this seems not working. > > Issue - > > Yesterday we have a master node status check fail on aws. During this > window we were unable to open storm ui and topologies were also went to > halt state. > We have min replication count to 3 and have 3 master nodes but despite all > of this storm process was on halt. > To get it worked we taken restart of the affected master node and topology > resumed automatically. > So doesn’t storm should do it automatically? If one of the nimbus process > is down then others are there to support ha? > > > Please let me know if I am missing something. > > Regards > Sahil >