Hi Julián, Which version of Storm do you use? I remember some of Storm 0.9.x versions has some issues when workers are failing, so I'd like to know about it.
Thanks, Jungtaek Lim (HeartSaVioR) 2016년 5월 26일 (목) 오후 5:53, Julián Bermejo Ferreiro | BEEVA < [email protected]>님이 작성: > Hello, > > We have a multiple-node storm cluster running on a Production environment. > We have had some issues with a couple of machines, which have been out of > service for a few hours. > > Because some workers of the deployed topologies were running on the failed > machines, cluster's behaviour has been unusual (It has been running but not > as it should). > > Once we recovered the failed nodes, and rebalanced the topologies, the > cluster returned to work properly. > > We would like to know if there is any way to alert nimbus, when a node > fall down, in order to rebalance the affected topologies and create new > workers in the healthy nodes of the cluster that supply those who were > working on the failed ones. > > This would have helped us so much, because we could have kept consistency > in our service in spite of the failed nodes. > > Any advice? > > Tahnks in advance! > > > > > > > *JULIÁN BERMEJO FERREIRO* > *Departamento de Tecnología * > *[email protected] <[email protected]>* > <http://www.beeva.com/> > > > >
