Hello,

We have a multi-node Storm cluster running in a production environment.
We recently had issues with a couple of machines, which were out of
service for a few hours.

Because some workers of the deployed topologies were running on the failed
machines, the cluster behaved unusually (it kept running, but not as it
should).

Once we recovered the failed nodes and rebalanced the topologies, the
cluster went back to working properly.

We would like to know if there is any way to alert Nimbus when a node goes
down, so that the affected topologies are rebalanced and new workers are
created on the healthy nodes of the cluster to replace those that were
running on the failed ones.

This would have helped us a lot, because we could have kept our service
consistent in spite of the failed nodes.

Any advice?

Thanks in advance!






*JULIÁN BERMEJO FERREIRO*
*Technology Department*
*[email protected] <[email protected]>*
<http://www.beeva.com/>
