we restart nodes 1 by 1 based on what I said earlier. We offset the restarts so we dont actually take the whole cluster down ...
On Fri, Jul 30, 2021 at 9:00 AM Juan Pablo Gardella < gardellajuanpa...@gmail.com> wrote: > Thanks Chris, > > I am not very clear about the approach to restart the entire cluster using > systemd. Are you operating at node level using systemd if I understand > correctly, how you will restart the cluster? > > Thanks, > Juan > > On Fri, 30 Jul 2021 at 10:02, Chris McKeever <cgmckee...@gmail.com> wrote: > >> We start it with systemd, and have timed jobs that offset stop nodes -- >> systemd then sees the failed job and spins it back up >> >> On Fri, Jul 30, 2021 at 7:54 AM Juan Pablo Gardella < >> gardellajuanpa...@gmail.com> wrote: >> >>> Hi devs, >>> >>> Which mechanism are you using to restart the cluster? Apart from using >>> ambari, any other suggestions? >>> >>> Thanks, >>> Juan >>> >>