On Thu, Jul 5, 2018 at 10:45 AM, Tom Pantelis <tompante...@gmail.com> wrote:
> On Thu, Jul 5, 2018 at 1:42 PM, Michael Vorburger <vorbur...@redhat.com>
> wrote:
>
>> On Thu, Jul 5, 2018 at 7:39 PM, Tom Pantelis <tompante...@gmail.com>
>> wrote:
>>
>>> On Thu, Jul 5, 2018 at 1:35 PM, Michael Vorburger <vorbur...@redhat.com>
>>> wrote:
>>>
>>>> Tom, or Robert, or anyone else having hit this themselves,
>>>>
>>>> would you be able to remind us what in clustering can cause an ODL
>>>> abrupt restart - System.exit() via bundleContext.getBundle(0).stop();
>>>> from https://github.com/opendaylight/controller/blob/master/opendaylight/md-sal/sal-distributed-datastore/src/main/java/org/opendaylight/controller/cluster/akka/osgi/impl/QuarantinedMonitorActorPropsFactory.java ?
>>>>
>>>> I do vaguely recall an "inconsistent cluster" leading to this - clarify
>>>> exactly what situation leads to that? Loss of leader? Loss of majority?
>>>>
>>>> asking for https://bugzilla.redhat.com/show_bug.cgi?id=1597304 ...
>>>>
>>>
>>> That happens when Akka quarantines a node - it can no longer rejoin the
>>> majority cluster unless the actor system is restarted, hence we restart the
>>> whole JVM.
>>>
>>
>> and what can cause Akka to have to quarantine a node?
>>
>
> An unrecoverable failure state - see
> https://livingston.io/understanding-akkas-quarantine-state/ for more detail.
>

The most common cause is nodes getting isolated for a considerable amount of time.

> _______________________________________________
> controller-dev mailing list
> controller-dev@lists.opendaylight.org
> https://lists.opendaylight.org/mailman/listinfo/controller-dev
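
For anyone following along, here is a rough sketch of the "quarantined, so restart" pattern discussed above. It is plain Java with no Akka dependency and is not the actual OpenDaylight code: QuarantineMonitorSketch, QuarantinedEvent and onMessage are hypothetical names made up for illustration. In Akka classic remoting the corresponding event is, as far as I know, akka.remote.ThisActorSystemQuarantinedEvent. The fire-once guard is my own assumption, not something confirmed in the thread.

```java
/**
 * Illustrative stand-in for a quarantine monitor. NOT the real OpenDaylight
 * class; all names here are hypothetical.
 */
public class QuarantineMonitorSketch {

    /** Hypothetical event meaning "a remote node has quarantined this node". */
    public static final class QuarantinedEvent {
        private final String remoteAddress;

        public QuarantinedEvent(String remoteAddress) {
            this.remoteAddress = remoteAddress;
        }

        public String remoteAddress() {
            return remoteAddress;
        }
    }

    // Action to take once quarantined, e.g. stopping the container.
    private final Runnable restartCallback;
    private boolean restartRequested;

    public QuarantineMonitorSketch(Runnable restartCallback) {
        this.restartCallback = restartCallback;
    }

    /**
     * Handles one incoming message. Only quarantine events trigger the
     * restart callback, and (by assumption) only the first one does.
     */
    public synchronized void onMessage(Object message) {
        if (message instanceof QuarantinedEvent && !restartRequested) {
            restartRequested = true;
            restartCallback.run();
        }
    }

    public synchronized boolean isRestartRequested() {
        return restartRequested;
    }
}
```

In the OSGi deployment the callback would be something along the lines of the bundleContext.getBundle(0).stop() call mentioned earlier in the thread, which stops the framework so the whole JVM comes back with a fresh actor system.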