Priyanka Gupta created MESOS-5193:
-------------------------------------

             Summary: Recovery failed: Failed to recover registrar on reboot of 
mesos master
                 Key: MESOS-5193
                 URL: https://issues.apache.org/jira/browse/MESOS-5193
             Project: Mesos
          Issue Type: Bug
          Components: master
    Affects Versions: 0.27.0, 0.22.0
            Reporter: Priyanka Gupta


Hi all, 

We are using a 3 node cluster with mesos master, mesos slave and zookeeper on 
all of them. We are using chronos on top of it. The problem is when we reboot 
the mesos master leader, the other nodes try to get elected as leader but fail 
with recovery registrar issue. 
"Recovery failed: Failed to recover registrar: Failed to perform fetch within 
1mins"

The next node then try to become the leader but again fails with same error. I 
am not sure about the issue. We are currently using mesos 0.22 and also tried 
to upgrade to mesos 0.27 as well but the problem continues to happen. 

 /usr/sbin/mesos-master --work_dir=/tmp/mesos_dir 
--zk=zk://node1:2181,node2:2181,node3:2181/mesos --quorum=2

Can you please help us resolve this issue as its a production system.

Thanks,
Priyanka



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to