Hello Mesos-Users,

I have set up a 3-node Mesos cluster on Ubuntu 14.04 and started 
ZooKeeper, Mesos and Marathon. Everything is working fine except for three things.

1) When I restart the whole cluster, Mesos does not show completed tasks. Is this 
expected behaviour? If not, what should I do?

2) In the Mesos web UI I am not able to see the 
staged/started/finished/killed/failed/lost task counts, even when tasks are 
running.

3) Every ZooKeeper instance throws this exception regularly:

 2015-10-09 17:27:26,302 [myid:3] - WARN  [SendWorker:1:QuorumCnxManager$SendWorker@679] - Interrupted while waiting for message on queue
java.lang.InterruptedException
        at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.reportInterruptAfterWait(AbstractQueuedSynchronizer.java:2014)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2088)
        at java.util.concurrent.ArrayBlockingQueue.poll(ArrayBlockingQueue.java:418)
        at org.apache.zookeeper.server.quorum.QuorumCnxManager.pollSendQueue(QuorumCnxManager.java:831)
        at org.apache.zookeeper.server.quorum.QuorumCnxManager.access$500(QuorumCnxManager.java:62)
        at org.apache.zookeeper.server.quorum.QuorumCnxManager$SendWorker.run(QuorumCnxManager.java:667)




Here is my Mesos-master configuration:

mesos master --ip=10.1.0.72  --work_dir=/var/lib/mesos-master 
--zk=file:///etc/mesos/conf/zk --quorum=file:///etc/mesos/conf/quorum
        
        Where zk=zk://zoo.service.consul:2181/mesos
              quorum=2
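
As a sanity check on the quorum value, here is a minimal sketch of the majority rule, assuming each of the three nodes runs a Mesos master (only one master command is shown above, so that is an assumption). The function name majority_quorum is just for illustration:

```python
def majority_quorum(num_masters: int) -> int:
    """Smallest number of masters that forms a strict majority."""
    return num_masters // 2 + 1


# Assumption: all three nodes run a Mesos master.
masters = 3
# Value stored in /etc/mesos/conf/quorum in this setup.
configured_quorum = 2

# True when the configured quorum matches the majority of the masters.
print(majority_quorum(masters) == configured_quorum)
```

With three masters the majority is 2, so quorum=2 looks consistent with that assumption.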
              


Mesos-Slave Configuration:

mesos slave --work_dir=/var/lib/mesos-slave --ip=10.1.0.72 --hostname=10.1.0.72 
--strict=false  --master=file:///etc/mesos/conf/master 
FrameworkInfo.checkpoint=True
      


Marathon Configuration:

java -jar /opt/marathon.jar  --master zk://zoo.service.consul:2181/mesos  --zk 
zk://zoo.service.consul:2181/marathon  --ha --hostname 10.1.0.72  --checkpoint




ZooKeeper configuration (running on Java version "1.8.0_45"):



dataDir=/var/lib/zookeeper
clientPort=2181
tickTime=2000
initLimit=10
syncLimit=20


autopurge.purgeInterval=0


zookeeper.connection.timeout.ms=6000
server.1=10.1.0.70:2888:3888
server.2=10.1.0.71:2888:3888
server.3=10.1.0.72:2888:3888

Each server has been given a different myid.
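
In case it helps when reading the settings above, this is a rough sketch of how I understand the timing values: initLimit and syncLimit are counted in ticks, so the effective timeouts are tickTime multiplied by each limit. The helper parse_zoo_cfg and the ZOO_CFG string are only for illustration and simply repeat the values from my config:

```python
def parse_zoo_cfg(text):
    """Parse key=value lines, skipping blanks and # comments."""
    cfg = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        cfg[key.strip()] = value.strip()
    return cfg


# Values copied from the configuration in this post.
ZOO_CFG = """
dataDir=/var/lib/zookeeper
clientPort=2181
tickTime=2000
initLimit=10
syncLimit=20
autopurge.purgeInterval=0
server.1=10.1.0.70:2888:3888
server.2=10.1.0.71:2888:3888
server.3=10.1.0.72:2888:3888
"""

cfg = parse_zoo_cfg(ZOO_CFG)
tick_ms = int(cfg["tickTime"])

# initLimit and syncLimit are expressed in ticks, not milliseconds.
init_timeout_ms = tick_ms * int(cfg["initLimit"])
sync_timeout_ms = tick_ms * int(cfg["syncLimit"])
servers = sorted(key for key in cfg if key.startswith("server."))

print("init timeout (ms):", init_timeout_ms)
print("sync timeout (ms):", sync_timeout_ms)
print("ensemble size:", len(servers))
```

So followers get 20 seconds to connect and sync with the leader initially, and may fall up to 40 seconds behind before being dropped.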


Can anyone help?

