+1 for that theory, we had some screwy issues when we tried to span
subnets until we set every slave and master
to listen on a specific IP so we could tie down routing correctly.

Saw very similar symptoms that have been described.

On 18 April 2016 at 18:35, Alex Rukletsov <[email protected]> wrote:
> I believe it's because slaves are able to connect to the master, but the
> master is not able to connect to the slaves. That's why you see them
> connected for some time and gone afterwards.
>
> On Mon, Apr 18, 2016 at 6:47 PM, Stefano Bianchi <[email protected]>
> wrote:
>>
>> Indeed, i dont know why, i am not able to reach all the machines from a
>> network to the other, just some machines can interconnect with some others
>> among the networks.
>> On mesos i see that all the slaves at a certain time are all connected,
>> then disconnected and after a while connected again, it seems like they are
>> able to connect for a while.
>> However is an openstack issue i guess.
>>
>> Does this also happen when master3 is leading? My guess is that you're not
>> allowong incoming connections from master1 and master2 to slave3. Generally,
>> masters should be able to connect to slaves, not just respond to their
>> requests.
>>
>> On 18 Apr 2016 13:17, "Stefano Bianchi" <[email protected]> wrote:
>>>
>>> Hi
>>> On openstack i plugged two virtual networks to the same virtual router so
>>> that the hosts on the 2 networks can communicate each other.
>>> this is my topology:
>>>
>>> -----------------------internet-----------------------
>>>                                 |
>>>                            Router1
>>>                                 |
>>> --------------------------------------------------------
>>> |                                                                 |
>>> Net1                                                        Net2
>>> Master1 Master2                                     Master3
>>> Slave1 slave2                                          Slave3
>>>
>>> I have set zookeeper in with this line:
>>>
>>> zk://Master1_IP:2181,Master2_IP:2181,Master3_IP:2181/mesos
>>>
>>> The 3 masters, even though on 2 separated networks, elect the leader
>>> correclty.
>>> Now i have started the slaves, and in a first time i see all 3 correctly
>>> registered, but after a while the slave 3, independently form who is the
>>> master, disconnects.
>>> I saw in the log and i get the message in the object.
>>> Can you help me to solve this problem?
>>>
>>>
>>> Thanks to all.
>
>

Reply via email to