On 2016年05月25日 10:11, haosdent wrote:
> but come to step 3, why the running containers become KILLED or LOST ?
If you removed the work_dir of slave, or some break changes make slave
could not recover those containers successfully, they would be killed.
we executed "rm -f /data/mesos/meta/slaves/latest" and restart docker
services before installing new mesos version to avoid mesos-slave fails
starting for all mesos-agents. does that make the KILLED or LOST ? or it
need restart docker service in the process of upgrading mesos ?
>But why suppose upgrade mesos-master before mesos-slave in the Apach
Mesos official site ? there any good idea to do that ?
I think most contributors suppose user upgrade mesos-master first when
introduce some break changes into Mesos.
For example, we updated the MasterInfo format from protobuf to json
before.
What we do at that time is make the new version of Mesos Master
suppose both protobuf and json, so no matter old slave or new slave
could communicate with Master successfully.
But if you update slave in this case, new slave suppose the format of
MasterInfo is json and when it communicate with the old Master,
something wrong may happen....
it makes sense for the explanation, but we met the case when upgrading
mesos from mesos 0.21.0 to 0.25.0 before, the communication between each
other the status as belows:
1. mesos-master 0.21.0 with mesos-slave 0.25.0 , they work good, can
communicate with each other.
2. mesos-master 0.25.0 with mesos-slave 0.21.0, will see no slaves are
listed in the mesos ui (<master:5050>), but the slaves with old version
are running in the backend.
so, we upgrade mesos-slave first, then mesos-master. but what's cause
the result in case2 ?
On Tue, May 24, 2016 at 6:43 PM, Qiang Chen <[email protected]
<mailto:[email protected]>> wrote:
Hi haosdent,
I have 2 questions.
Q1. When I upgrade a mesos cluster by following steps:
1. stop mesos-slave and uninstall mesos old version
2. install mesos new version
3. start mesos-slave
In step 1 & 2, the docker containers run in the mesos-agent keeps
running..
but come to step 3, why the running containers become KILLED or LOST ?
So it will affect the running tasks when upgrade that slave,
although it will reschedule and starts all the tasks to other
slaves in the upgrade process.
But I think it shouldn't affect running tasks in the process of
upgarding... it's make sense ? any good suggestion when upgrading ?
Q2. I think the upgrade sequence is as belows:
1. upgrade mesos-slave
2. upgrade mesos-master
3. updrade frameworks to adapt the new mesos version
But why suppose upgrade mesos-master before mesos-slave in the
Apach Mesos official site ? there any good idea to do that ?
Thanks very much.
--
Best Regards,
Chen, Qiang
--
Best Regards,
Haosdent Huang
--
Best Regards,
Chen, Qiang