Thanks Joseph.

I saw this from mesos [doc site](http://mesos.apache.org/documentation/latest/maintenance/):

"Each machine must have at least a hostname or IP included. The hostname is not case-sensitive."

From my test, the statement above is not correct, as if I only specific the hostname or IP, it will NOT take effect for the maintenance agents.
but should specific both will OK.

On 2016年07月19日 02:17, Joseph Wu wrote:
Boxbe <https://www.boxbe.com/overview> This message is eligible for Automatic Cleanup! ([email protected]) Add cleanup rule <https://www.boxbe.com/popup?url=https%3A%2F%2Fwww.boxbe.com%2Fcleanup%3Fkey%3Dm%252B%252F9y8szBbdXKWiZ%252FDADQ0%252Fzx2OsVPpMz1%252BhAd8WOjE%253D%26token%3D7yPWMILH6f2hh7W8GLG1B4W3dWqI9yjvahQVEYFryQn3PGah0U1DPo7rfMlTIncRBOxGwo9jI4CHtQ%252BZ435zSbIfdjC1em9cdavejMkUAGEDLcp7EpoDgqU0pX3rrX3o0uawWqnSxys%253D&tc_serial=26129651012&tc_rand=629032590&utm_source=stf&utm_medium=email&utm_campaign=ANNO_CLEANUP_ADD&utm_content=001> | More info <http://blog.boxbe.com/general/boxbe-automatic-cleanup?tc_serial=26129651012&tc_rand=629032590&utm_source=stf&utm_medium=email&utm_campaign=ANNO_CLEANUP_ADD&utm_content=001>


My guess is that your agents don't match the machines you specified. Note: The maintenance endpoints in Mesos allow you to specify maintenance against non-existent machines, because the operator may add agents on those machines in future.

In Mesos' maintenance primitives, a "machine" is a hostname + IP. (A physical/virtual machine can hold multiple agents.) The response in /maintenance/status is in terms of machines, not agents. If none of your frameworks support inverse offers, then you won't get any useful information from the /maintenance/status endpoint.

You can figure out an agent's hostname/IP by hitting the /master/slaves endpoint:
{
   "slaves": [
     {
       "pid":"slave(1)@127.0.0.1:5051 <http://127.0.0.1:5051>",
       "hostname":"foo-bar",
       ...
^ The above translates to a machine = { "hostname": "foo-bar", "ip" : " 127.0.0.1" }

On Mon, Jul 18, 2016 at 2:08 AM, Qiang Chen <[email protected] <mailto:[email protected]>> wrote:

    Hi all,

    I'm puzzled in using maintenance mode.

    I see this from mesos [doc
    site](http://mesos.apache.org/documentation/latest/maintenance/):

    ```
    When maintenance is triggered by the operator, all agents on the
    machine are told to shutdown. These agents are removed from the
    master, which means that a |TASK_LOST| status update will be sent
    for every task running on each of those agents. The scheduler
    driver’s |slaveLost| callback will also be invoked for each of the
    removed agents. Any agents on machines in maintenance are also
    prevented from re-registering with the master in the future (until
    maintenance is completed and the machine is brought back up).
    ```
    But I didn't find the agent machine shutdown or task failed when I
    test the maintenance HTTP endpoints.

    If mesos agents are in that mode will move the running tasks to
    other agents? namely, it will evacuate all the tasks in those
    agents? and the shutdown?

    When I POST "/maintenance/schedule" and "/machine/down" and give a
    proper maintain time window. I got the response that those
    specified agents are in the "draining_machines" and
    "down_machines" list by GET "/maintenance/status", but didn't
    shutdown and evacuate any tasks, why ? does it make sense?

    Thanks.

-- Best Regards,
    Chen, Qiang



--
Best Regards,
Chen, Qiang

Reply via email to