[jira] [Created] (MESOS-9918) Agent fails to scale many tasks/containers with command health checks

2019-07-31 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9918: Summary: Agent fails to scale many tasks/containers with command health checks Key: MESOS-9918 URL: https://issues.apache.org/jira/browse/MESOS-9918 Project: Mesos

[jira] [Assigned] (MESOS-9845) Add docs for automatic agent draining

2019-07-31 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9845: Assignee: Greg Mann > Add docs for automatic agent draining >

[jira] [Created] (MESOS-9907) Retain agent draining start time in master

2019-07-25 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9907: Summary: Retain agent draining start time in master Key: MESOS-9907 URL: https://issues.apache.org/jira/browse/MESOS-9907 Project: Mesos Issue Type: Task C

[jira] [Created] (MESOS-9892) Test various agent state transitions involving agent draining

2019-07-15 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9892: Summary: Test various agent state transitions involving agent draining Key: MESOS-9892 URL: https://issues.apache.org/jira/browse/MESOS-9892 Project: Mesos Issue Ty

[jira] [Created] (MESOS-9891) Add AGENT_UPDATED event with drain state.

2019-07-12 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9891: Summary: Add AGENT_UPDATED event with drain state. Key: MESOS-9891 URL: https://issues.apache.org/jira/browse/MESOS-9891 Project: Mesos Issue Type: Task

[jira] [Assigned] (MESOS-9816) Add draining state information to master event stream and state endpoints

2019-07-12 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9816: Assignee: Greg Mann (was: Joseph Wu) > Add draining state information to master event stream and

[jira] [Created] (MESOS-9884) Retain entire SlaveInfo for unreachable agents

2019-07-09 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9884: Summary: Retain entire SlaveInfo for unreachable agents Key: MESOS-9884 URL: https://issues.apache.org/jira/browse/MESOS-9884 Project: Mesos Issue Type: Task

[jira] [Assigned] (MESOS-9816) Add draining state information to master event stream and state endpoints

2019-07-03 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9816: Assignee: Joseph Wu > Add draining state information to master event stream and state endpoints >

[jira] [Commented] (MESOS-9875) Mesos did not respond correctly when operations should fail

2019-07-02 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16877190#comment-16877190 ] Greg Mann commented on MESOS-9875: -- Perhaps we can fix this in the short-term by simply

[jira] [Commented] (MESOS-9818) Implement minimal agent-side draining handler

2019-06-28 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16875328#comment-16875328 ] Greg Mann commented on MESOS-9818: -- {code} commit b2cfcecb34e76b9a5380bf27f8a2b650510ed1

[jira] [Commented] (MESOS-9814) Implement DrainAgent master/operator call with associated registry actions

2019-06-28 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16875327#comment-16875327 ] Greg Mann commented on MESOS-9814: -- {code} commit 8a85efb877f50c793954e06b2a6db3f4d30490

[jira] [Created] (MESOS-9869) Reject master API operations when agent is draining

2019-06-27 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9869: Summary: Reject master API operations when agent is draining Key: MESOS-9869 URL: https://issues.apache.org/jira/browse/MESOS-9869 Project: Mesos Issue Type: Task

[jira] [Assigned] (MESOS-9853) Update Docker executor to allow kill policy overrides

2019-06-25 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9853: Assignee: Greg Mann > Update Docker executor to allow kill policy overrides >

[jira] [Assigned] (MESOS-9860) Agent should erase DrainInfo when draining complete

2019-06-24 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9860: Assignee: Benjamin Bannier > Agent should erase DrainInfo when draining complete > ---

[jira] [Created] (MESOS-9860) Agent should erase DrainInfo when draining complete

2019-06-24 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9860: Summary: Agent should erase DrainInfo when draining complete Key: MESOS-9860 URL: https://issues.apache.org/jira/browse/MESOS-9860 Project: Mesos Issue Type: Task

[jira] [Created] (MESOS-9853) Update Docker executor to allow kill policy overrides

2019-06-19 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9853: Summary: Update Docker executor to allow kill policy overrides Key: MESOS-9853 URL: https://issues.apache.org/jira/browse/MESOS-9853 Project: Mesos Issue Type: Task

[jira] [Created] (MESOS-9846) Update UI for agent draining

2019-06-13 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9846: Summary: Update UI for agent draining Key: MESOS-9846 URL: https://issues.apache.org/jira/browse/MESOS-9846 Project: Mesos Issue Type: Task Components: web

[jira] [Created] (MESOS-9845) Add docs for automatic agent draining

2019-06-13 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9845: Summary: Add docs for automatic agent draining Key: MESOS-9845 URL: https://issues.apache.org/jira/browse/MESOS-9845 Project: Mesos Issue Type: Task Compon

[jira] [Assigned] (MESOS-9821) Agent kills all tasks when draining

2019-06-12 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9821: Assignee: Greg Mann > Agent kills all tasks when draining > --- >

[jira] [Comment Edited] (MESOS-9818) Implement minimal agent-side draining handler

2019-06-12 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861491#comment-16861491 ] Greg Mann edited comment on MESOS-9818 at 6/12/19 8:13 AM: --- Rev

[jira] [Assigned] (MESOS-9818) Implement minimal agent-side draining handler

2019-06-10 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9818: Assignee: Greg Mann > Implement minimal agent-side draining handler >

[jira] [Assigned] (MESOS-9814) Implement DrainAgent master/operator call with associated registry actions

2019-06-10 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9814: Assignee: Joseph Wu > Implement DrainAgent master/operator call with associated registry actions >

[jira] [Created] (MESOS-9823) Agent should modify status updates while draining

2019-06-05 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9823: Summary: Agent should modify status updates while draining Key: MESOS-9823 URL: https://issues.apache.org/jira/browse/MESOS-9823 Project: Mesos Issue Type: Task

[jira] [Created] (MESOS-9822) Agent recovery code for task draining

2019-06-05 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9822: Summary: Agent recovery code for task draining Key: MESOS-9822 URL: https://issues.apache.org/jira/browse/MESOS-9822 Project: Mesos Issue Type: Task Compon

[jira] [Created] (MESOS-9821) Agent kills all tasks when draining

2019-06-05 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9821: Summary: Agent kills all tasks when draining Key: MESOS-9821 URL: https://issues.apache.org/jira/browse/MESOS-9821 Project: Mesos Issue Type: Task Componen

[jira] [Created] (MESOS-9819) Update the agent's behavior when marked GONE

2019-06-04 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9819: Summary: Update the agent's behavior when marked GONE Key: MESOS-9819 URL: https://issues.apache.org/jira/browse/MESOS-9819 Project: Mesos Issue Type: Task

[jira] [Created] (MESOS-9818) Implement agent-side handling of automatic draining

2019-06-04 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9818: Summary: Implement agent-side handling of automatic draining Key: MESOS-9818 URL: https://issues.apache.org/jira/browse/MESOS-9818 Project: Mesos Issue Type: Task

[jira] [Created] (MESOS-9815) Deprecate maintenance primitives

2019-06-04 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9815: Summary: Deprecate maintenance primitives Key: MESOS-9815 URL: https://issues.apache.org/jira/browse/MESOS-9815 Project: Mesos Issue Type: Task Components:

[jira] [Assigned] (MESOS-9509) Benchmark command health checks in default executor

2019-05-20 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9509: Assignee: Joseph Wu (was: Gastón Kleiman) > Benchmark command health checks in default executor >

[jira] [Created] (MESOS-9774) Design server-side SSL cert verification

2019-05-08 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9774: Summary: Design server-side SSL cert verification Key: MESOS-9774 URL: https://issues.apache.org/jira/browse/MESOS-9774 Project: Mesos Issue Type: Task R

[jira] [Assigned] (MESOS-9754) Design doc for agent draining

2019-05-08 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9754: Assignee: Greg Mann > Design doc for agent draining > - > >

[jira] [Commented] (MESOS-9767) Add self health monitoring in Mesos master

2019-05-06 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16834118#comment-16834118 ] Greg Mann commented on MESOS-9767: -- [~ggarg] thanks for the info! Did the master respond

[jira] [Commented] (MESOS-9698) DroppedOperationStatusUpdate test is flaky

2019-05-01 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16831308#comment-16831308 ] Greg Mann commented on MESOS-9698: -- It looks like this failure occurs because two {{Reco

[jira] [Created] (MESOS-9754) Design doc for agent draining

2019-05-01 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9754: Summary: Design doc for agent draining Key: MESOS-9754 URL: https://issues.apache.org/jira/browse/MESOS-9754 Project: Mesos Issue Type: Task Reporter: Gr

[jira] [Created] (MESOS-9753) Agent Draining

2019-05-01 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9753: Summary: Agent Draining Key: MESOS-9753 URL: https://issues.apache.org/jira/browse/MESOS-9753 Project: Mesos Issue Type: Epic Reporter: Greg Mann This

[jira] [Assigned] (MESOS-9698) DroppedOperationStatusUpdate test is flaky

2019-04-29 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9698: Assignee: Greg Mann Sprint: Mesos Foundations: RI13 Sp 45 > DroppedOperationStatusUpdate tes

[jira] [Commented] (MESOS-9609) Master check failure when marking agent unreachable

2019-04-26 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16827273#comment-16827273 ] Greg Mann commented on MESOS-9609: -- I've been unable to reproduce this issue, or identif

[jira] [Commented] (MESOS-9739) When recovered agent marked gone, retain agent ID

2019-04-26 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16827228#comment-16827228 ] Greg Mann commented on MESOS-9739: -- True we would have responded to reconciliation reque

[jira] [Commented] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-26 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16827105#comment-16827105 ] Greg Mann commented on MESOS-9619: -- 1.5.x branch: {code} commit 05c4b09299a7a803e66083eb

[jira] [Commented] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-25 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16826548#comment-16826548 ] Greg Mann commented on MESOS-9619: -- 1.6.x branch: {code} commit f7e3a8ee649424ef72676c12

[jira] [Commented] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-25 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16826532#comment-16826532 ] Greg Mann commented on MESOS-9619: -- 1.7.x branch: {code} commit 206e44006540a4d9487da1cb

[jira] [Created] (MESOS-9741) Test `SlaveRecoveryTest.AgentReconfigurationWithRunningTask` is flaky.

2019-04-23 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9741: Summary: Test `SlaveRecoveryTest.AgentReconfigurationWithRunningTask` is flaky. Key: MESOS-9741 URL: https://issues.apache.org/jira/browse/MESOS-9741 Project: Mesos

[jira] [Created] (MESOS-9739) When recovered agent marked gone, retain agent ID

2019-04-23 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9739: Summary: When recovered agent marked gone, retain agent ID Key: MESOS-9739 URL: https://issues.apache.org/jira/browse/MESOS-9739 Project: Mesos Issue Type: Improveme

[jira] [Comment Edited] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-23 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823580#comment-16823580 ] Greg Mann edited comment on MESOS-9619 at 4/23/19 11:11 PM: 1

[jira] [Commented] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-22 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823580#comment-16823580 ] Greg Mann commented on MESOS-9619: -- Backports forthcoming > Mesos Master Crashes wi

[jira] [Commented] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-22 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16823576#comment-16823576 ] Greg Mann commented on MESOS-9619: -- On master branch: {code} commit cbae57b7e790b8b46c79

[jira] [Created] (MESOS-9735) Migrate master metrics to PushGauge

2019-04-22 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9735: Summary: Migrate master metrics to PushGauge Key: MESOS-9735 URL: https://issues.apache.org/jira/browse/MESOS-9735 Project: Mesos Issue Type: Task Affects Versio

[jira] [Comment Edited] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-22 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16816794#comment-16816794 ] Greg Mann edited comment on MESOS-9619 at 4/22/19 6:18 PM: --- Rev

[jira] [Comment Edited] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-21 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16816794#comment-16816794 ] Greg Mann edited comment on MESOS-9619 at 4/22/19 3:57 AM: --- Rev

[jira] [Comment Edited] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-19 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16816794#comment-16816794 ] Greg Mann edited comment on MESOS-9619 at 4/19/19 7:57 AM: --- Rev

[jira] [Comment Edited] (MESOS-9609) Master check failure when marking agent unreachable

2019-04-18 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16821377#comment-16821377 ] Greg Mann edited comment on MESOS-9609 at 4/18/19 6:15 PM: --- Loo

[jira] [Commented] (MESOS-9609) Master check failure when marking agent unreachable

2019-04-18 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16821377#comment-16821377 ] Greg Mann commented on MESOS-9609: -- Looks like {{Master::_markUnreachable()}} is execute

[jira] [Assigned] (MESOS-9609) Master check failure when marking agent unreachable

2019-04-10 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9609: Assignee: Greg Mann > Master check failure when marking agent unreachable > --

[jira] [Assigned] (MESOS-9619) Mesos Master Crashes with Launch Group when using Port Resources

2019-04-10 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9619: Assignee: Greg Mann > Mesos Master Crashes with Launch Group when using Port Resources > -

[jira] [Assigned] (MESOS-9545) Marking an unreachable agent as gone should transition the tasks to terminal state

2019-04-10 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9545: Assignee: Greg Mann (was: Andrei Sekretenko) > Marking an unreachable agent as gone should transi

[jira] [Created] (MESOS-9709) Docker executor can become stuck terminating

2019-04-08 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9709: Summary: Docker executor can become stuck terminating Key: MESOS-9709 URL: https://issues.apache.org/jira/browse/MESOS-9709 Project: Mesos Issue Type: Bug Affect

[jira] [Created] (MESOS-9705) Operation Feedback Improvements

2019-04-05 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9705: Summary: Operation Feedback Improvements Key: MESOS-9705 URL: https://issues.apache.org/jira/browse/MESOS-9705 Project: Mesos Issue Type: Epic Reporter:

[jira] [Created] (MESOS-9702) Agent's 'registered' metric reports 'true' too early

2019-04-04 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9702: Summary: Agent's 'registered' metric reports 'true' too early Key: MESOS-9702 URL: https://issues.apache.org/jira/browse/MESOS-9702 Project: Mesos Issue Type: Bug

[jira] [Created] (MESOS-9700) ROOT_CreateDestroyPersistentMountVolumeWithRecovery is flaky

2019-04-04 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9700: Summary: ROOT_CreateDestroyPersistentMountVolumeWithRecovery is flaky Key: MESOS-9700 URL: https://issues.apache.org/jira/browse/MESOS-9700 Project: Mesos Issue Typ

[jira] [Commented] (MESOS-9700) ROOT_CreateDestroyPersistentMountVolumeWithRecovery is flaky

2019-04-04 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16810204#comment-16810204 ] Greg Mann commented on MESOS-9700: -- cc [~chhsia0] > ROOT_CreateDestroyPersistentMountVo

[jira] [Assigned] (MESOS-9635) OperationReconciliationTest.AgentPendingOperationAfterMasterFailover is flaky again (3x) due to orphan operations

2019-04-03 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9635: Assignee: Greg Mann (was: Gastón Kleiman) > OperationReconciliationTest.AgentPendingOperationAfte

[jira] [Comment Edited] (MESOS-9635) OperationReconciliationTest.AgentPendingOperationAfterMasterFailover is flaky again (3x) due to orphan operations

2019-04-03 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16803066#comment-16803066 ] Greg Mann edited comment on MESOS-9635 at 4/4/19 12:11 AM: --- I t

[jira] [Assigned] (MESOS-9635) OperationReconciliationTest.AgentPendingOperationAfterMasterFailover is flaky again (3x) due to orphan operations

2019-04-01 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9635: Assignee: Gastón Kleiman (was: Greg Mann) > OperationReconciliationTest.AgentPendingOperationAfte

[jira] [Commented] (MESOS-9667) Check failure when executor for task using resource provider resources subscribes before agent is registered

2019-03-29 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16805461#comment-16805461 ] Greg Mann commented on MESOS-9667: -- [~chhsia0], regarding the third item in your list, I

[jira] [Assigned] (MESOS-8582) Add a way to make sure an agent always knows the full framework information of all frameworks executing operations on its resources

2019-03-28 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-8582: Assignee: Greg Mann > Add a way to make sure an agent always knows the full framework information

[jira] [Commented] (MESOS-8582) Add a way to make sure an agent always knows the full framework information of all frameworks executing operations on its resources

2019-03-28 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16804439#comment-16804439 ] Greg Mann commented on MESOS-8582: -- WIP patch here: https://reviews.apache.org/r/70335/

[jira] [Commented] (MESOS-9191) Docker command executor may stuck at infinite unkillable loop.

2019-03-28 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16804330#comment-16804330 ] Greg Mann commented on MESOS-9191: -- I just did some local testing which is relevant to t

[jira] [Assigned] (MESOS-2842) Master crashes when framework changes principal on re-registration

2019-03-27 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-2842: Assignee: Andrei Sekretenko > Master crashes when framework changes principal on re-registration >

[jira] [Created] (MESOS-9681) Update framework principal on re-registration

2019-03-27 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9681: Summary: Update framework principal on re-registration Key: MESOS-9681 URL: https://issues.apache.org/jira/browse/MESOS-9681 Project: Mesos Issue Type: Improvement

[jira] [Assigned] (MESOS-9635) OperationReconciliationTest.AgentPendingOperationAfterMasterFailover is flaky again (3x) due to orphan operations

2019-03-27 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9635: Assignee: Greg Mann (was: Gastón Kleiman) > OperationReconciliationTest.AgentPendingOperationAfte

[jira] [Commented] (MESOS-9635) OperationReconciliationTest.AgentPendingOperationAfterMasterFailover is flaky again (3x) due to orphan operations

2019-03-27 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16803066#comment-16803066 ] Greg Mann commented on MESOS-9635: -- I think this issue would be better addressed by allo

[jira] [Created] (MESOS-9679) Send OPERATION_GONE_BY_OPERATOR when RP is marked gone

2019-03-26 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9679: Summary: Send OPERATION_GONE_BY_OPERATOR when RP is marked gone Key: MESOS-9679 URL: https://issues.apache.org/jira/browse/MESOS-9679 Project: Mesos Issue Type: Impr

[jira] [Assigned] (MESOS-7911) Non-checkpointing framework's tasks should not be marked LOST when agent disconnects.

2019-03-25 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-7911: Assignee: (was: Benno Evers) > Non-checkpointing framework's tasks should not be marked LOST w

[jira] [Created] (MESOS-9665) MasterTest.MasterFailoverLongLivedExecutor is flaky

2019-03-20 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9665: Summary: MasterTest.MasterFailoverLongLivedExecutor is flaky Key: MESOS-9665 URL: https://issues.apache.org/jira/browse/MESOS-9665 Project: Mesos Issue Type: Bug

[jira] [Created] (MESOS-9664) UpgradeTest.RefineResourceOnOldAgent is flaky

2019-03-18 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9664: Summary: UpgradeTest.RefineResourceOnOldAgent is flaky Key: MESOS-9664 URL: https://issues.apache.org/jira/browse/MESOS-9664 Project: Mesos Issue Type: Bug

[jira] [Comment Edited] (MESOS-9648) Make operation reconciliation send asynchronous updates

2019-03-13 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16791963#comment-16791963 ] Greg Mann edited comment on MESOS-9648 at 3/14/19 1:43 AM: --- Rev

[jira] [Assigned] (MESOS-9635) OperationReconciliationTest.AgentPendingOperationAfterMasterFailover is flaky again (3x) due to orphan operations

2019-03-13 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9635: Assignee: Gastón Kleiman (was: Joseph Wu) > OperationReconciliationTest.AgentPendingOperationAfte

[jira] [Assigned] (MESOS-9318) Consider providing better operation status updates while an RP is recovering

2019-03-13 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9318: Assignee: Greg Mann > Consider providing better operation status updates while an RP is recovering

[jira] [Assigned] (MESOS-9648) Make operation reconciliation send asynchronous updates

2019-03-13 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9648: Assignee: Greg Mann > Make operation reconciliation send asynchronous updates > --

[jira] [Created] (MESOS-9649) Recover frameworks from reregistered agents with operations

2019-03-11 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9649: Summary: Recover frameworks from reregistered agents with operations Key: MESOS-9649 URL: https://issues.apache.org/jira/browse/MESOS-9649 Project: Mesos Issue Type

[jira] [Created] (MESOS-9648) Make operation reconciliation send asynchronous updates

2019-03-11 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9648: Summary: Make operation reconciliation send asynchronous updates Key: MESOS-9648 URL: https://issues.apache.org/jira/browse/MESOS-9648 Project: Mesos Issue Type: Tas

[jira] [Commented] (MESOS-9460) Speculative operations may make master and allocator resource views out of sync.

2019-03-07 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16787423#comment-16787423 ] Greg Mann commented on MESOS-9460: -- Sharing some learnings here after a couple difficult

[jira] [Commented] (MESOS-9460) Speculative operations may make master and allocator resource views out of sync.

2019-03-06 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16786243#comment-16786243 ] Greg Mann commented on MESOS-9460: -- WIP review posted here: https://reviews.apache.org/r

[jira] [Commented] (MESOS-7622) Agent can crash if a HTTP executor tries to retry subscription in running state.

2019-03-06 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16785992#comment-16785992 ] Greg Mann commented on MESOS-7622: -- [~aaron.wood] is this still an issue? Is this relate

[jira] [Assigned] (MESOS-9615) Example framework for feedback on agent default resources

2019-02-27 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9615: Assignee: Benno Evers > Example framework for feedback on agent default resources > --

[jira] [Assigned] (MESOS-9610) Fetcher vulnerability - escaping from sandbox

2019-02-27 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9610: Assignee: Joseph Wu > Fetcher vulnerability - escaping from sandbox >

[jira] [Created] (MESOS-9615) Example framework for feedback on agent default resources

2019-02-27 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9615: Summary: Example framework for feedback on agent default resources Key: MESOS-9615 URL: https://issues.apache.org/jira/browse/MESOS-9615 Project: Mesos Issue Type: T

[jira] [Assigned] (MESOS-8241) Add metrics for offer operation feedback

2019-02-27 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-8241: Assignee: Benno Evers > Add metrics for offer operation feedback > ---

[jira] [Created] (MESOS-9609) Master check failure when marking agent unreachable

2019-02-25 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9609: Summary: Master check failure when marking agent unreachable Key: MESOS-9609 URL: https://issues.apache.org/jira/browse/MESOS-9609 Project: Mesos Issue Type: Bug

[jira] [Commented] (MESOS-7568) Introduce a heartbeat mechanism for v0 executor <-> agent links.

2019-02-20 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16773263#comment-16773263 ] Greg Mann commented on MESOS-7568: -- [~kaysoky] is this still an issue for v0 executors?

[jira] [Created] (MESOS-9589) AgentContainerAPITest.NestedContainerIdempotentLaunch is flaky

2019-02-20 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9589: Summary: AgentContainerAPITest.NestedContainerIdempotentLaunch is flaky Key: MESOS-9589 URL: https://issues.apache.org/jira/browse/MESOS-9589 Project: Mesos Issue T

[jira] [Commented] (MESOS-9571) Release Mesos 1.6.2

2019-02-19 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16772300#comment-16772300 ] Greg Mann commented on MESOS-9571: -- Vote email for rc1 has been sent to the mailing list

[jira] [Commented] (MESOS-9191) Docker command executor may stuck at infinite unkillable loop.

2019-02-15 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16769892#comment-16769892 ] Greg Mann commented on MESOS-9191: -- Retargeted 1.6.3. > Docker command executor may stu

[jira] [Created] (MESOS-9572) Release Mesos 1.7.2

2019-02-13 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9572: Summary: Release Mesos 1.7.2 Key: MESOS-9572 URL: https://issues.apache.org/jira/browse/MESOS-9572 Project: Mesos Issue Type: Task Components: release

[jira] [Assigned] (MESOS-9571) Release Mesos 1.6.2

2019-02-13 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9571: Assignee: Greg Mann > Release Mesos 1.6.2 > --- > > Key: MESOS-957

[jira] [Created] (MESOS-9571) Release Mesos 1.6.2

2019-02-13 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9571: Summary: Release Mesos 1.6.2 Key: MESOS-9571 URL: https://issues.apache.org/jira/browse/MESOS-9571 Project: Mesos Issue Type: Task Components: release

[jira] [Assigned] (MESOS-9564) Logrotate container logger lets tasks execute arbitrary commands in the Mesos agent's namespace

2019-02-13 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Mann reassigned MESOS-9564: Shepherd: Greg Mann Assignee: Joseph Wu Sprint: Mesos Foundations RI11 Sp 40

[jira] [Created] (MESOS-9570) Manual testing of agent operation checkpointing/recovery

2019-02-13 Thread Greg Mann (JIRA)
Greg Mann created MESOS-9570: Summary: Manual testing of agent operation checkpointing/recovery Key: MESOS-9570 URL: https://issues.apache.org/jira/browse/MESOS-9570 Project: Mesos Issue Type: Ta

[jira] [Commented] (MESOS-9541) Transition agent operations to some "lost" state when the agent is removed.

2019-02-12 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16766699#comment-16766699 ] Greg Mann commented on MESOS-9541: -- After discussing this with some other committers, I

[jira] [Comment Edited] (MESOS-9535) Master should clean up operations from downgraded agents

2019-02-12 Thread Greg Mann (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16757789#comment-16757789 ] Greg Mann edited comment on MESOS-9535 at 2/13/19 2:08 AM: --- Rev

<    1   2   3   4   5   6   7   8   9   10   >