[jira] [Created] (MESOS-10158) Mesos Agent gets stuck in Draining due to pending unacknowledged status updates

2020-07-07 Thread Andrei Budnik (Jira)
Andrei Budnik created MESOS-10158: - Summary: Mesos Agent gets stuck in Draining due to pending unacknowledged status updates Key: MESOS-10158 URL: https://issues.apache.org/jira/browse/MESOS-10158

[jira] [Assigned] (MESOS-7485) Add verbose logging for curl commands used in fetcher/puller

2020-06-10 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-7485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-7485: Assignee: Andrei Budnik > Add verbose logging for curl commands used in fetcher/puller >

[jira] [Comment Edited] (MESOS-10131) Agent frequently dies with error "Cycle found in mount table hierarchy"

2020-05-29 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17119576#comment-17119576 ] Andrei Budnik edited comment on MESOS-10131 at 5/29/20, 1:04 PM: - Please

[jira] [Commented] (MESOS-10131) Agent frequently dies with error "Cycle found in mount table hierarchy"

2020-05-29 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17119576#comment-17119576 ] Andrei Budnik commented on MESOS-10131: --- Please keep posting error messages on agent crash.

[jira] [Comment Edited] (MESOS-10131) Agent frequently dies with error "Cycle found in mount table hierarchy"

2020-05-28 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17118896#comment-17118896 ] Andrei Budnik edited comment on MESOS-10131 at 5/28/20, 5:21 PM: - I

[jira] [Commented] (MESOS-10131) Agent frequently dies with error "Cycle found in mount table hierarchy"

2020-05-28 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17118896#comment-17118896 ] Andrei Budnik commented on MESOS-10131: --- I think the message containing the whole mount table is

[jira] [Commented] (MESOS-10131) Agent frequently dies with error "Cycle found in mount table hierarchy"

2020-05-27 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17117903#comment-17117903 ] Andrei Budnik commented on MESOS-10131: --- [~tomplummer] It seems that the tail of the log message

[jira] [Commented] (MESOS-10131) Agent frequently dies with error "Cycle found in mount table hierarchy"

2020-05-27 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17117892#comment-17117892 ] Andrei Budnik commented on MESOS-10131: --- Mount table without extra newlines: {code:java} 18 41

[jira] [Commented] (MESOS-10131) Agent frequently dies with error "Cycle found in mount table hierarchy"

2020-05-27 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17117873#comment-17117873 ] Andrei Budnik commented on MESOS-10131: --- I've copy-pasted the mount table from the log excerpt

[jira] [Assigned] (MESOS-10131) Agent frequently dies with error "Cycle found in mount table hierarchy"

2020-05-27 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-10131: - Assignee: Andrei Budnik > Agent frequently dies with error "Cycle found in mount table

[jira] [Commented] (MESOS-10107) containeriser: failed to remove cgroup - EBUSY

2020-05-07 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17101572#comment-17101572 ] Andrei Budnik commented on MESOS-10107: --- {code:java} commit

[jira] [Commented] (MESOS-10119) failure to destroy container can cause the agent to "leak" a GPU

2020-04-21 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17088568#comment-17088568 ] Andrei Budnik commented on MESOS-10119: --- Could you reproduce the cgroups destruction problem

[jira] [Commented] (MESOS-10107) containeriser: failed to remove cgroup - EBUSY

2020-04-15 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17084136#comment-17084136 ] Andrei Budnik commented on MESOS-10107: --- {code:java} commit

[jira] [Assigned] (MESOS-10107) containeriser: failed to remove cgroup - EBUSY

2020-04-15 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-10107: - Assignee: Charles Natali > containeriser: failed to remove cgroup - EBUSY >

[jira] [Commented] (MESOS-10107) containeriser: failed to remove cgroup - EBUSY

2020-04-01 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17072680#comment-17072680 ] Andrei Budnik commented on MESOS-10107: --- Thanks for the detailed explanations! Could you please

[jira] [Deleted] (MESOS-10078) Cgroups isolator: update cgroups subsystems to support nested cgroups

2020-02-27 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik deleted MESOS-10078: -- > Cgroups isolator: update cgroups subsystems to support nested cgroups >

[jira] [Created] (MESOS-10098) Mesos agent fails to start on outdated systemd.

2020-02-24 Thread Andrei Budnik (Jira)
Andrei Budnik created MESOS-10098: - Summary: Mesos agent fails to start on outdated systemd. Key: MESOS-10098 URL: https://issues.apache.org/jira/browse/MESOS-10098 Project: Mesos Issue

[jira] [Commented] (MESOS-9853) Update Docker executor to allow kill policy overrides

2020-02-04 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17030016#comment-17030016 ] Andrei Budnik commented on MESOS-9853: -- Backported /r/71033/ ("Moved the Docker executor declaration

[jira] [Commented] (MESOS-8537) Default executor doesn't wait for status updates to be ack'd before shutting down

2020-02-03 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029065#comment-17029065 ] Andrei Budnik commented on MESOS-8537: -- 1.5.x {code:java} commit

[jira] [Comment Edited] (MESOS-9847) Docker executor doesn't wait for status updates to be ack'd before shutting down.

2020-02-03 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029047#comment-17029047 ] Andrei Budnik edited comment on MESOS-9847 at 2/3/20 3:54 PM: -- {code:java}

[jira] [Commented] (MESOS-9847) Docker executor doesn't wait for status updates to be ack'd before shutting down.

2020-02-03 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17029051#comment-17029051 ] Andrei Budnik commented on MESOS-9847: -- 1.5.x {code:java} commit

[jira] [Assigned] (MESOS-8537) Default executor doesn't wait for status updates to be ack'd before shutting down

2020-01-20 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-8537: Assignee: Andrei Budnik > Default executor doesn't wait for status updates to be ack'd

[jira] [Created] (MESOS-10080) Cgroups isolator: update cleanup logic to support nested cgroups

2019-12-23 Thread Andrei Budnik (Jira)
Andrei Budnik created MESOS-10080: - Summary: Cgroups isolator: update cleanup logic to support nested cgroups Key: MESOS-10080 URL: https://issues.apache.org/jira/browse/MESOS-10080 Project: Mesos

[jira] [Created] (MESOS-10079) Cgroups isolator: recover nested cgroups

2019-12-23 Thread Andrei Budnik (Jira)
Andrei Budnik created MESOS-10079: - Summary: Cgroups isolator: recover nested cgroups Key: MESOS-10079 URL: https://issues.apache.org/jira/browse/MESOS-10079 Project: Mesos Issue Type: Task

[jira] [Created] (MESOS-10078) Cgroups isolator: update cgroups subsystems to support nested cgroups

2019-12-23 Thread Andrei Budnik (Jira)
Andrei Budnik created MESOS-10078: - Summary: Cgroups isolator: update cgroups subsystems to support nested cgroups Key: MESOS-10078 URL: https://issues.apache.org/jira/browse/MESOS-10078 Project:

[jira] [Created] (MESOS-10077) Cgroups isolator: allow updating and isolating resources for nested cgroups

2019-12-23 Thread Andrei Budnik (Jira)
Andrei Budnik created MESOS-10077: - Summary: Cgroups isolator: allow updating and isolating resources for nested cgroups Key: MESOS-10077 URL: https://issues.apache.org/jira/browse/MESOS-10077

[jira] [Created] (MESOS-10076) Cgroups isolator: create nested cgroups

2019-12-23 Thread Andrei Budnik (Jira)
Andrei Budnik created MESOS-10076: - Summary: Cgroups isolator: create nested cgroups Key: MESOS-10076 URL: https://issues.apache.org/jira/browse/MESOS-10076 Project: Mesos Issue Type: Task

[jira] [Commented] (MESOS-10066) mesos-docker-executor process dies when agent stops. Recovery fails when agent returns

2019-12-13 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16995737#comment-16995737 ] Andrei Budnik commented on MESOS-10066: --- cc [~qianzhang] > mesos-docker-executor process dies

[jira] [Commented] (MESOS-10066) mesos-docker-executor process dies when agent stops. Recovery fails when agent returns

2019-12-06 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16989880#comment-16989880 ] Andrei Budnik commented on MESOS-10066: --- So the Docker socket is mounted from the host FS into the

[jira] [Commented] (MESOS-10066) mesos-docker-executor process dies when agent stops. Recovery fails when agent returns

2019-12-06 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16989808#comment-16989808 ] Andrei Budnik commented on MESOS-10066: --- Did you try to specify --docker_mesos_image

[jira] [Commented] (MESOS-10066) mesos-docker-executor process dies when agent stops. Recovery fails when agent returns

2019-12-06 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-10066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16989728#comment-16989728 ] Andrei Budnik commented on MESOS-10066: --- Could you please attach full agent logs? >

[jira] [Created] (MESOS-10014) `tryUntrackFrameworkUnderRole` check failed in `HierarchicalAllocatorProcess::removeFramework`.

2019-10-18 Thread Andrei Budnik (Jira)
Andrei Budnik created MESOS-10014: - Summary: `tryUntrackFrameworkUnderRole` check failed in `HierarchicalAllocatorProcess::removeFramework`. Key: MESOS-10014 URL: https://issues.apache.org/jira/browse/MESOS-10014

[jira] [Commented] (MESOS-6480) Support for docker live-restore option in Mesos

2019-10-02 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-6480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16942893#comment-16942893 ] Andrei Budnik commented on MESOS-6480: -- design doc:

[jira] [Commented] (MESOS-9843) Implement tests for the `containerizer/debug` endpoint.

2019-09-24 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936734#comment-16936734 ] Andrei Budnik commented on MESOS-9843: -- {code:java} commit dee4b849c8179ea46947c8ea4dd031f6eb37b659

[jira] [Commented] (MESOS-9969) Agent crashes when trying to clean up volue

2019-09-17 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16931645#comment-16931645 ] Andrei Budnik commented on MESOS-9969: -- Could you please provide steps to reproduce this bug? >

[jira] [Commented] (MESOS-9914) Refactor `MesosTest::StartSlave` in favour of builder style interface

2019-09-06 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16924139#comment-16924139 ] Andrei Budnik commented on MESOS-9914: -- {code:java} commit

[jira] [Commented] (MESOS-9914) Refactor `MesosTest::StartSlave` in favour of builder style interface

2019-09-02 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16920964#comment-16920964 ] Andrei Budnik commented on MESOS-9914: -- [https://reviews.apache.org/r/71424/] > Refactor

[jira] [Assigned] (MESOS-9914) Refactor `MesosTest::StartSlave` in favour of builder style interface

2019-08-29 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-9914: Assignee: Andrei Budnik > Refactor `MesosTest::StartSlave` in favour of builder style

[jira] [Commented] (MESOS-9887) Race condition between two terminal task status updates for Docker executor.

2019-08-26 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16915755#comment-16915755 ] Andrei Budnik commented on MESOS-9887: -- {code:java} commit 8aae23ec7cd4bc50532df0b1d1ea6ec23ce078f8

[jira] [Comment Edited] (MESOS-9887) Race condition between two terminal task status updates for Docker executor.

2019-08-26 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16912558#comment-16912558 ] Andrei Budnik edited comment on MESOS-9887 at 8/26/19 12:22 PM:

[jira] [Commented] (MESOS-9844) Update documentation describing `containerizer/debug` endpoint.

2019-08-22 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16913441#comment-16913441 ] Andrei Budnik commented on MESOS-9844: --

[jira] [Commented] (MESOS-9887) Race condition between two terminal task status updates for Docker executor.

2019-08-21 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16912558#comment-16912558 ] Andrei Budnik commented on MESOS-9887: -- https://reviews.apache.org/r/71343/ > Race condition

[jira] [Commented] (MESOS-9887) Race condition between two terminal task status updates for Docker executor.

2019-08-21 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16912400#comment-16912400 ] Andrei Budnik commented on MESOS-9887: -- Discarding these patches ^^ since multiple consecutive

[jira] [Commented] (MESOS-9836) Docker containerizer overwrites `/mesos/slave` cgroups.

2019-08-20 Thread Andrei Budnik (Jira)
[ https://issues.apache.org/jira/browse/MESOS-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911305#comment-16911305 ] Andrei Budnik commented on MESOS-9836: -- Shall we deprecate the option to run a custom executor in a

[jira] [Commented] (MESOS-9836) Docker containerizer overwrites `/mesos/slave` cgroups.

2019-08-15 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16908250#comment-16908250 ] Andrei Budnik commented on MESOS-9836: -- {quote} So what is the purpose of Docker containerizer's

[jira] [Commented] (MESOS-9936) Slave recovery is very slow with high local volume persistant ( marathon app )

2019-08-15 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16908065#comment-16908065 ] Andrei Budnik commented on MESOS-9936: -- How to reproduce the issue? Could you please share an app

[jira] [Commented] (MESOS-9936) Slave recovery is very slow with high local volume persistant ( marathon app )

2019-08-13 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16906165#comment-16906165 ] Andrei Budnik commented on MESOS-9936: -- [~Fcomte] what version of Mesos are you using? > Slave

[jira] [Assigned] (MESOS-9887) Race condition between two terminal task status updates for Docker executor.

2019-08-08 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-9887: Assignee: Andrei Budnik > Race condition between two terminal task status updates for

[jira] [Created] (MESOS-9926) Assertion failed in Master for `Slave::apply` while running `UnreserveVolumeResources` test.

2019-08-06 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9926: Summary: Assertion failed in Master for `Slave::apply` while running `UnreserveVolumeResources` test. Key: MESOS-9926 URL: https://issues.apache.org/jira/browse/MESOS-9926

[jira] [Created] (MESOS-9914) Refactor `MesosTest::StartSlave` in favour of builder style interface

2019-07-30 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9914: Summary: Refactor `MesosTest::StartSlave` in favour of builder style interface Key: MESOS-9914 URL: https://issues.apache.org/jira/browse/MESOS-9914 Project: Mesos

[jira] [Commented] (MESOS-9836) Docker containerizer overwrites `/mesos/slave` cgroups.

2019-07-25 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16892844#comment-16892844 ] Andrei Budnik commented on MESOS-9836: -- A typical cgroup for Docker containers looks like:

[jira] [Created] (MESOS-9887) Race condition between two terminal task status updates for Docker executor.

2019-07-10 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9887: Summary: Race condition between two terminal task status updates for Docker executor. Key: MESOS-9887 URL: https://issues.apache.org/jira/browse/MESOS-9887 Project:

[jira] [Created] (MESOS-9844) Update documentation describing `containerizer/debug` endpoint.

2019-06-12 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9844: Summary: Update documentation describing `containerizer/debug` endpoint. Key: MESOS-9844 URL: https://issues.apache.org/jira/browse/MESOS-9844 Project: Mesos

[jira] [Created] (MESOS-9843) Implement tests for the `containerizer/debug` endpoint.

2019-06-12 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9843: Summary: Implement tests for the `containerizer/debug` endpoint. Key: MESOS-9843 URL: https://issues.apache.org/jira/browse/MESOS-9843 Project: Mesos Issue

[jira] [Created] (MESOS-9842) Implement tests for the `FutureTracker` class and for its helper functions.

2019-06-12 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9842: Summary: Implement tests for the `FutureTracker` class and for its helper functions. Key: MESOS-9842 URL: https://issues.apache.org/jira/browse/MESOS-9842 Project:

[jira] [Created] (MESOS-9841) Integrate `IsolatorTracker` and `LinuxLauncher` with Mesos containerizer.

2019-06-12 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9841: Summary: Integrate `IsolatorTracker` and `LinuxLauncher` with Mesos containerizer. Key: MESOS-9841 URL: https://issues.apache.org/jira/browse/MESOS-9841 Project:

[jira] [Created] (MESOS-9840) Implement `LauncherTracker` class.

2019-06-12 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9840: Summary: Implement `LauncherTracker` class. Key: MESOS-9840 URL: https://issues.apache.org/jira/browse/MESOS-9840 Project: Mesos Issue Type: Task

[jira] [Assigned] (MESOS-9839) Implement `IsolatorTracker` class.

2019-06-12 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-9839: Assignee: Andrei Budnik Labels: containerization (was: ) Component/s:

[jira] [Created] (MESOS-9839) Implement `IsolatorTracker` class.

2019-06-12 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9839: Summary: Implement `IsolatorTracker` class. Key: MESOS-9839 URL: https://issues.apache.org/jira/browse/MESOS-9839 Project: Mesos Issue Type: Bug

[jira] [Created] (MESOS-9838) Leaked HTTP input connection between agent and IOSwitchboard when launched with TTY enabled.

2019-06-12 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9838: Summary: Leaked HTTP input connection between agent and IOSwitchboard when launched with TTY enabled. Key: MESOS-9838 URL: https://issues.apache.org/jira/browse/MESOS-9838

[jira] [Assigned] (MESOS-9837) Implement `FutureTracker` class along with helper functions.

2019-06-12 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-9837: Assignee: Andrei Budnik > Implement `FutureTracker` class along with helper functions. >

[jira] [Created] (MESOS-9837) Implement `FutureTracker` class along with helper functions.

2019-06-12 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9837: Summary: Implement `FutureTracker` class along with helper functions. Key: MESOS-9837 URL: https://issues.apache.org/jira/browse/MESOS-9837 Project: Mesos

[jira] [Assigned] (MESOS-9756) Introduce a container debug endpoint.

2019-06-12 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-9756: Assignee: Andrei Budnik > Introduce a container debug endpoint. >

[jira] [Deleted] (MESOS-9830) Implement the container debug endpoint on slave/http.cpp

2019-06-12 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik deleted MESOS-9830: - > Implement the container debug endpoint on slave/http.cpp >

[jira] [Commented] (MESOS-9800) libarchive cannot extract tarfile due to UTF-8 encoding issues

2019-05-28 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849924#comment-16849924 ] Andrei Budnik commented on MESOS-9800: -- Thanks for filing a detailed ticket! Hope [~kaysoky] might

[jira] [Comment Edited] (MESOS-9306) Mesos containerizer can get stuck during cgroup cleanup

2019-05-27 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849042#comment-16849042 ] Andrei Budnik edited comment on MESOS-9306 at 5/27/19 4:10 PM: --- The patch

[jira] [Commented] (MESOS-9306) Mesos containerizer can get stuck during cgroup cleanup

2019-05-27 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849042#comment-16849042 ] Andrei Budnik commented on MESOS-9306: -- The patch `/r/70609/` was discarded. If `cgroups::destroy`

[jira] [Commented] (MESOS-9306) Mesos containerizer can get stuck during cgroup cleanup

2019-05-08 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16835580#comment-16835580 ] Andrei Budnik commented on MESOS-9306: -- I've reproduced the timeout case for `cgroups::destroy` by

[jira] [Commented] (MESOS-9695) Remove the duplicate pid check in Docker containerizer

2019-04-30 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16830263#comment-16830263 ] Andrei Budnik commented on MESOS-9695: -- {code:java} commit c8004ee8a0962d0e0f9147718853160bb708f5bc

[jira] [Commented] (MESOS-9718) Compile failures with char8_t by MSVC under /std:c++latest(C++20) mode

2019-04-24 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16825251#comment-16825251 ] Andrei Budnik commented on MESOS-9718: -- Hi [~QuellaZhang], Just verified your patch in our internal

[jira] [Commented] (MESOS-8983) SlaveRecoveryTest/0.PingTimeoutDuringRecovery is flaky

2019-04-16 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16819281#comment-16819281 ] Andrei Budnik commented on MESOS-8983: -- This test fails pretty often on ARM. >

[jira] [Commented] (MESOS-9718) Compile failures with char8_t by MSVC under /std:c++latest mode

2019-04-10 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16814730#comment-16814730 ] Andrei Budnik commented on MESOS-9718: -- [~QuellaZhang] If you have a possible fix in mind, we could

[jira] [Commented] (MESOS-9718) Compile failures with char8_t by MSVC under /std:c++latest mode

2019-04-10 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16814322#comment-16814322 ] Andrei Budnik commented on MESOS-9718: -- [~kaysoky] what could be a possible fix or mitigation for

[jira] [Commented] (MESOS-9718) Compile failures with char8_t by MSVC under /std:c++latest mode

2019-04-10 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16814320#comment-16814320 ] Andrei Budnik commented on MESOS-9718: -- This error appeared after the following patch landed:

[jira] [Comment Edited] (MESOS-9709) Docker executor can become stuck terminating

2019-04-09 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16813335#comment-16813335 ] Andrei Budnik edited comment on MESOS-9709 at 4/9/19 4:58 PM: -- This agent

[jira] [Comment Edited] (MESOS-9709) Docker executor can become stuck terminating

2019-04-09 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16813335#comment-16813335 ] Andrei Budnik edited comment on MESOS-9709 at 4/9/19 1:24 PM: -- This agent

[jira] [Commented] (MESOS-9709) Docker executor can become stuck terminating

2019-04-09 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16813390#comment-16813390 ] Andrei Budnik commented on MESOS-9709: -- It's a Linux kernel bug: 

[jira] [Commented] (MESOS-9709) Docker executor can become stuck terminating

2019-04-09 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16813335#comment-16813335 ] Andrei Budnik commented on MESOS-9709: -- This agent responds on polling `/state` endpoint, but hangs

[jira] [Commented] (MESOS-9707) Calling link::lo() may cause runtime error

2019-04-08 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16812625#comment-16812625 ] Andrei Budnik commented on MESOS-9707: -- Thanks for filing the ticket! Would you like to create a PR

[jira] [Commented] (MESOS-6285) Agents may OOM during recovery if there are too many tasks or executors

2019-04-08 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16812536#comment-16812536 ] Andrei Budnik commented on MESOS-6285: -- [~kaysoky] What is the relation between this ticket and 

[jira] [Commented] (MESOS-8972) when choose docker image use user network all mesos agent crash

2019-04-04 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1681#comment-1681 ] Andrei Budnik commented on MESOS-8972: -- [~saturnman], [~omegavveapon] Could you please provide

[jira] [Created] (MESOS-9698) DroppedOperationStatusUpdate test is flaky

2019-04-04 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9698: Summary: DroppedOperationStatusUpdate test is flaky Key: MESOS-9698 URL: https://issues.apache.org/jira/browse/MESOS-9698 Project: Mesos Issue Type: Bug

[jira] [Commented] (MESOS-9693) Add master validation for SeccompInfo.

2019-03-30 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16805771#comment-16805771 ] Andrei Budnik commented on MESOS-9693: -- > 2. at most one field of profile_name and unconfined should

[jira] [Created] (MESOS-9614) Implement filtering of Seccomp rules by kernel version.

2019-02-27 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9614: Summary: Implement filtering of Seccomp rules by kernel version. Key: MESOS-9614 URL: https://issues.apache.org/jira/browse/MESOS-9614 Project: Mesos Issue

[jira] [Assigned] (MESOS-9564) Logrotate container logger lets tasks execute arbitrary commands in the Mesos agent's namespace

2019-02-14 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-9564: Assignee: Andrei Budnik (was: Joseph Wu) > Logrotate container logger lets tasks execute

[jira] [Issue Comment Deleted] (MESOS-6632) ContainerLogger might leak FD if container launch fails.

2019-02-08 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik updated MESOS-6632: - Comment: was deleted (was: [~gilbert] Could you please fill out Fix Version/s?) >

[jira] [Commented] (MESOS-6632) ContainerLogger might leak FD if container launch fails.

2019-02-08 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16763730#comment-16763730 ] Andrei Budnik commented on MESOS-6632: -- [~gilbert] Could you please fill out Fix Version/s? >

[jira] [Assigned] (MESOS-6632) ContainerLogger might leak FD if container launch fails.

2019-02-08 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-6632: Assignee: Andrei Budnik > ContainerLogger might leak FD if container launch fails. >

[jira] [Commented] (MESOS-6632) ContainerLogger might leak FD if container launch fails.

2019-02-08 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16763727#comment-16763727 ] Andrei Budnik commented on MESOS-6632: -- https://reviews.apache.org/r/69684/ > ContainerLogger might

[jira] [Assigned] (MESOS-9507) Agent could not recover due to empty docker volume checkpointed files.

2019-02-07 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Budnik reassigned MESOS-9507: Assignee: Andrei Budnik (was: Gilbert Song) > Agent could not recover due to empty

[jira] [Comment Edited] (MESOS-7971) PersistentVolumeEndpointsTest.EndpointCreateThenOfferRemove test is flaky

2019-01-10 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16739635#comment-16739635 ] Andrei Budnik edited comment on MESOS-7971 at 1/10/19 5:40 PM: --- This is

[jira] [Commented] (MESOS-7971) PersistentVolumeEndpointsTest.EndpointCreateThenOfferRemove test is flaky

2019-01-10 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16739635#comment-16739635 ] Andrei Budnik commented on MESOS-7971: -- This is something different from previous ones. {code:java}

[jira] [Comment Edited] (MESOS-9463) Parallel test runner gets confused if a GTEST_FILTER expression also matches a sequential filter

2018-12-19 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16725038#comment-16725038 ] Andrei Budnik edited comment on MESOS-9463 at 12/19/18 2:23 PM: Since

[jira] [Commented] (MESOS-9463) Parallel test runner gets confused if a GTEST_FILTER expression also matches a sequential filter

2018-12-19 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16725038#comment-16725038 ] Andrei Budnik commented on MESOS-9463: -- Since GTEST filter [does not

[jira] [Comment Edited] (MESOS-9462) Devices in a container are inaccessible due to `nodev` on `/var/run`.

2018-12-10 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16715052#comment-16715052 ] Andrei Budnik edited comment on MESOS-9462 at 12/10/18 6:53 PM:

[jira] [Commented] (MESOS-9462) Devices in a container are inaccessible due to `nodev` on `/var/run`.

2018-12-10 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16715052#comment-16715052 ] Andrei Budnik commented on MESOS-9462: -- [https://reviews.apache.org/r/69540/] > Devices in a

[jira] [Created] (MESOS-9461) `CgroupsIsolatorTest.ROOT_CGROUPS_BlkioUsage` is flaky.

2018-12-07 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9461: Summary: `CgroupsIsolatorTest.ROOT_CGROUPS_BlkioUsage` is flaky. Key: MESOS-9461 URL: https://issues.apache.org/jira/browse/MESOS-9461 Project: Mesos Issue

[jira] [Created] (MESOS-9456) Set `SCMP_FLTATR_CTL_LOG` attribute during initialization of Seccomp context

2018-12-05 Thread Andrei Budnik (JIRA)
Andrei Budnik created MESOS-9456: Summary: Set `SCMP_FLTATR_CTL_LOG` attribute during initialization of Seccomp context Key: MESOS-9456 URL: https://issues.apache.org/jira/browse/MESOS-9456 Project:

[jira] [Commented] (MESOS-9157) cannot pull docker image from dockerhub

2018-12-04 Thread Andrei Budnik (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16708647#comment-16708647 ] Andrei Budnik commented on MESOS-9157: -- [~MichaelBowie] feel free to reach out to me directly if you
