[jira] [Commented] (MESOS-8038) Launching GPU task sporadically fails.

2018-08-10 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16576641#comment-16576641 ] Zhitao Li commented on MESOS-8038: -- [~gilbert] I don't think we will use forever. My plan is to use a

[jira] [Created] (MESOS-9148) Make cgroups destroy timeout configurable for Mesos containerizer

2018-08-10 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-9148: Summary: Make cgroups destroy timeout configurable for Mesos containerizer Key: MESOS-9148 URL: https://issues.apache.org/jira/browse/MESOS-9148 Project: Mesos

[jira] [Commented] (MESOS-8038) Launching GPU task sporadically fails.

2018-07-26 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16558938#comment-16558938 ] Zhitao Li commented on MESOS-8038: -- I just attached another full agent log with this issue. > Launching

[jira] [Commented] (MESOS-8038) Launching GPU task sporadically fails.

2018-07-25 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16556308#comment-16556308 ] Zhitao Li commented on MESOS-8038: -- Some update: We have another episode on this issue. Our setup is

[jira] [Comment Edited] (MESOS-8038) Launching GPU task sporadically fails.

2018-06-20 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518573#comment-16518573 ] Zhitao Li edited comment on MESOS-8038 at 6/20/18 8:41 PM: --- We have this

[jira] [Comment Edited] (MESOS-8038) Launching GPU task sporadically fails.

2018-06-20 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518573#comment-16518573 ] Zhitao Li edited comment on MESOS-8038 at 6/20/18 8:35 PM: --- We have this

[jira] [Comment Edited] (MESOS-8038) Launching GPU task sporadically fails.

2018-06-20 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518573#comment-16518573 ] Zhitao Li edited comment on MESOS-8038 at 6/20/18 8:34 PM: --- We have this

[jira] [Comment Edited] (MESOS-8038) Launching GPU task sporadically fails.

2018-06-20 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518573#comment-16518573 ] Zhitao Li edited comment on MESOS-8038 at 6/20/18 8:34 PM: --- We have this

[jira] [Commented] (MESOS-8038) Launching GPU task sporadically fails.

2018-06-20 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518573#comment-16518573 ] Zhitao Li commented on MESOS-8038: -- We have this happening again in our cluster. One suggestion I have

[jira] [Assigned] (MESOS-8038) Launching GPU task sporadically fails.

2018-06-20 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-8038: Assignee: Zhitao Li > Launching GPU task sporadically fails. >

[jira] [Commented] (MESOS-9000) Operator API event stream can miss task status updates

2018-06-15 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-9000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16513816#comment-16513816 ] Zhitao Li commented on MESOS-9000: -- I believe the high level intention was to avoid sending unnecessary

[jira] [Assigned] (MESOS-8830) Agent gc on old slave sandboxes could empty persistent volume data

2018-06-01 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-8830: Assignee: Zhitao Li > Agent gc on old slave sandboxes could empty persistent volume data >

[jira] [Commented] (MESOS-8830) Agent gc on old slave sandboxes could empty persistent volume data

2018-05-23 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16486761#comment-16486761 ] Zhitao Li commented on MESOS-8830: -- [~jieyu] I put up a patch in https://reviews.apache.org/r/67264/.

[jira] [Commented] (MESOS-8909) Scrubbing value secret from HTTP responses

2018-05-22 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16484234#comment-16484234 ] Zhitao Li commented on MESOS-8909: -- [~jieyu] Yes this is only applicable to `Value` type secret (we don't

[jira] [Commented] (MESOS-8909) Scrubbing value secret from HTTP responses

2018-05-17 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16479892#comment-16479892 ] Zhitao Li commented on MESOS-8909: -- My current thought: - Create a common function of `void

[jira] [Comment Edited] (MESOS-8830) Agent gc on old slave sandboxes could empty persistent volume data

2018-05-15 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16476604#comment-16476604 ] Zhitao Li edited comment on MESOS-8830 at 5/15/18 11:27 PM: [~jieyu]

[jira] [Comment Edited] (MESOS-8830) Agent gc on old slave sandboxes could empty persistent volume data

2018-05-15 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16476604#comment-16476604 ] Zhitao Li edited comment on MESOS-8830 at 5/15/18 11:22 PM: [~jieyu]

[jira] [Commented] (MESOS-8830) Agent gc on old slave sandboxes could empty persistent volume data

2018-05-15 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16476604#comment-16476604 ] Zhitao Li commented on MESOS-8830: -- [~jieyu] Unfortunately I lost the environment on this issue. Still,

[jira] [Created] (MESOS-8909) Scrubbing value secret from HTTP responses

2018-05-11 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8909: Summary: Scrubbing value secret from HTTP responses Key: MESOS-8909 URL: https://issues.apache.org/jira/browse/MESOS-8909 Project: Mesos Issue Type: Task

[jira] [Commented] (MESOS-8600) Add more permissive reconfiguration policies

2018-05-08 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468172#comment-16468172 ] Zhitao Li commented on MESOS-8600: -- Another usability improvement I'm considering is something like

[jira] [Commented] (MESOS-8884) Flaky `DockerContainerizerTest.ROOT_DOCKER_MaxCompletionTime`.

2018-05-07 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466546#comment-16466546 ] Zhitao Li commented on MESOS-8884: -- Attempt to fix: https://reviews.apache.org/r/66993/ > Flaky

[jira] [Assigned] (MESOS-8884) Flaky `DockerContainerizerTest.ROOT_DOCKER_MaxCompletionTime`.

2018-05-07 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-8884: Assignee: Zhitao Li > Flaky `DockerContainerizerTest.ROOT_DOCKER_MaxCompletionTime`. >

[jira] [Commented] (MESOS-8851) Introduce a push-based gauge.

2018-05-01 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16460228#comment-16460228 ] Zhitao Li commented on MESOS-8851: -- Okay, I'm not particularly picky on naming so either is fine. Do you

[jira] [Created] (MESOS-8856) UNIMPLEMENTED macro in stout could conflict with protobuf

2018-05-01 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8856: Summary: UNIMPLEMENTED macro in stout could conflict with protobuf Key: MESOS-8856 URL: https://issues.apache.org/jira/browse/MESOS-8856 Project: Mesos Issue Type:

[jira] [Commented] (MESOS-8851) Introduce a push-based gauge.

2018-05-01 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16459789#comment-16459789 ] Zhitao Li commented on MESOS-8851: -- This is great, but would it be better called `PollGauge`? >

[jira] [Commented] (MESOS-8830) Agent gc on old slave sandboxes could empty persistent volume data

2018-04-24 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16450468#comment-16450468 ] Zhitao Li commented on MESOS-8830: -- Minor correction: I mistakenly thought the problematic path is a hard

[jira] [Created] (MESOS-8830) Agent gc on old slave sandboxes could empty persistent volume data

2018-04-24 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8830: Summary: Agent gc on old slave sandboxes could empty persistent volume data Key: MESOS-8830 URL: https://issues.apache.org/jira/browse/MESOS-8830 Project: Mesos

[jira] [Created] (MESOS-8791) Convert grow_volume and shrink_volume into non-speculative operations

2018-04-16 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8791: Summary: Convert grow_volume and shrink_volume into non-speculative operations Key: MESOS-8791 URL: https://issues.apache.org/jira/browse/MESOS-8791 Project: Mesos

[jira] [Assigned] (MESOS-5933) Refactor the uri::Fetcher as a binary.

2018-04-12 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-5933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-5933: Assignee: (was: Zhitao Li) > Refactor the uri::Fetcher as a binary. >

[jira] [Issue Comment Deleted] (MESOS-8725) Support max_duration for tasks

2018-04-12 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li updated MESOS-8725: - Comment: was deleted (was: One minor decision I'm making is to require all tasks in the same group to

[jira] [Comment Edited] (MESOS-8600) Add more permissive reconfiguration policies

2018-04-12 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436125#comment-16436125 ] Zhitao Li edited comment on MESOS-8600 at 4/12/18 6:44 PM: --- ping?

[jira] [Commented] (MESOS-8600) Add more permissive reconfiguration policies

2018-04-12 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436125#comment-16436125 ] Zhitao Li commented on MESOS-8600: -- ping? [~vinodkone][~bennoe] We are running this patch for a while in

[jira] [Created] (MESOS-8768) Provide custom reason for cascaded kill in a task group

2018-04-09 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8768: Summary: Provide custom reason for cascaded kill in a task group Key: MESOS-8768 URL: https://issues.apache.org/jira/browse/MESOS-8768 Project: Mesos Issue Type:

[jira] [Created] (MESOS-8748) Create ACL for grow and shrink volume

2018-03-28 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8748: Summary: Create ACL for grow and shrink volume Key: MESOS-8748 URL: https://issues.apache.org/jira/browse/MESOS-8748 Project: Mesos Issue Type: Task

[jira] [Created] (MESOS-8747) Support resizing persistent volume through operator API

2018-03-28 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8747: Summary: Support resizing persistent volume through operator API Key: MESOS-8747 URL: https://issues.apache.org/jira/browse/MESOS-8747 Project: Mesos Issue Type:

[jira] [Created] (MESOS-8746) Support difference for hashset in stout

2018-03-28 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8746: Summary: Support difference for hashset in stout Key: MESOS-8746 URL: https://issues.apache.org/jira/browse/MESOS-8746 Project: Mesos Issue Type: Improvement

[jira] [Commented] (MESOS-8725) Support max_duration for tasks

2018-03-26 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16414617#comment-16414617 ] Zhitao Li commented on MESOS-8725: -- One minor decision I'm making is to require all tasks in the same

[jira] [Commented] (MESOS-8725) Support deadline for tasks

2018-03-23 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16412053#comment-16412053 ] Zhitao Li commented on MESOS-8725: -- {quote}bq.Can you look into whether we could/should implement this in

[jira] [Commented] (MESOS-8725) Support deadline for tasks

2018-03-23 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16412045#comment-16412045 ] Zhitao Li commented on MESOS-8725: -- The following chain is a proof of concept in command executor:

[jira] [Commented] (MESOS-8725) Support deadline for tasks

2018-03-23 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16411704#comment-16411704 ] Zhitao Li commented on MESOS-8725: -- [~jpe...@apache.org], thanks for shepherding this. I'll start with a

[jira] [Assigned] (MESOS-8725) Support deadline for tasks

2018-03-23 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-8725: Shepherd: James Peach Assignee: Zhitao Li > Support deadline for tasks >

[jira] [Commented] (MESOS-8725) Support deadline for tasks

2018-03-22 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16410613#comment-16410613 ] Zhitao Li commented on MESOS-8725: -- [~jamesmulcahy], we actually started on that path, however some of

[jira] [Created] (MESOS-8725) Support deadline for tasks

2018-03-22 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8725: Summary: Support deadline for tasks Key: MESOS-8725 URL: https://issues.apache.org/jira/browse/MESOS-8725 Project: Mesos Issue Type: Improvement

[jira] [Commented] (MESOS-8600) Add more permissive reconfiguration policies

2018-03-16 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16402406#comment-16402406 ] Zhitao Li commented on MESOS-8600: -- [~bennoe], can you add whoever had previous conversations I take from

[jira] [Commented] (MESOS-8411) Killing a queued task can lead to the command executor never terminating.

2018-03-14 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16399654#comment-16399654 ] Zhitao Li commented on MESOS-8411: -- Hi, do you think it's possible to paste log line patterns when this

[jira] [Comment Edited] (MESOS-8609) Create a metric to indicate how long agent takes to recover executors

2018-03-14 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16399215#comment-16399215 ] Zhitao Li edited comment on MESOS-8609 at 3/14/18 8:27 PM: --- {noformat} commit

[jira] [Comment Edited] (MESOS-8609) Create a metric to indicate how long agent takes to recover executors

2018-03-14 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16399215#comment-16399215 ] Zhitao Li edited comment on MESOS-8609 at 3/14/18 7:59 PM: --- commit

[jira] [Comment Edited] (MESOS-8609) Create a metric to indicate how long agent takes to recover executors

2018-03-14 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16399019#comment-16399019 ] Zhitao Li edited comment on MESOS-8609 at 3/14/18 6:11 PM: ---

[jira] [Commented] (MESOS-7461) balloon test and disk full framework test relies on possibly unavailable ports

2018-03-14 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16398987#comment-16398987 ] Zhitao Li commented on MESOS-7461: -- The issue with disk_full_framework.sh using fixed port is still

[jira] [Assigned] (MESOS-7461) balloon test and disk full framework test relies on possibly unavailable ports

2018-03-14 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-7461: Assignee: Zhitao Li > balloon test and disk full framework test relies on possibly unavailable

[jira] [Created] (MESOS-8663) Support transfer of persistent volume between roles without losing data

2018-03-12 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8663: Summary: Support transfer of persistent volume between roles without losing data Key: MESOS-8663 URL: https://issues.apache.org/jira/browse/MESOS-8663 Project: Mesos

[jira] [Comment Edited] (MESOS-6918) Prometheus exporter endpoints for metrics

2018-03-06 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388599#comment-16388599 ] Zhitao Li edited comment on MESOS-6918 at 3/6/18 10:07 PM: --- [~jamespeach], do

[jira] [Commented] (MESOS-6918) Prometheus exporter endpoints for metrics

2018-03-06 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388599#comment-16388599 ] Zhitao Li commented on MESOS-6918: -- [~jamespeach], do you think it's feasible to target some of this work

[jira] [Comment Edited] (MESOS-4965) Support resizing of an existing persistent volume

2018-03-06 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-4965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388552#comment-16388552 ] Zhitao Li edited comment on MESOS-4965 at 3/6/18 9:24 PM: -- WIP [design

[jira] [Commented] (MESOS-4965) Support resizing of an existing persistent volume

2018-03-06 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-4965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388552#comment-16388552 ] Zhitao Li commented on MESOS-4965: -- WIP[ design

[jira] [Commented] (MESOS-8641) New heartbeat on event stream could change the behavior for subscriber

2018-03-06 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388094#comment-16388094 ] Zhitao Li commented on MESOS-8641: -- Attempt to fix:  https://reviews.apache.org/r/65930 > New heartbeat

[jira] [Created] (MESOS-8641) New heartbeat on event stream could change the behavior for subscriber

2018-03-05 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8641: Summary: New heartbeat on event stream could change the behavior for subscriber Key: MESOS-8641 URL: https://issues.apache.org/jira/browse/MESOS-8641 Project: Mesos

[jira] [Created] (MESOS-8637) Persistent volume doc missed related operator API calls

2018-03-05 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8637: Summary: Persistent volume doc missed related operator API calls Key: MESOS-8637 URL: https://issues.apache.org/jira/browse/MESOS-8637 Project: Mesos Issue Type:

[jira] [Assigned] (MESOS-4965) Support resizing of an existing persistent volume

2018-03-02 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-4965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-4965: Assignee: Zhitao Li > Support resizing of an existing persistent volume >

[jira] [Created] (MESOS-8609) Create a metric to indicate how long agent takes to recover executors

2018-02-24 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8609: Summary: Create a metric to indicate how long agent takes to recover executors Key: MESOS-8609 URL: https://issues.apache.org/jira/browse/MESOS-8609 Project: Mesos

[jira] [Assigned] (MESOS-8609) Create a metric to indicate how long agent takes to recover executors

2018-02-24 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-8609: Assignee: Zhitao Li > Create a metric to indicate how long agent takes to recover executors >

[jira] [Assigned] (MESOS-8506) Add test coverage for `Resources::find` on revocable resources

2018-01-29 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-8506: Shepherd: James Peach Assignee: Zhitao Li > Add test coverage for `Resources::find` on

[jira] [Created] (MESOS-8506) Add test coverage for `Resources::find` on revocable resources

2018-01-29 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8506: Summary: Add test coverage for `Resources::find` on revocable resources Key: MESOS-8506 URL: https://issues.apache.org/jira/browse/MESOS-8506 Project: Mesos Issue

[jira] [Assigned] (MESOS-8471) Allow revocable_resources capability for mesos-execute

2018-01-29 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-8471: Shepherd: James Peach Assignee: Zhitao Li > Allow revocable_resources capability for

[jira] [Comment Edited] (MESOS-8480) Mesos returns high resource usage when killing a Docker task.

2018-01-24 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338114#comment-16338114 ] Zhitao Li edited comment on MESOS-8480 at 1/24/18 7:39 PM: --- Will this be also

[jira] [Commented] (MESOS-8480) Mesos returns high resource usage when killing a Docker task.

2018-01-24 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338114#comment-16338114 ] Zhitao Li commented on MESOS-8480: -- Will this be also back ported to 1.5.0 since the RC is still not

[jira] [Commented] (MESOS-8471) Allow revocable_resources capability for mesos-execute

2018-01-23 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16336149#comment-16336149 ] Zhitao Li commented on MESOS-8471: -- A quick attempt is at https://reviews.apache.org/r/65294/ > Allow

[jira] [Created] (MESOS-8471) Allow revocable_resources capability for mesos-execute

2018-01-21 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8471: Summary: Allow revocable_resources capability for mesos-execute Key: MESOS-8471 URL: https://issues.apache.org/jira/browse/MESOS-8471 Project: Mesos Issue Type:

[jira] [Commented] (MESOS-8161) Potentially dangerous dangling mount when stopping task with persistent volume

2018-01-21 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16333751#comment-16333751 ] Zhitao Li commented on MESOS-8161: -- The `TASK_ERROR` state was picked by framework author without good

[jira] [Updated] (MESOS-6893) Track total docker image layer size in store

2018-01-08 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li updated MESOS-6893: - Priority: Minor (was: Major) Description: We want to give cluster operator some insights on total

[jira] [Commented] (MESOS-4945) Garbage collect unused docker layers in the store.

2018-01-08 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-4945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16316816#comment-16316816 ] Zhitao Li commented on MESOS-4945: -- That one is not necessarily part of this epic. I'll move it out. >

[jira] [Created] (MESOS-8365) Create AuthN support for prune images API

2017-12-28 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8365: Summary: Create AuthN support for prune images API Key: MESOS-8365 URL: https://issues.apache.org/jira/browse/MESOS-8365 Project: Mesos Issue Type: Improvement

[jira] [Updated] (MESOS-8365) Create AuthN support for prune images API

2017-12-28 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li updated MESOS-8365: - Target Version/s: 1.5.0 > Create AuthN support for prune images API >

[jira] [Updated] (MESOS-8358) Create agent endpoints for pruning images

2017-12-22 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li updated MESOS-8358: - Issue Type: Improvement (was: Bug) > Create agent endpoints for pruning images >

[jira] [Created] (MESOS-8358) Create agent endpoints for pruning images

2017-12-22 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8358: Summary: Create agent endpoints for pruning images Key: MESOS-8358 URL: https://issues.apache.org/jira/browse/MESOS-8358 Project: Mesos Issue Type: Bug

[jira] [Created] (MESOS-8353) Duplicate task for same framework on multiple agents crashes out master after failover

2017-12-20 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8353: Summary: Duplicate task for same framework on multiple agents crashes out master after failover Key: MESOS-8353 URL: https://issues.apache.org/jira/browse/MESOS-8353

[jira] [Created] (MESOS-8324) Add succeeded metric to container launch in Mesos agent

2017-12-12 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8324: Summary: Add succeeded metric to container launch in Mesos agent Key: MESOS-8324 URL: https://issues.apache.org/jira/browse/MESOS-8324 Project: Mesos Issue Type:

[jira] [Created] (MESOS-8323) Separate resource fetching timeout from executor_registere_timeout

2017-12-12 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8323: Summary: Separate resource fetching timeout from executor_registere_timeout Key: MESOS-8323 URL: https://issues.apache.org/jira/browse/MESOS-8323 Project: Mesos

[jira] [Commented] (MESOS-8070) Bundled GRPC build does not build on Debian 8

2017-12-10 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16285336#comment-16285336 ] Zhitao Li commented on MESOS-8070: -- [~gilbert], can we make sure this catches 1.5 release? Thanks! >

[jira] [Assigned] (MESOS-8280) Mesos Containerizer GC should set 'layers' after checkpointing layer ids in provisioner.

2017-11-29 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-8280: Assignee: Zhitao Li > Mesos Containerizer GC should set 'layers' after checkpointing layer ids in

[jira] [Commented] (MESOS-7366) Agent sandbox gc could accidentally delete the entire persistent volume content

2017-11-01 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16235004#comment-16235004 ] Zhitao Li commented on MESOS-7366: -- I filed MESOS-8161 for the other case. > Agent sandbox gc could

[jira] [Created] (MESOS-8161) Potentially dangerous dangling mount when stopping task with persistent volume

2017-11-01 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8161: Summary: Potentially dangerous dangling mount when stopping task with persistent volume Key: MESOS-8161 URL: https://issues.apache.org/jira/browse/MESOS-8161 Project: Mesos

[jira] [Commented] (MESOS-8090) Mesos 1.4.0 crashes with 1.3.x agent with oversubscription

2017-10-17 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208103#comment-16208103 ] Zhitao Li commented on MESOS-8090: -- A quick attempt to fix: https://reviews.apache.org/r/63084/ > Mesos

[jira] [Updated] (MESOS-8090) Mesos 1.4.0 crashes with 1.3.x agent with oversubscription

2017-10-13 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li updated MESOS-8090: - Description: We are seeing a crash in 1.4.0 master when it receives {{updateSlave}} from a

[jira] [Updated] (MESOS-8090) Mesos 1.4.0 crashes with 1.3.x agent with oversubscription

2017-10-13 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li updated MESOS-8090: - Description: We are seeing a crash in 1.4.0 master when it receives {{updateSlave}} from a

[jira] [Created] (MESOS-8090) Mesos 1.4.0 crashes with 1.3.x agent with oversubscription

2017-10-13 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8090: Summary: Mesos 1.4.0 crashes with 1.3.x agent with oversubscription Key: MESOS-8090 URL: https://issues.apache.org/jira/browse/MESOS-8090 Project: Mesos Issue Type:

[jira] [Updated] (MESOS-8090) Mesos 1.4.0 crashes with 1.3.x agent with oversubscription

2017-10-13 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li updated MESOS-8090: - Affects Version/s: 1.4.0 > Mesos 1.4.0 crashes with 1.3.x agent with oversubscription >

[jira] [Updated] (MESOS-8075) Add RWMutex to libprocess

2017-10-12 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li updated MESOS-8075: - Shepherd: Benjamin Hindman > Add RWMutex to libprocess > - > >

[jira] [Created] (MESOS-8079) Checkpoint and recover layers used to provision rootfs in provisioner

2017-10-12 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8079: Summary: Checkpoint and recover layers used to provision rootfs in provisioner Key: MESOS-8079 URL: https://issues.apache.org/jira/browse/MESOS-8079 Project: Mesos

[jira] [Assigned] (MESOS-8075) Add RWMutex to libprocess

2017-10-11 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li reassigned MESOS-8075: Assignee: Zhitao Li > Add RWMutex to libprocess > - > >

[jira] [Created] (MESOS-8075) Add RWMutex to libprocess

2017-10-11 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8075: Summary: Add RWMutex to libprocess Key: MESOS-8075 URL: https://issues.apache.org/jira/browse/MESOS-8075 Project: Mesos Issue Type: Task Components:

[jira] [Created] (MESOS-8070) Bundled GRPC build does not build on Debian 8

2017-10-10 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8070: Summary: Bundled GRPC build does not build on Debian 8 Key: MESOS-8070 URL: https://issues.apache.org/jira/browse/MESOS-8070 Project: Mesos Issue Type: Bug

[jira] [Commented] (MESOS-6240) Allow executor/agent communication over non-TCP/IP stream socket.

2017-10-04 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16191610#comment-16191610 ] Zhitao Li commented on MESOS-6240: -- +1 Taking out executor to agent API from TCP to domain socket will

[jira] [Updated] (MESOS-8040) Return nested containers in `GET_CONTAINERS` API call

2017-09-28 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhitao Li updated MESOS-8040: - Component/s: containerization Issue Type: Improvement (was: Bug) > Return nested containers in

[jira] [Created] (MESOS-8040) Return nested containers in `GET_CONTAINERS` API call

2017-09-28 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8040: Summary: Return nested containers in `GET_CONTAINERS` API call Key: MESOS-8040 URL: https://issues.apache.org/jira/browse/MESOS-8040 Project: Mesos Issue Type: Bug

[jira] [Comment Edited] (MESOS-8018) Allow framework to opt-in to forward executor's JWT token to the tasks

2017-09-28 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184562#comment-16184562 ] Zhitao Li edited comment on MESOS-8018 at 9/28/17 6:01 PM: --- [~jamespeach] If the

[jira] [Comment Edited] (MESOS-8018) Allow framework to opt-in to forward executor's JWT token to the tasks

2017-09-28 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184562#comment-16184562 ] Zhitao Li edited comment on MESOS-8018 at 9/28/17 6:00 PM: --- [~jamespeach] If the

[jira] [Commented] (MESOS-8018) Allow framework to opt-in to forward executor's JWT token to the tasks

2017-09-28 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-8018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184562#comment-16184562 ] Zhitao Li commented on MESOS-8018: -- [~jamespeach] If the framework *opt-in* to this behavior, then the

[jira] [Created] (MESOS-8018) Allow framework to opt-in to forward executor's JWT token to the tasks

2017-09-26 Thread Zhitao Li (JIRA)
Zhitao Li created MESOS-8018: Summary: Allow framework to opt-in to forward executor's JWT token to the tasks Key: MESOS-8018 URL: https://issues.apache.org/jira/browse/MESOS-8018 Project: Mesos

[jira] [Commented] (MESOS-1739) Allow slave reconfiguration on restart

2017-09-22 Thread Zhitao Li (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16177221#comment-16177221 ] Zhitao Li commented on MESOS-1739: -- Ping on this too. I'm willing to work on this in the next couple of

  1   2   3   4   >