Re: --resources documentation

2015-08-04 Thread Greg Mann
Hi John, You are correct, memory disk are specified in MB. This is documented at http://mesos.apache.org/documentation/attributes-resources/ in the section titled Predefined Uses Conventions. Cheers, Greg On Tue, Aug 4, 2015 at 6:54 AM, John Omernik j...@omernik.com wrote: Hey, all, I am

Re: mesos fetch uri when behind a squid proxy

2015-11-03 Thread Greg Mann
It's possible that Spark sets the executor environment explicitly, which would lead to the http_proxy and https_proxy environment variables not being passed along to the executor. You could try using the `--executor_environment_variables` command-line flag when running the agent to specify these

Re: Executor can't start d-bus communication

2015-10-14 Thread Greg Mann
t; > After I started all components as user, its worked. > > I wrote the executor using the Executor APIs. By creating a new class. > > Thanks for your reply. > > Regards, > Pradeep > > > On 14 October 2015 at 17:18, Greg Mann <g...@mesosphere.io> wrote: &

Re: Executor can't start d-bus communication

2015-10-14 Thread Greg Mann
Hi Pradeep, Can you tell us a bit about your executor: in particular, did you write it against the Executor API , or is it a plain executable that gets run through the command executor? Cheers, Greg On Wed, Oct 14,

Re: [VOTE] Release Apache Mesos 0.25.0 (rc2)

2015-10-07 Thread Greg Mann
Successfully built `sudo make distcheck` on CentOS 7.1 and Ubuntu 14.04 with only expected test failures. On our Fedora 22 CI build, however, while the tests are building the following compile-time error is produced: [17:18:46][Step 4/6] CXX

Re: mesos 0.24 released?

2015-09-18 Thread Greg Mann
Hi Craig, 0.24.0 has been released, the vote that selected 0.24.0-rc2 as the final release passed on Sept. 4, which matches the date from that tarball. Cheers, Greg On Fri, Sep 18, 2015 at 10:57 AM, craig w wrote: > So RC releases are still going out, it's not yet "final"?

Re: Access to Design Doc

2016-01-12 Thread Greg Mann
Hi John, I just shared the doc with you; let me know if you still have trouble accessing it. Cheers, Greg On Tue, Jan 12, 2016 at 11:42 AM, John Omernik wrote: > Is there a place to request google doc permissions on the design doc here: > >

Re: Understanding offers

2016-06-09 Thread Greg Mann
Hi Nihal, Mesos makes decisions about resource allocation for frameworks based on the Dominant Resource Fairness (DRF) algorithm [1]. There are a couple different ways to affect the offers provided to a particular framework: you can assign a *quota* to the framework's role to give it a guarantee

Re: [VOTE] Release Apache Mesos 0.27.0 (rc2)

2016-01-30 Thread Greg Mann
+1 (non-binding) Note that I discovered a previously-resolved flaky test failure that's cropped back up, noted below. It doesn't seem like a blocker to me, but I wanted to draw attention to it just in case. On a CentOS 7.1 VM, with libevent and SSL enabled: `make check` -> OK `sudo make check`

Re: [VOTE] Release Apache Mesos 0.24.2 (rc1)

2016-02-29 Thread Greg Mann
+1 (non-binding) `sudo make check` on Ubuntu 14.04, using gcc with libevent and SSL enabled. All tests pass except MemoryPressureMesosTest.CGROUPS_ROOT_Statistics, which is a known failure in 0.24. Cheers, Greg On Mon, Feb 29, 2016 at 11:20 AM, Kapil Arya wrote: > +1

Re: [VOTE] Release Apache Mesos 0.27.2 (rc1)

2016-02-29 Thread Greg Mann
+1 (non-binding) `sudo make check` on Ubuntu 14.04 using gcc, with libevent and SSL enabled. All tests pass except MemoryPressureMesosTest.CGROUPS_ROOT_Statistics, which seems to be due to the issue found here: https://issues.apache.org/jira/browse/MESOS-4053 On Mon, Feb 29, 2016 at 2:17 PM,

Re: [VOTE] Release Apache Mesos 0.26.1 (rc1)

2016-02-29 Thread Greg Mann
+1 (non-binding) `sudo make check` on Ubuntu 14.04 using gcc, with libevent and SSL enabled. All tests pass except: PerfEventIsolatorTest.ROOT_CGROUPS_Sample, which is covered here: https://issues.apache.org/jira/browse/MESOS-4655 CgroupsAnyHierarchyWithPerfEventTest.ROOT_CGROUPS_Perf,

Re: [VOTE] Release Apache Mesos 0.27.2 (rc1)

2016-03-01 Thread Greg Mann
gt;>> Fedora 23: `make check` non-root OK > >>> OS X: `make check` non-root OK > >>> Ubuntu 14.04: `make check` non-root, three failures: > >>> ContainerLoggerTest.DefaultToSandbox [flaky, tracked in MESOS-4615] > >>> MasterQuotaTest.AvailableResourcesAfterRescinding [flaky, tracked in >

Re: [VOTE] Release Apache Mesos 0.26.1 (rc1)

2016-03-01 Thread Greg Mann
I was also able to successfully test a simple upgrade scenario between 0.25.1-rc1 and 0.26.1-rc1 using Niklas's upgrade testing script, which I've modified slightly and reposted here: https://reviews.apache.org/r/44229/ On Mon, Feb 29, 2016 at 10:31 PM, Greg Mann <g...@mesosphere.io> wrote:

Re: [VOTE] Release Apache Mesos 0.25.1 (rc1)

2016-03-01 Thread Greg Mann
I was also able to successfully test a simple upgrade scenario between 0.24.2-rc1 and 0.25.1-rc1 using Niklas's upgrade testing script, which I've modified slightly and reposted here: https://reviews.apache.org/r/44229/ On Tue, Mar 1, 2016 at 9:29 AM, Greg Mann <g...@mesosphere.io> wrote:

Re: [VOTE] Release Apache Mesos 0.25.1 (rc2)

2016-03-10 Thread Greg Mann
+1 (non-binding) Ran `sudo make check` on CentOS 7, using gcc with libevent and SSL enabled. The following tests failed during the first test run: ContainerizerTest.ROOT_CGROUPS_BalloonFramework MemoryPressureMesosTest.CGROUPS_ROOT_Statistics MemoryPressureMesosTest.CGROUPS_ROOT_SlaveRecovery

Re: [VOTE] Release Apache Mesos 0.26.1 (rc2)

2016-03-10 Thread Greg Mann
+1 (non-binding) Ran `sudo make check` on CentOS 7, using gcc with libevent and SSL enabled. All tests pass. I was also able to successfully test a simple upgrade scenario from 0.25.1-rc2 to 0.26.1-rc2 using the script found here: https://reviews.apache.org/r/44229/ Cheers, Greg On Tue, Mar

Re: [VOTE] Release Apache Mesos 0.24.2 (rc2)

2016-03-10 Thread Greg Mann
+1 (non-binding) Ran `sudo make check` on CentOS 7, using gcc with libevent and SSL enabled. All tests pass. I was also able to successfully test a simple upgrade scenario from 0.23.1 to 0.24.2-rc2 using the script found here: https://reviews.apache.org/r/44229/ Cheers, Greg On Tue, Mar 8,

Re: orphaned_tasks cleanup and prevention method

2016-04-07 Thread Greg Mann
Hi June, Are these Spark tasks being run in cluster mode or client mode? If it's client mode, then perhaps your local Spark scheduler is tearing itself down before the executors exit, thus leaving them orphaned. I'd love to see master/agent logs during the time that the tasks are becoming

Re: Disappearing tasks

2016-04-06 Thread Greg Mann
Hi Justin, I'm sorry that you've been having difficulty with your cluster. Do you have access to master/agent logs around the time that these tasks went missing from the Mesos UI? It would be great to have a look at those if possible. I would still recommend against setting the agent work_dir to

Mesos Community Sync 4/7/16 - Notes

2016-04-12 Thread Greg Mann
Hey Mesos community, Find below the notes from our last Mesos Community Sync meeting, which occurred on 4/7/2016 at 9am PST. The next community sync will be on 4/21/2016 from 3-4pm PST. See the Mesos community calendar for upcoming meetings (note that you can

[Proposal] Remove the default value for agent work_dir

2016-04-12 Thread Greg Mann
Hey folks! A number of situations have arisen in which the default value of the Mesos agent `--work_dir` flag (/tmp/mesos) has caused problems on systems in which the automatic cleanup of '/tmp' deletes agent metadata. To resolve this, we would like to eliminate the default value of the agent

Re: how to change mesos resources

2016-04-08 Thread Greg Mann
Hi Stefano, The agent saves its resource information on the disk, so that it can recover in case of failure. When the agent starts up, it checks the work directory, and if it finds information left behind by a previous instance of the agent, it loads that information and attempts to register with

Re: orphaned_tasks cleanup and prevention method

2016-04-08 Thread Greg Mann
t; > Thanks, > June Taylor > System Administrator, Minnesota Population Center > University of Minnesota > > On Thu, Apr 7, 2016 at 3:37 PM, Greg Mann <g...@mesosphere.io> wrote: > >> Hi June, >> Are these Spark tasks being run in cluster mode or client mod

Re: [VOTE] Release Apache Mesos 0.28.1 (rc2)

2016-04-05 Thread Greg Mann
+1 (non-binding) Ran `sudo make check` on CentOS 7 with libevent and SSL enabled; all tests pass. I was also able to successfully simulate a simple upgrade scenario using 'test-upgrade.py'. Note that this initially failed due to some changes made to the test framework in this release, but after

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-11 Thread Greg Mann
Hi Justin, Do you have master/agent logs from a time when these tasks would have gone missing from the Mesos UI? What location are you using for the work_dir on the agents? Cheers, Greg On Mon, Apr 11, 2016 at 1:41 PM, Justin Ryan wrote: > Update : I noticed one of the

Re: HTTP Basic Auth for Mesos UI

2016-03-24 Thread Greg Mann
Mi Michal, We have recently enabled authentication on many master/agent endpoints, but authentication doesn't yet work on the webui. Once all the relevant endpoints have authentication enabled, we still need to add a login mechanism to the webui. You can find a ticket for the webui improvement

Re: [VOTE] Release Apache Mesos 0.28.0 (rc1)

2016-03-07 Thread Greg Mann
+1 (non-binding) `make check` and `sudo make check` on OSX 10.10.5 and CentOS 7.1, with libevent and SSL enabled. All tests pass on CentOS 7.1 When running the sudo tests on OSX, three failures were observed: ExamplesTest.JavaFramework is tracked by

Re: [VOTE] Release Apache Mesos 0.25.1 (rc1)

2016-03-01 Thread Greg Mann
+1 (non-binding) `sudo make check` on Ubuntu 14.04 using gcc, with libevent and SSL enabled. All tests pass except: PerfEventIsolatorTest.ROOT_CGROUPS_Sample, which is covered here: https://issues.apache.org/jira/browse/MESOS-4655 CgroupsAnyHierarchyWithPerfEventTest.ROOT_CGROUPS_Perf,

Re: Agent won't start

2016-03-29 Thread Greg Mann
Check out this link for info on /tmp cleanup in Ubuntu: http://askubuntu.com/questions/20783/how-is-the-tmp-directory-cleaned-up And check out this link for information on some of the work_dir's contents on a Mesos agent: http://mesos.apache.org/documentation/latest/sandbox/ The work_dir

Re: Agent won't start

2016-03-29 Thread Greg Mann
Hi Paul, Noticing the logging output, "Failed to find resources file '/tmp/mesos/meta/resources/resources.info'", I wonder if your trouble may be related to the location of your agent's work_dir. See this ticket: https://issues.apache.org/jira/browse/MESOS-4541 Some users have reported issues

Re: Agent won't start

2016-03-29 Thread Greg Mann
Cordially, > > Paul > > On Tue, Mar 29, 2016 at 5:18 PM, Greg Mann <g...@mesosphere.io> wrote: > >> Hi Paul, >> Noticing the logging output, "Failed to find resources file >> '/tmp/mesos/meta/resources/resources.info'", I wonder if your trouble

Mesos Community Sync Notes

2016-04-21 Thread Greg Mann
Hello all! Today we had another Mesos community sync meeting; find the notes pasted below. Of particular interest is our new roadmap document , which outlines items planned for development within the next six months. For

Community Sync Notes

2016-05-20 Thread Greg Mann
Hello all! We had some great discussion yesterday during the Mesos community sync, you can find the notes below. Cheers, Greg Mesos Developer Community Sync http://mesos.apache.org/community/ May 19, 2016 Time: 3pm PST Location: Mesosphere HQ Attendees: Mesosphere: Greg M., Gilbert,

Persistent Volume API Change

2016-05-23 Thread Greg Mann
Hello all, I'm currently working on MESOS-5005 , which is fixing a small bug in the persistent volumes API. When a new persistent volume is created, a `DiskInfo` message is included in the disk resources of the volume. Nested within another message

Re: [Proposal] Remove the default value for agent work_dir

2016-04-19 Thread Greg Mann
t; >> On Tue, Apr 12, 2016 at 9:31 PM, Jie Yu <yujie@gmail.com> wrote: >> >>> +1 >>> >>> On Tue, Apr 12, 2016 at 9:29 PM, James Peach <jor...@gmail.com> wrote: >>> >>> > >>> > > On Apr 12, 2016, at 3:58 PM, G

Re: [VOTE] Release Apache Mesos 1.0.0 (rc4)

2016-07-26 Thread Greg Mann
+1 (non-binding) * Ran `sudo make distcheck` successfully on CentOS 7.1 with only one test failure: ExamplesTest.PythonFramework fails for me the first time it's executed as part of the whole test suite, and then succeeds on subsequent executions. I'm investigating further, and will file a ticket

Re: [VOTE] Release Apache Mesos 1.0.1 (rc1)

2016-08-12 Thread Greg Mann
+1 (non-binding): * Ran "sudo make distcheck" on CentOS 7 with libevent and SSL enabled. All tests passed. * Used "test-upgrade.py" to test upgrades from 0.28.2 -> 1.0.1 and 1.0.0 -> 1.0.1; both were successful. Cheers, Greg On Wed, Aug 10, 2016 at 5:32 PM, Vinod Kone

Re: [VOTE] Release Apache Mesos 1.0.1 (rc1)

2016-08-12 Thread Greg Mann
Whoops! Sorry y'all, my wires got crossed :) I ran these tests on Ubuntu 14.04. G On Fri, Aug 12, 2016 at 12:50 PM, Greg Mann <g...@mesosphere.io> wrote: > +1 (non-binding): > > * Ran "sudo make distcheck" on CentOS 7 with libevent and SSL enabled. All > tests passe

Re: Accepting resources with role as framework

2016-07-18 Thread Greg Mann
Wil, Yes, the framework needs to include the role. Note that if a framework doesn't specify the resource's role, the field will be populated with the default value `*` automatically by the protobuf library when the message is serialized. So, any resource object accepted by your framework will

Re: Welcome Neil Conway as Mesos Committer and PMC member!

2017-01-23 Thread Greg Mann
Congratulations Neil!!! :D On Sun, Jan 22, 2017 at 4:46 PM, Neil Conway wrote: > Thanks for the kind words, everyone! It's been a pleasure to be a part > of the Mesos community, and I'm looking forward to continuing to > contribute. > > Neil > > On Sun, Jan 22, 2017 at

Re: Question on dynamic reservations

2017-01-17 Thread Greg Mann
Hi Povilas, Another approach you could try is to use dynamic reservations only. You could either: 1. Alter your stateful framework to dynamically reserve the resources that it needs, or 2. Add a script to your cluster tooling that would make use of the operator endpoint for dynamic

Re: Question on dynamic reservations

2017-01-17 Thread Greg Mann
ces) > with attributes on them you could have frameworks voluntarily choose not to > consume resources not meant for them. > > e.g. Disk resource has attribute `CASSANDRA`. Ok, since I'm a Kafka > framework I won't go use that disk. > > On Tue, Jan 17, 2017 at 11:24 AM Greg Mann

Re: Welcome Kevin Klues as a Mesos Committer and PMC member!

2017-03-01 Thread Greg Mann
Woowoo! Congrats Kevin!! On Wed, Mar 1, 2017 at 2:26 PM, Avinash Sridharan wrote: > Awesome !! Congrats Kevin !! > > On Wed, Mar 1, 2017 at 2:07 PM, Jie Yu wrote: > >> Congrats! Kevin! Well deserved! >> >> On Wed, Mar 1, 2017 at 2:05 PM, Benjamin

Re: [VOTE] Release Apache Mesos 1.2.0 (rc2)

2017-03-01 Thread Greg Mann
I wanted to give a heads up on a flaky test failure I've encountered while testing this RC: 'DockerRuntimeIsolatorTest.ROO T_INTERNET_CURL_DockerDefaultEntryptRegistryPuller'. One issue related to this test was resolved recently (https://issues.apache.org/ jira/browse/MESOS-6001), but this seems

Re: Resource Isolation in Mesos

2016-09-07 Thread Greg Mann
Hi Srikant, Without using cgroups, it won't be possible to enforce isolation of cpu/memory on a Linux agent. Could you elaborate a bit on why you aren't able to use cgroups currently? Have you tested the existing Mesos cgroup isolators in your system? Cheers, Greg On Tue, Sep 6, 2016 at 9:24 PM,

Re: missing documentation: view_frameworks, view_tasks etc in mesos 1.0

2016-09-07 Thread Greg Mann
Hi Haripriya, I just ran a quick test using your ACLs (I added a permissive ACL for "run_tasks" as well), and I was able to view everything in the web UI. I did this test with the current HEAD of Mesos master, however, so it's possible that something has changed since 1.0. One thing that can be

Re: missing documentation: view_frameworks, view_tasks etc in mesos 1.0

2016-09-07 Thread Greg Mann
configure view_tasks, view_executors, access_sandboxes, do we need > additional authentication added to the exisiting mesos UI? > > From the UI how does it recognize User A to be User A? Are there any > assumptions that I'm missing? What is the required infrastructure for multi > tenancy her

Re: mesos marathon roles

2016-09-07 Thread Greg Mann
Hi Vincent, Can you confirm it's because I didn't set any static reservation ? > Yes, that's correct. So how could I check the resource allocation with multiple marathon > instances and roles, and configured weights between these roles ? Is > Marathon supposed to reserve resources with the

Re: [VOTE] Release Apache Mesos 0.28.3 (rc1)

2016-11-30 Thread Greg Mann
+1 (non-binding) Did `sudo make check` on CentOS 7. Aside from several LinuxFilesystemIsolatorTests and two other flaky tests, CgroupsAnyHierarchyWithFreezerTest.ROOT_CGROUPS_DestroyTracedProcess and MemoryPressureMesosTest.CGROUPS_ROOT_SlaveRecovery, everything passed. Cheers, Greg On Wed, Nov

Re: [MESOS-6240] Allow executor/agent communication over non-TCP/IP stream socket.

2017-01-06 Thread Greg Mann
Hi Bingqiang, I've had some recent discussions with folks about this feature; it's something we are interested in doing but I'm not sure what its priority is in relation to other tickets. While the AF_UNIX address type has been added to libprocess, libprocess does not currently accept domain

Re: [Design Doc] [RFC] Executor Authentication

2017-01-04 Thread Greg Mann
, 2016 at 7:00 PM, Greg Mann <g...@mesosphere.io> wrote: > Hello all, > As part of the continuing effort to secure all communication in a Mesos > cluster, we would like to add authentication to the executor HTTP API. > Linked below is a design document draft for this feature; I w

[Design Doc] [RFC] Executor Authentication

2016-12-23 Thread Greg Mann
Hello all, As part of the continuing effort to secure all communication in a Mesos cluster, we would like to add authentication to the executor HTTP API. Linked below is a design document draft for this feature; I would love to get the community's feedback! Feel free to leave comments on the

Re: [VOTE] Release Apache Mesos 1.0.4 (rc1)

2017-04-25 Thread Greg Mann
+1 (non-binding) Ran `sudo make check` on CentOS 7 with Docker 1.12.1. The only test failure was: ProvisionerDockerPullerTest.ROOT_INTERNET_CURL_Whiteout While I haven't had a chance to look deeply into this, it seems that the whiteout handling was not correct at the time of 1.0, and these

Re: Welcome Gilbert Song as a new committer and PMC member!

2017-05-24 Thread Greg Mann
Congratulations Gilbert!! :D On Wed, May 24, 2017 at 12:01 PM, Avinash Sridharan wrote: > Congrats Gilbert !! Very well deserved !! > > On Wed, May 24, 2017 at 11:56 AM, Timothy Chen wrote: > > > Congrats! Rocking the containerizer world! > > > > Tim >

Re: Welcome Greg Mann as a new committer and PMC member!

2017-06-15 Thread Greg Mann
; > On Tue, Jun 13, 2017 at 2:42 PM Vinod Kone <vinodk...@apache.org> wrote: > > > Hi folks, > > > > Please welcome Greg Mann as the newest committer and PMC member of the > > Apache Mesos project. > > > > Greg has been an active contributor to the

[Design Doc] An Improved KillPolicy

2017-09-25 Thread Greg Mann
Hello all! I've been working on a little design for some improvements to the KillPolicy. You can find the design doc here . TL;DR: the plan is to extend the KillPolicy message to allow the

This Month in Mesos - December 2017

2017-12-12 Thread Greg Mann
Dear Apache Mesos Community, Development in Mesos has been active lately, with work taking place to enable things like hybrid cloud and network storage support, as well as improvements to the scheduler API designed to make the lives of framework developers easier. Apache Mesos version 1.5 is

UPDATE: Mesos 1.6.0 release

2018-05-04 Thread Greg Mann
. I have created the 1.6.x branch and *will cut Mesos 1.6 RC1 on Monday, May 7. *Keep your eyes peeled on the mailing list for the email, your help in testing is greatly appreciated! Thanks for your patience! Cheers, Greg On Tue, Apr 24, 2018 at 3:44 PM, Greg Mann <g...@mesosphere.io>

[VOTE] Release Apache Mesos 1.6.0 (rc1)

2018-05-07 Thread Greg Mann
Hi all, Please vote on releasing the following candidate as Apache Mesos 1.6.0. 1.6.0 includes the following: * Resizing of persistent volumes for agent default resources * Offer operation feedback for resource

Re: Getting roles' info in Scheduler

2018-05-10 Thread Greg Mann
Hi Pascal, This isn't possible directly with the SchedulerDriver, but your scheduler could use the 'GET_ROLES' call of the operator API [1] for this purpose. Cheers, Greg [1] http://mesos.apache.org/documentation/latest/operator-http-api/#get_roles On Sat, May 5, 2018 at 3:37 AM, Pascal Gillet

[RESULT][VOTE] Release Apache Mesos 1.6.0 (rc1)

2018-05-11 Thread Greg Mann
Hi all, The vote for Mesos 1.6.0 (rc1) has passed with the following votes. +1 (Binding) -- Vinod Kone Chun-Hung Hsiao James Peach Zhitao Li Andrew Schwartzmeyer There were no 0 or -1 votes. Please find the release at:

Soliciting documentation feedback

2018-05-17 Thread Greg Mann
Hi everyone, As part of our ongoing effort to improve the Mesos docs, we're looking for your help. What areas of the Mesos documentation need the most improvement? Do you have projects to suggest or mistakes to flag? We'll be compiling this feedback into project suggestions for the May 24th

Re: Doc-a-thon - May 24th

2018-05-17 Thread Greg Mann
Hi all, Just a reminder about the Mesos Doc-a-thon coming up next Thursday, May 24 starting at 3pm PST! You can join in person (RSVP here ) or online (link to join ). It would be great to see you

Re: [VOTE] Release Apache Mesos 1.5.1 (rc1)

2018-05-15 Thread Greg Mann
+1 (binding) I did `sudo make check` and verified that only expected flaky tests failed. Cheers, Greg On Fri, May 11, 2018 at 12:35 PM, Gilbert Song wrote: > Hi all, > > Please vote on releasing the following candidate as Apache Mesos 1.5.1. > > 1.5.1 includes the

Re: Proposing change to the allocatable check in the allocator

2018-06-12 Thread Greg Mann
Hi all, We had a nice discussion about this in the API working group meeting today. I agree that it's a good idea to do our best to make this change compatible with future updates to the Request call and/or quota. I think it would be beneficial to have a meeting in a few days to brainstorm some

[VOTE] Release Apache Mesos 1.6.1 (rc1)

2018-06-26 Thread Greg Mann
Hi all, Please vote on releasing the following candidate as Apache Mesos 1.6.1. 1.6.1 includes the following: *Announce major features here* *Announce major bug fixes here* The CHANGELOG for the

Re: [VOTE] Release Apache Mesos 1.6.1 (rc1)

2018-07-02 Thread Greg Mann
tical bug that would wipe out persistent data. I'm backporting > this to 1.4, 1.5 and 1.6. > > On Fri, Jun 29, 2018 at 9:05 AM Greg Mann wrote: > > > The failures here are mostly command executor/default executor tests. > > Looking at the test output, it seems that the tasks i

[API WG] Meeting tomorrow!

2018-04-30 Thread Greg Mann
Hi all, The API working group will meet tomorrow, May 1, from 11:00-11:50am PST. We'll be chatting about a proposal for per-framework metrics . Feel free to add more items to the agenda doc

Normalization of metric keys

2018-07-03 Thread Greg Mann
Hi all! I'm currently working on adding a suite of new per-framework metrics to help schedulers better debug unexpected/unwanted behavior (MESOS-8842 ). One issue that has come up during this work is how we should handle strings like the framework

Re: [VOTE] Release Apache Mesos 1.6.1 (rc1)

2018-07-03 Thread Greg Mann
Hey folks, an update on the 1.6.1-rc2 candidate: an issue surfaced after the fix was merged for MESOS-8830, which is being addressed currently. I'll be AFK for the next 3 days, so I'll cut 1.6.1-rc2 this coming Monday. Sorry for the delay! Cheers, Greg On Mon, Jul 2, 2018 at 12:30 PM, Greg Mann

This Month in Mesos - January 2018

2018-01-30 Thread Greg Mann
Dear Apache Mesos Community, Happy new year! Here's your latest update on happenings in Mesosworld over the past month. *Working Groups* The working groups have been active; here's a brief summary of recent developments, along with links to the agenda documents. If you see anything that

This Month in Mesos - February 2018

2018-02-28 Thread Greg Mann
Dear Apache Mesos Community, Hello all! I've got a short update for you this month with recent happenings in Mesosland: *Working Groups* Here's the latest from the working groups. Working group meetings can be found on the Mesos community calendar

Release policy and 1.6 release schedule

2018-03-13 Thread Greg Mann
Hi folks, During the recent API working group meeting [1], we discussed the release schedule. This has been a recurring topic of discussion in the developer sync meetings, and while our official policy still specifies time-based releases at a bi-monthly cadence, in practice we tend to gate our

Re: Mesos master endless attemps to kill unexisting task

2018-03-15 Thread Greg Mann
Hi Adam, The KILL calls are being sent to Mesos by Marathon. Since the KILL call is being forwarded to the agent, it seems that the Mesos master is aware of the task. Could you verify that the tasks show up as running in the Mesos UI? You say that the tasks don't exist anymore - how did you verify

Re: Release policy and 1.6 release schedule

2018-04-10 Thread Greg Mann
Thanks for the reviews, y'all! I've got a few "Ship-Its" - I'll commit this later today unless I hear any objections. Cheers, Greg On Wed, Apr 4, 2018 at 11:49 AM, Greg Mann <g...@mesosphere.io> wrote: > Hey folks, > I've posted a proposed update to our documented rel

Re: Mesos master endless attemps to kill unexisting task

2018-04-04 Thread Greg Mann
t; Yes I can confirm No kill ack received for instance >> [pub_api_oecd-rest-api-on-port-20015.marathon-196f414a-f61f-11e7-856c-f6e84742f1ef], >> retrying (73402 attempts so far)i cannot find this UUID in Mesos interface. >> >> Regards, Adam. >> >> On 03/15/2018 0

Re: Release policy and 1.6 release schedule

2018-04-04 Thread Greg Mann
Hey folks, I've posted a proposed update to our documented release schedule: https://reviews.apache.org/r/66454/ Please take a look and comment! Cheers, Greg On Mon, Mar 26, 2018 at 11:34 AM, Greg Mann <g...@mesosphere.io> wrote: > +1 for quarterly. I would also say that we should

Re: Doc-a-thon May 24th?

2018-04-13 Thread Greg Mann
Sounds good to me - thanks Judith!! On Wed, Apr 11, 2018 at 1:35 PM, Judith Malnick wrote: > Hi All, > > I'd like the next Mesos Doc-a-thon to happen on May 24th from 3-8pm > Pacific time. I picked the date because it's best for Ben H. > > Does anyone have major reasons

Proposal: Constrained upgrades from Mesos 1.6

2018-04-10 Thread Greg Mann
Hi all, We are currently working on patches to implement the new GROW_VOLUME and SHRINK_VOLUME operations [1]. In order to make it into Mesos 1.6, we're pursuing a workaround which affects the way these operations are accounted for in the Mesos master. These operations will be marked as

[API WG] Meeting tomorrow

2018-04-17 Thread Greg Mann
Hey folks, The API working group will be meeting tomorrow at 11am PST. We'll be chatting about CPU guarantees and limits. Feel free to add items to the agenda ! Cheers, Greg

UPDATE: Mesos 1.6.0 release

2018-04-24 Thread Greg Mann
Hi all, Several contributors who are either assignees or shepherds for 1.6 blocker tickets (myself included) have recently been pulled into some high priority internal work, so unfortunately I would like to delay the cut of the first release candidate for Mesos 1.6.0 to ensure that we have

This Month in Mesos - March 2018

2018-03-30 Thread Greg Mann
Oh hai there Apache Mesos Community! Back again with your monthly update on current events in the Mesosverse: *Working Groups* Below you'll find a brief summary of the group meetings from this past month, as well as some info about related work that's been happening in the project. Working

Re: Release policy and 1.6 release schedule

2018-03-26 Thread Greg Mann
h of an ask for committers and > maintainers. > > > > I also agree with folks that there're benefits doing releases more > > frequently. Given the historical data, I'd suggest we do quarterly > > releases, and maintain three release branches. > > > > - Jie

[API WG] Meeting today

2018-04-03 Thread Greg Mann
Hi all, The API working group will be meeting today at 11am PST. We'll be discussing HTTP return codes in Mesos [1]. If you have any other items for discussion, add them to the agenda! [2] Cheers, Greg [1] https://issues.apache.org/jira/browse/MESOS-7697 [2]

Re: Release policy and 1.6 release schedule

2018-03-26 Thread Greg Mann
> > I think the burden of maintaining a release branch is not just > backporting. We need to run CI to make sure every maintained release branch > are working, and do testing for that. It's a burden if there are too many > release branches. > > That's a good point, we do need to run CI on all

API Working Group Meeting Today

2018-03-20 Thread Greg Mann
Hey folks, Sorry for the late reminder - the API working group will meet this morning at 11am PST. Zhitao will be presenting the new volume resizing feature he's designing, and we'll have a discussion on the current state and future directions of the Mesos CLI. If you have any other items for

Re: Release policy and 1.6 release schedule

2018-03-23 Thread Greg Mann
we declared on policy. That makes upstream projects easier to >> gauge when a feature will be ready and when they can try it out. >> >> On Tue, Mar 13, 2018 at 3:10 PM, Greg Mann <g...@mesosphere.io> wrote: >> >> > Hi folks, >> > During the recent API worki

API Working Group - First Meeting Tomorrow

2018-03-05 Thread Greg Mann
Hello all, We'll be having our first API working group meeting tomorrow, March 6 at 11am PST. This working group is a great opportunity for us to work toward greater consistency and usability of our API, as well as raise issues with the current interface and plan future directions. You can find

Re: Backport Policy

2018-07-26 Thread Greg Mann
t;>>> are >>>> on the hook to triage and fix it and not the release manager. >>>> >>>> Having said that, if "during" the release process (i.e., cutting an RC) >>>> these backports cause delays for a release manager in getting the &g

Re: [VOTE] Release Apache Mesos 1.6.1 (rc2)

2018-07-25 Thread Greg Mann
aram() = true > > They are all known to be flaky. > > On Wed, Jul 11, 2018 at 6:15 PM Greg Mann wrote: > > > Hi all, > > > > Please vote on re

[RESULT][VOTE] Release Apache Mesos 1.6.1 (rc2)

2018-07-25 Thread Greg Mann
Hi all, The vote for Mesos 1.6.1 (rc2) has passed with the following votes: +1 (Binding) -- Chun-Hung Hsiao Vinod Kone Gastón Kleiman There were no 0 or -1 votes. Please find the release at: https://dist.apache.org/repos/dist/release/mesos/1.6.1 It is recommended

Re: [VOTE] Release Apache Mesos 1.6.1 (rc1)

2018-06-29 Thread Greg Mann
n 27, 2018 at 11:59 AM Jie Yu wrote: > >> +1 >> >> Passed on our internal CI that has the following matrix. I looked into >> the only failed test, looks to be a flaky test due to a race in the test. >> >> >> >> On Tue, Jun 26, 2018 at 7:02 PM, Gr

Re: Normalization of metric keys

2018-07-09 Thread Greg Mann
e user actually supplied > '#%$' or '/'. But using something like percent-encoding would have property > 3. > > On Fri, Jul 6, 2018 at 10:25 AM, Greg Mann wrote: > >> Thanks for the reply Ben! >> >> Yea I suspect the lack of normalization there was not intentional, and i

Re: Normalization of metric keys

2018-07-06 Thread Greg Mann
Thanks for the reply Ben! Yea I suspect the lack of normalization there was not intentional, and it means that you can no longer reliably split on '/' unless you apply some external controls to user input. Yep, this is bad :) One thing we should consider when normalizing metadata embedded in

Re: [VOTE] Release Apache Mesos 1.6.1 (rc2)

2018-07-11 Thread Greg Mann
-9002] - GCC 8.1 build failure in os::Fork::Tree. * [MESOS-9024] - Mesos master segfaults with stack overflow under load. * [MESOS-9025] - The container which joins CNI network and has checkpoint enabled will be mistakenly destroyed by agent. Cheers, Greg On Wed, Jul 11, 2018 at 6:15 PM, Greg

[VOTE] Release Apache Mesos 1.6.1 (rc2)

2018-07-11 Thread Greg Mann
Hi all, Please vote on releasing the following candidate as Apache Mesos 1.6.1. 1.6.1 includes the following: *Announce major features here* *Announce major bug fixes here* The CHANGELOG for the

Re: [VOTE] Release Apache Mesos 1.6.1 (rc2)

2018-07-12 Thread Greg Mann
ase > because of MESOS-8934? ;) > > On Wed, Jul 11, 2018, 9:37 PM Greg Mann wrote: > >> Whoops, I forgot to include the list of changes included in this release >> - sorry! >> >> 1.6.1-rc2 includes the following notable bug fixes: >> >> * [M

Re: Backport Policy

2018-07-13 Thread Greg Mann
It seems to me that putting the burden of deciding on backports on the release manager would actually increase the amount of work required. Simply cutting the release on a particular date is pretty quick - however, examining tickets to determine whether or not a particular fix should be backported

Re: [VOTE] Move the project repos to gitbox

2018-07-17 Thread Greg Mann
+1 On Tue, Jul 17, 2018 at 9:39 AM, Jie Yu wrote: > +1 > > On Tue, Jul 17, 2018 at 9:38 AM, Andrew Schwartzmeyer < > and...@schwartzmeyer.com> wrote: > >> +1 >> >> >> >> On 07/17/2018 8:54 am, Zhitao Li wrote: >> >> +1 >> >> On Tue, Jul 17, 2018 at 8:10 AM James Peach wrote: >> >>> >>> >>> >

  1   2   >