Welcome Charles-François Natali as Mesos Committer and PMC Member!

2021-08-15 Thread Andrei Sekretenko
Hi all,
Please welcome Charles-François as a committer and PMC member of the
Apache Mesos project!

Charles started contributing to Mesos a bit more than a year ago.
During this time, he has discovered and fixed a significant number of
issues in all corners of Mesos, from libprocess and the build system
to the cgroups isolator, thus becoming the most active contributor
since November 2020.

Also, Charles has been helping a lot with JIRA triage, mentoring other
contributors and moving forward a number of technical discussions.

Charles, thanks for all your contributions so far and looking forward to
continuing to work with you on the project!

- Andrei


Re: Permissions to close Jira Issues

2021-05-20 Thread Andrei Sekretenko
Looks like the "Contributors" group of the JIRA project should have
this permission; I've just added you there manually, now it should
work, please try again.

I don't remember if there has ever been any automation to fill this
group based on 'contributors.yaml'.

On Thu, 20 May 2021 at 11:33, Andreas Peters  wrote:
>
> Hi Qian,
>
> yes thats what I mean. But there is nothing to mark it as resolved. :-(
>
> Regards,
> Andreas
>
> Am 20.05.21 um 10:22 schrieb Qian Zhang:
> > Hi Andreas
> >
> > Do you mean marking issues in https://issues.apache.org/jira/ as resolved?
> > I do not think you need special permission to do that.
> >
> >
> > Regards,
> > Qian Zhang
> >
> >
> > On Thu, May 20, 2021 at 3:18 PM Andreas Peters 
> > wrote:
> >
> >> Hey,
> >>
> >>
> >> what I have to do, to get the permissions to close issues in "our" Jira?
> >>
> >>
> >> Regards,
> >> Andreas
> >>
> >>
> >
>
> --
> AVENTER UG (haftungsbeschraenkt)
> Köllner Chaussee 144
> 25337 Kölln-Reisiek
>
> Phone: (+49)4121 - 235582 0
> E-Mail: a...@aventer.biz
> Internet: https://www.aventer.biz
> Git: https://www.github.com/AVENTER-UG
>
> Geschäftsführer/CEO: Andreas Peters
> Unternehmenssitz/City: Kölln-Reisiek
> Handelsregister beim Amtsgericht/Legal Court: Itzehoe
> Handelsregister-Nummer/Company Registration Number: HRB 9995 PI
>
> Diese E-Mail kann vertrauliche und/oder rechtlich geschützte
> Informationen enthalten. Wenn Sie nicht der beabsichtigte Empfänger
> sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie
> bitte sofort den Absender telefonisch oder per E-Mail und löschen
> Sie diese E-Mail aus Ihrem System. Das unerlaubte Kopieren sowie
> die unbefugte Weitergabe dieser Mail ist nicht gestattet.
>
>
> This email can be confidential and/or legally protected
> Information included. If you are not the intended recipient
> or have received this e-mail in error, inform your
> contact and the sender immediately by phone or email and delete this
> email from your system. Unauthorized copying as well
> the unauthorized forwarding of this mail is not permitted.
>


Re: [VOTE] Move Apache Mesos to Attic

2021-04-08 Thread Andrei Sekretenko
Well, one of the strengths of this project used to be effective
collaboration between people sitting in different locations and
different time zones (and, at certain intervals of time, working in
different companies).

Talking about COVID... the pandemic, assuming that it will subside,
should probably be one more reason to vote -1 at this point in time.
I don't know about everyone, but the additional overhead it creates
everywhere is, for example, absolutely not helping me spend any time
outside my working hours on any kind of knowledge transfer in Mesos
(including discussions on the future of the project, for that matter).

On Thu, 8 Apr 2021 at 18:29, Andreas Peters  wrote:
>
> As I see, Andrei you are sitting close to me (Hamburg, Germany), thats
> just 30km away from me. :-) It would make mentoring more easy, if we can
> see each other (of course, after covid).
>
> So, Andrei provide to be a mentor, I provide to be his student. :-)
>
> Cheers,
> Andreas
>
>


Re: [VOTE] Move Apache Mesos to Attic

2021-04-08 Thread Andrei Sekretenko
EDIT: I will reconsider my vote if someone explains how the latter
(lower the bar and remain in ASF) is MORE
harmful to the project than the former (lower the bar and leave ASF)

// Sorry, mixing up left and right, as usual

On Thu, 8 Apr 2021 at 18:10, Andrei Sekretenko  wrote:
>
> Sorry guys, let me try to fail this vote.
> -1 (binding)
>
> To continue any development (including maintenance), the entry bar for
> the people allowed to merge code and participate in project decisions
> has to be lowered, this way or another.
>
> I do not see how the approach "let's move the project out of ASF and
> lower the bar there" benefits the _future_ of the project, compared to
> lowering the committer bar while remaining in ASF.
> (I will reconsider my vote if someone explains how the latter is less
> harmful to the project than the former)
>
> To me, the elimination of the ASF reporting overhead is quite
> marginal, compared to the added overhead of:
>  - potential for future trademark hurdles (which can scare any
> potential contribution there might be)
>  - the need to immediately switch CI and issue tracker
>  - the old JIRA going read-only (yes, this should be counted as a
> separate concern)
> and so on
>
> I'm not saying that Mesos should stick to the ASF-provided
> infrastructure forever, but this is a different story.
> There are a number of things in Mesos that require major overhaul to
> lower the contribution overhead; in the new conditions, this might be
> one of them.
>
> Getting current (and potential future) contributors unstuck and
> relieving Vinod from the duties of heading the project (given that he
> wants to step down from being the PMC chair) is something we should do
> ASAP, however I doubt that moving into Attic is the right way to do
> that.
>
> I do not feel especially optimistic about the future prospects of the
> project, but IMO moving to Attic makes them even worse.
>
>
> ---
> Talking about what I personally can/want/cannot do:
>  - I do not see myself actively contributing (=adding new features and
> submitting solutions to address technical debt) to the project in the
> foreseeable future
>  - I will not be surprised if I will need to contribute some minor but
> critical bugfix in the nearest few months; for example, if someone
> exposes a major vulnerability in Mesos, I will probably have to work
> on that, and work on that quickly
>  - I'm ready to provide technical mentoring (and, hopefully,
> reasonably quick reviews) in areas I'm familiar with (this,
> unfortunately, excludes most of the inner workings of the agent, for
> example)
>  - I can provide mentoring and code reviews in areas I'm less familiar
> with, but do not expect quick feedback from me there, as meaningfully
> reviewing an unfamiliar code can be 2 to 100 times slower
>  - Despite having an opinion on where Mesos should not go, some
> understanding of the project's past mistakes, and some ideas on what
> it should drop, I have absolutely no idea where Mesos should go.
>
> Best,
> Andrei
>
> On Thu, 8 Apr 2021 at 17:50, Benjamin Bannier  wrote:
> >
> > Hi Shane,
> >
> > >> FWIW, I'm one of those people who said they were interested, and I
> > >> still voted to move it to the attic (even though my vote is non
> > >> binding as I'm not a committer).
> > >
> > >That's great!  One of the questions I have for the project is: why
> > >haven't they made you a committer yet?  (That's a question for the PMC,
> > >not for you, really, but it's one I'm betting the Board would be curious
> > >to hear about).
> >
> > I fear the problem is less that contributors haven't been voted in as
> > committers, but IMO that unfortunately the project has seen hardly any
> > contributions of bigger patches from the community in the last couple of
> > years, (I know we merged patches from Charles, but when some people
> > voiced interested here that's the first at least I heard of them). This
> > makes me pessimistic regarding a handover which medium term can improve
> > the situation.
> >
> > I summarized some issues I saw in an earlier thread (yes, this discussion
> > has been going on since Feb 2021), and I think the issues were only if
> > at all marginally related to committership,
> > https://lists.apache.org/thread.html/r4bccbf048a9bcde3f0bb66d5e2c57f585296e1f5e2769486413b2758%40%3Cdev.mesos.apache.org%3E
> > .
> >
> >
> > Cheers,
> >
> > Benjamin


Re: [VOTE] Move Apache Mesos to Attic

2021-04-08 Thread Andrei Sekretenko
Sorry guys, let me try to fail this vote.
-1 (binding)

To continue any development (including maintenance), the entry bar for
the people allowed to merge code and participate in project decisions
has to be lowered, this way or another.

I do not see how the approach "let's move the project out of ASF and
lower the bar there" benefits the _future_ of the project, compared to
lowering the committer bar while remaining in ASF.
(I will reconsider my vote if someone explains how the latter is less
harmful to the project than the former)

To me, the elimination of the ASF reporting overhead is quite
marginal, compared to the added overhead of:
 - potential for future trademark hurdles (which can scare any
potential contribution there might be)
 - the need to immediately switch CI and issue tracker
 - the old JIRA going read-only (yes, this should be counted as a
separate concern)
and so on

I'm not saying that Mesos should stick to the ASF-provided
infrastructure forever, but this is a different story.
There are a number of things in Mesos that require major overhaul to
lower the contribution overhead; in the new conditions, this might be
one of them.

Getting current (and potential future) contributors unstuck and
relieving Vinod from the duties of heading the project (given that he
wants to step down from being the PMC chair) is something we should do
ASAP, however I doubt that moving into Attic is the right way to do
that.

I do not feel especially optimistic about the future prospects of the
project, but IMO moving to Attic makes them even worse.


---
Talking about what I personally can/want/cannot do:
 - I do not see myself actively contributing (=adding new features and
submitting solutions to address technical debt) to the project in the
foreseeable future
 - I will not be surprised if I will need to contribute some minor but
critical bugfix in the nearest few months; for example, if someone
exposes a major vulnerability in Mesos, I will probably have to work
on that, and work on that quickly
 - I'm ready to provide technical mentoring (and, hopefully,
reasonably quick reviews) in areas I'm familiar with (this,
unfortunately, excludes most of the inner workings of the agent, for
example)
 - I can provide mentoring and code reviews in areas I'm less familiar
with, but do not expect quick feedback from me there, as meaningfully
reviewing an unfamiliar code can be 2 to 100 times slower
 - Despite having an opinion on where Mesos should not go, some
understanding of the project's past mistakes, and some ideas on what
it should drop, I have absolutely no idea where Mesos should go.

Best,
Andrei

On Thu, 8 Apr 2021 at 17:50, Benjamin Bannier  wrote:
>
> Hi Shane,
>
> >> FWIW, I'm one of those people who said they were interested, and I
> >> still voted to move it to the attic (even though my vote is non
> >> binding as I'm not a committer).
> >
> >That's great!  One of the questions I have for the project is: why
> >haven't they made you a committer yet?  (That's a question for the PMC,
> >not for you, really, but it's one I'm betting the Board would be curious
> >to hear about).
>
> I fear the problem is less that contributors haven't been voted in as
> committers, but IMO that unfortunately the project has seen hardly any
> contributions of bigger patches from the community in the last couple of
> years, (I know we merged patches from Charles, but when some people
> voiced interested here that's the first at least I heard of them). This
> makes me pessimistic regarding a handover which medium term can improve
> the situation.
>
> I summarized some issues I saw in an earlier thread (yes, this discussion
> has been going on since Feb 2021), and I think the issues were only if
> at all marginally related to committership,
> https://lists.apache.org/thread.html/r4bccbf048a9bcde3f0bb66d5e2c57f585296e1f5e2769486413b2758%40%3Cdev.mesos.apache.org%3E
> .
>
>
> Cheers,
>
> Benjamin


Future technical direction of Mesos?

2021-04-06 Thread Andrei Sekretenko
Hi all,
while the vote on moving to Attic is ongoing (I hope that the current
standstill will end quite soon - by Attic+non-ASF fork, by lowering
the committer bar or in some other way), I think we need to (once
again) ask recent and potential future contributors and users:

In which exact direction would you like to see the project moving?
What are the things you need in Mesos and would be willing to contribute?
For the people and organizations who are maintaining private Mesos
forks - what is there in those forks that might be of value to the
community and you would like to merge into upstream?

I'm asking, because I'm under impression that an (unconscious) view of
some/many of the PMC is that Mesos is not  - and will never be -
solving any task that Kubernetes is not solving now and will not be
solving in the foreseeable future. _If_ one views Mesos+something as a
"contender in a container orchestration war" then that "battle" is
clearly lost.

Using the "bad analogy" Curl/Firefox by Charles-François Natali: what
are the tasks for which you use this "Curl" where it cannot be
reasonably substituted by "Firefox"? And do you envision other tasks
of these kinds?
--
In the other words: what do you want Mesos to be in the future?
A thing capable of running legacy Mesos frameworks for container orchestration?
A core of a system which struggles to keep feature parity with
solutions on top of K8s, has performance characteristics similar to
those solutions, but is not built on top of K8s?
Or... something entirely different?

In the two first cases, I would agree that Mesos has no future and
should be retired (and retired not only as an ASF project, but in
principle).
In the latter case - can you be more specific?

Note that, although the answers might impact the ongoing Attic vote,
the project needs them regardless of the outcome of that vote.

Best regards,
Andrei Sekretenko


Re: Next Steps

2021-02-18 Thread Andrei Sekretenko
IIUC, Attic is not intended for projects which still have active users
and thus might be in need of fixing bugs.

Key items about moving project to Attic:
> It is not intended to:
> - Rebuild community
> - Make bugfixes
> - Make releases

>Projects whose PMC are unable to muster 3 votes for a release, who have no 
>active committers or are unable to fulfill their reporting duties to the board 
>are all good candidates for the Attic.

As a D2iQ employee, I can say that if we find a bug critical for our
customers, we will be interested in fixing that. Should the project be
moved into Attic, the fix will be present only in forks (which might
mean our internal forks).

I could imagine that other entities and people using Mesos are in a
similar position with regards to bugfixes.
If this is true, then moving the project to Attic in the near future
is not a proper solution to the issue of insufficient bandwidth of the
active PMC members/chair.

---
A long-term future of the project is a different story, which, in my
personal view, will "end" either in moving the project into Attic or
in shifting the project direction from what it used to be in the
recent few years to something substantially different. IMO, this
requires a  _separate_ discussion.

Damien's questions sound like a good starting point for that
discussion, I'll try to answer them from my committer/PMC member
perspective when I have enough time.

On Thu, 18 Feb 2021 at 12:49, Charles-François Natali
 wrote:
>
> Thanks Tomek, that's what I suspected.
> It would therefore make it much more difficult for anyone to carry on since 
> it would effectively have to be a fork, etc.
> I think it'd be a bit of a shame, but I understand Benjamin's point.
> I hope it can be avoided.
>
>
> Cheers,
>
>
>
> On Thu, 18 Feb 2021, 11:02 Tomek Janiszewski,  wrote:
>>
>> Moving to attic is making project read only
>> https://attic.apache.org/
>> https://attic.apache.org/projects/aurora.html
>>
>> czw., 18 lut 2021, 11:56 użytkownik Charles-François Natali 
>>  napisał:
>>>
>>> I'm not familiar with the attic but would it still allow to actually
>>> develop, make commits to the repository etc?
>>>
>>>
>>> On Thu, 18 Feb 2021, 08:27 Benjamin Bannier,  wrote:
>>>
>>> > Hi Vinod,
>>> >
>>> > > I would like to start a discussion around the future of the Mesos
>>> > project.
>>> > >
>>> > > As you are probably aware, the number of active committers and
>>> > contributors
>>> > > to the project have declined significantly over time. As of today,
>>> > there's
>>> > > no active development of any features or a public release planned. On 
>>> > > the
>>> > > flip side, I do know there are a few companies who are still actively
>>> > using
>>> > > Mesos.
>>> >
>>> > Thanks for starting this discussion Vinod. Looking at Slack, mailing
>>> > lists, JIRA and reviewboard/github the project has wound down a lot in
>>> > the last 12+ months.
>>> >
>>> > > Given that, we need to assess if there's interest in the community to
>>> > keep
>>> > > this project moving forward. Specifically, we need some active 
>>> > > committers
>>> > > and PMC members who are going to manage the project. Ideally, these 
>>> > > would
>>> > > be people who are using Mesos in some capacity and can make code
>>> > > contributions.
>>> >
>>> > While I have seen a few non-committer folks contribute patches in the
>>> > last months, I feel it might be too late to bootstrap an active
>>> > community at this point.
>>> >
>>> > Apache Mesos is still mentioned prominently in the docs of a number of
>>> > other projects which gives off the impression of an active and
>>> > maintained project. In reality almost nobody is working on issues or
>>> > available to help users, and basing a new project on Apache Mesos these
>>> > days is probably not a good idea. I honestly do not see that to change
>>> > should new people step up and IMO the most honest way forward would be
>>> > to move the project to the attic to clearly communicate that the project
>>> > has moved into another phase; this wouldn't preclude folks from using or
>>> > further developing Apache Mesos, but would give a clear signal to users.
>>> >
>>> > > If there is no active interest, we will likely need to figure out steps
>>> > for
>>> > > retiring the project.
>>> > >
>>> > > *Call for action: If you are interested in becoming a committer/PMC
>>> > member
>>> > > (including PMC chair) and actively maintain the project, please reply to
>>> > > this email.*
>>> >
>>> > Like I wrote above, I would be in favor of a vote to move Apache Mesos
>>> > to the attic.
>>> >
>>> >
>>> > Cheers,
>>> >
>>> > Benjamin
>>> >


Subject: [RESULT][VOTE] Release Apache Mesos 1.11.0 (rc1)

2020-11-24 Thread Andrei Sekretenko
Hi all,

The vote for Mesos 1.11.0 (rc1) has passed with the
following votes.

+1 (Binding)
--
Vinod Kone
Till Toenshoff
Qian Zhang
Andrei Sekretenko

There were no 0 or -1 votes.

Please find the release at:
https://dist.apache.org/repos/dist/release/mesos/1.11.0

It is recommended to use a mirror to download the release:
http://www.apache.org/dyn/closer.cgi

The CHANGELOG for the release is available at:
https://gitbox.apache.org/repos/asf?p=mesos.git;a=blob_plain;f=CHANGELOG;hb=1.11.0

The mesos-1.11.0.jar has been released to:
https://repository.apache.org

The website (http://mesos.apache.org) will be updated shortly to
reflect this release.

Thanks,
Andrei Sekretenko


Re: Subject: [VOTE] Release Apache Mesos 1.11.0 (rc1)

2020-11-24 Thread Andrei Sekretenko
+1 (binding). 
Built, ran tests and benchmarked on Ubuntu 18.05 with gcc 7.5.0 (also built and 
ran binaries/tests with clang 12.0.0).

Regards,
Andrei Sekretenko

On 2020/11/19 01:32:28, Qian Zhang  wrote: 
> +1
> 
> Regards,
> Qian Zhang
> 
> 
> On Thu, Nov 19, 2020 at 4:16 AM Till Toenshoff  wrote:
> 
> > +1
> >
> > Build using Apple clang version 12.0.0 (clang-1200.0.32.27) and ran on
> > macOS 11.0.1 (20B29).
> >
> > > On 17. Nov 2020, at 15:53, Andrei Sekretenko 
> > wrote:
> > >
> > > Hi all,
> > >
> > > Please vote on releasing the following candidate as Apache Mesos 1.11.0.
> > >
> > > 1.11.0 includes the following:
> > >
> > 
> > >  * CSI external volumes support: now, Mesos Containerizer supports using
> > >pre-provisioned external CSI storage volumes by means of the new
> > > `volume/csi`
> > >isolator. Also, the latter significantly extends the range of
> > > compatible 3rd party
> > >   CSI plugins compared to the previous SLRP-based solution
> > > (MESOS-10141).
> > >
> > >  * Constraints-based offer filtering: the Scheduler API adds an
> > > interface allowing
> > >frameworks to put constraints  on agent attributes in resource
> > > offers to help "picky"
> > >frameworks significantly reduce scheduling latency when close to
> > > being out of quota
> > >(MESOS-10161).
> > >
> > >  * CMake build becomes usable for deploying in production (MESOS-898).
> > >
> > > The CHANGELOG for the release is available at:
> > >
> > https://gitbox.apache.org/repos/asf?p=mesos.git;a=blob_plain;f=CHANGELOG;hb=1.11.0-rc1
> > >
> > 
> > >
> > > The candidate for Mesos 1.11.0 release is available at:
> > >
> > https://dist.apache.org/repos/dist/dev/mesos/1.11.0-rc1/mesos-1.11.0.tar.gz
> > >
> > > The tag to be voted on is 1.11.0-rc1:
> > > https://gitbox.apache.org/repos/asf?p=mesos.git;a=commit;h=1.11.0-rc1
> > >
> > > The SHA512 checksum of the tarball can be found at:
> > >
> > https://dist.apache.org/repos/dist/dev/mesos/1.11.0-rc1/mesos-1.11.0.tar.gz.sha512
> > >
> > > The signature of the tarball can be found at:
> > >
> > https://dist.apache.org/repos/dist/dev/mesos/1.11.0-rc1/mesos-1.11.0.tar.gz.asc
> > >
> > > The PGP key used to sign the release is here:
> > > https://dist.apache.org/repos/dist/release/mesos/KEYS
> > >
> > > The JAR is in a staging repository here:
> > > https://repository.apache.org/content/repositories/orgapachemesos-1260
> > >
> > > Please vote on releasing this package as Apache Mesos 1.11.0!
> > >
> > > The vote is open until 2020 Nov 20th 15:00 UTC at least, and passes if
> > > a majority of at least 3 +1 PMC votes are cast.
> > >
> > > [ ] +1 Release this package as Apache Mesos 1.11.0
> > > [ ] -1 Do not release this package because ...
> > >
> > > Thanks,
> > > Andrei Sekretenko
> >
> >
> 


Subject: [VOTE] Release Apache Mesos 1.11.0 (rc1)

2020-11-17 Thread Andrei Sekretenko
Hi all,

Please vote on releasing the following candidate as Apache Mesos 1.11.0.

1.11.0 includes the following:

  * CSI external volumes support: now, Mesos Containerizer supports using
pre-provisioned external CSI storage volumes by means of the new
`volume/csi`
isolator. Also, the latter significantly extends the range of
compatible 3rd party
   CSI plugins compared to the previous SLRP-based solution
(MESOS-10141).

  * Constraints-based offer filtering: the Scheduler API adds an
interface allowing
frameworks to put constraints  on agent attributes in resource
offers to help "picky"
frameworks significantly reduce scheduling latency when close to
being out of quota
(MESOS-10161).

  * CMake build becomes usable for deploying in production (MESOS-898).

The CHANGELOG for the release is available at:
https://gitbox.apache.org/repos/asf?p=mesos.git;a=blob_plain;f=CHANGELOG;hb=1.11.0-rc1


The candidate for Mesos 1.11.0 release is available at:
https://dist.apache.org/repos/dist/dev/mesos/1.11.0-rc1/mesos-1.11.0.tar.gz

The tag to be voted on is 1.11.0-rc1:
https://gitbox.apache.org/repos/asf?p=mesos.git;a=commit;h=1.11.0-rc1

The SHA512 checksum of the tarball can be found at:
https://dist.apache.org/repos/dist/dev/mesos/1.11.0-rc1/mesos-1.11.0.tar.gz.sha512

The signature of the tarball can be found at:
https://dist.apache.org/repos/dist/dev/mesos/1.11.0-rc1/mesos-1.11.0.tar.gz.asc

The PGP key used to sign the release is here:
https://dist.apache.org/repos/dist/release/mesos/KEYS

The JAR is in a staging repository here:
https://repository.apache.org/content/repositories/orgapachemesos-1260

Please vote on releasing this package as Apache Mesos 1.11.0!

The vote is open until 2020 Nov 20th 15:00 UTC at least, and passes if
a majority of at least 3 +1 PMC votes are cast.

[ ] +1 Release this package as Apache Mesos 1.11.0
[ ] -1 Do not release this package because ...

Thanks,
Andrei Sekretenko


Re: Hung website job

2020-10-27 Thread Andrei Sekretenko
Hi Zoran,
sure, this particular job can be cancelled/retried.

>From looking at the log, there seems to be some infrastructure issue on the
ASF Jenkins / Git plugin/system/... level:
the build has yet to reach the build script, and looks stuck at the git
checkout by the SCM plugin.
This might be worth investigating on the infra side (especially given that
Jenkins hadn't aborted the stuck job in 300 minutes; I'm not 100% sure if
Jenkins is supposed to abort a stuck job at the SCM checkout step, though).

On Tue, Oct 27, 2020 at 1:32 PM Zoran Regvart  wrote:

> Hi Mesos folk,
> perhaps someone from your side can take a look?
>
> thanks!
>
> zoran
> -- Forwarded message -
> From: Zoran Regvart 
> Date: Tue, Oct 27, 2020 at 11:49 AM
> Subject: Hung website job
> To: builds 
>
>
> Hi Builders,
> seems that a job has hung[1] on the website1 node preventing other
> website jobs to schedule.
>
> Can it be canceled/retried?
>
> zoran
> [1] https://ci-builds.apache.org/job/Mesos/job/Mesos-Websitebot/716/
> --
> Zoran Regvart
>
>
> --
> Zoran Regvart
>


-- 
--
Andrei Sekretenko


Design document: constraints-based offer filtering

2020-07-28 Thread Andrei Sekretenko
Hi all,
Recently, I and my colleagues have been designing a mechanism in Mesos that
will allow a framework to put constraints on the contents of the offers it
receives: on the attributes of offered agents, and, as a next step, on
resources in the offers, so that the framework is more likely to receive an
offer it really needs.
The primary aim of this design is to help "picky" frameworks running in
presence of quota reduce scheduling latency.

I've distilled the implementation proposal on the Mesos side into a design
doc draft:
https://docs.google.com/document/d/1MV048BwjLSoa8sn_5hs4kIH4YJMf6-Gsqbij3YuT1No
<https://docs.google.com/document/d/1MV048BwjLSoa8sn_5hs4kIH4YJMf6-Gsqbij3YuT1No/edit#heading=h.wq9atl6k4yq0>
/edit#heading=h.wq9atl6k4yq0
<https://docs.google.com/document/d/1MV048BwjLSoa8sn_5hs4kIH4YJMf6-Gsqbij3YuT1No/edit#heading=h.wq9atl6k4yq0>

--
Best regards,
Andrei Sekretenko


Subject: [RESULT][VOTE] Release Apache Mesos 1.10.0 (rc1)

2020-05-28 Thread Andrei Sekretenko
Hi all,

The vote for Mesos 1.10.0 (rc1) has passed with the
following votes.

+1 (Binding)
--
Vinod Kone
Benjamin Mahler
Qian Zhang
Greg Mann

There were no 0 or -1 votes.

Please find the release at:
https://dist.apache.org/repos/dist/release/mesos/1.10.0

It is recommended to use a mirror to download the release:
http://www.apache.org/dyn/closer.cgi

The CHANGELOG for the release is available at:
https://gitbox.apache.org/repos/asf?p=mesos.git;a=blob_plain;f=CHANGELOG;hb=1.10.0

The mesos-1.10.0.jar has been released to:
https://repository.apache.org

The website (http://mesos.apache.org) will be updated shortly to reflect
this release.

Thanks,
Andrei Sekretenko


Re: Subject: [VOTE] Release Apache Mesos 1.10.0 (rc1)

2020-05-26 Thread Andrei Sekretenko
Thanks for checking this!

The first one (centos, non-SSL, gcc, autotools) seems to be a race between
several instances of `javah` attempting to check for existence and create
the output directory.
I believe there were no related changes in 1.10.x compared to 1.9.x.

The second one (ubuntu, SSL, clang, autotools) is somewhat tricky.
The immediate cause of the failure seems to be an attempt to compile
src/tests/http_tests.proto with a not yet built protoc.
src/tests/http_tests.proto has been added in 1.10; there were no tests-only
protobuf definitions in Mesos before that.
However, I'm not quite getting how protobuf compilation in the automake
build is supposed to work at all with a bundled protoc.

When the bundled protobuf is used, I don't see any dependency on protoc
injected into the pb.cc/pb.h targets in the generated Makefile.
Neither do I see how src/Makefile.am is supposed to introduce this
dependency.
(See
https://github.com/apache/mesos/blob/5a04a1693e4f1d51007c23728f1884a307e22src/testssrc/tests9a1/src/Makefile.am#L499
<https://github.com/apache/mesos/blob/5a04a1693e4f1d51007c23728f1884a307e229a1/src/Makefile.am#L499>
and below).
Looks like all other protobufs (usually?) compile due to sheer luck.

The workaround for the javah race and the fix for missing dependency on
protoc seem to be rather straightforward.
If any of these two should be considered a blocker for 1.10.0, please vote
-1.





On Tue, May 19, 2020 at 6:55 PM Vinod Kone  wrote:

> Ran it in Apache CI. Found 2 build issues (issue 1
> <https://builds.apache.org/view/M-R/view/Mesos/job/Mesos-Release/77/BUILDTOOL=autotools,COMPILER=gcc,CONFIGURATION=--verbose%20--disable-libtool-wrappers%20--disable-parallel-test-execution,ENVIRONMENT=GLOG_v=1%20MESOS_VERBOSE=1,OS=centos%3A7,label_exp=(docker%7C%7CHadoop%7C%7Cbeam)&&(!ubuntu-us1)&&(!ubuntu-eu2)/console>,
> issue 2
> <https://builds.apache.org/view/M-R/view/Mesos/job/Mesos-Release/77/BUILDTOOL=autotools,COMPILER=clang,CONFIGURATION=--verbose%20--disable-libtool-wrappers%20--disable-parallel-test-execution%20--enable-libevent%20--enable-ssl,ENVIRONMENT=GLOG_v=1%20MESOS_VERBOSE=1,OS=ubuntu%3A16.04,label_exp=(docker%7C%7CHadoop%7C%7Cbeam)&&(!ubuntu-us1)&&(!ubuntu-eu2)/console>)
> which seem to be related to race condition due to parallel build.
>
> @Andrei Sekretenko  Can you confirm this is
> not a regression in the build system?
>
> *Revision*: 1fb36dcc5a0099f147cd01bd82cd7b4f0aec2256
>
>- refs/tags/1.10.0-rc1
>
> Configuration Matrix gcc clang
> centos:7 --verbose --disable-libtool-wrappers
> --disable-parallel-test-execution --enable-libevent --enable-ssl autotools
> [image: Success]
> <https://builds.apache.org/view/M-R/view/Mesos/job/Mesos-Release/77/BUILDTOOL=autotools,COMPILER=gcc,CONFIGURATION=--verbose%20--disable-libtool-wrappers%20--disable-parallel-test-execution%20--enable-libevent%20--enable-ssl,ENVIRONMENT=GLOG_v=1%20MESOS_VERBOSE=1,OS=centos%3A7,label_exp=(docker%7C%7CHadoop%7C%7Cbeam)&&(!ubuntu-us1)&&(!ubuntu-eu2)/>
> [image: Not run]
> cmake
> [image: Success]
> <https://builds.apache.org/view/M-R/view/Mesos/job/Mesos-Release/77/BUILDTOOL=cmake,COMPILER=gcc,CONFIGURATION=--verbose%20--disable-libtool-wrappers%20--disable-parallel-test-execution%20--enable-libevent%20--enable-ssl,ENVIRONMENT=GLOG_v=1%20MESOS_VERBOSE=1,OS=centos%3A7,label_exp=(docker%7C%7CHadoop%7C%7Cbeam)&&(!ubuntu-us1)&&(!ubuntu-eu2)/>
> [image: Not run]
> --verbose --disable-libtool-wrappers --disable-parallel-test-execution
> autotools
> [image: Failed]
> <https://builds.apache.org/view/M-R/view/Mesos/job/Mesos-Release/77/BUILDTOOL=autotools,COMPILER=gcc,CONFIGURATION=--verbose%20--disable-libtool-wrappers%20--disable-parallel-test-execution,ENVIRONMENT=GLOG_v=1%20MESOS_VERBOSE=1,OS=centos%3A7,label_exp=(docker%7C%7CHadoop%7C%7Cbeam)&&(!ubuntu-us1)&&(!ubuntu-eu2)/>
> [image: Not run]
> cmake
> [image: Success]
> <https://builds.apache.org/view/M-R/view/Mesos/job/Mesos-Release/77/BUILDTOOL=cmake,COMPILER=gcc,CONFIGURATION=--verbose%20--disable-libtool-wrappers%20--disable-parallel-test-execution,ENVIRONMENT=GLOG_v=1%20MESOS_VERBOSE=1,OS=centos%3A7,label_exp=(docker%7C%7CHadoop%7C%7Cbeam)&&(!ubuntu-us1)&&(!ubuntu-eu2)/>
> [image: Not run]
> ubuntu:16.04 --verbose --disable-libtool-wrappers
> --disable-parallel-test-execution --enable-libevent --enable-ssl autotools
> [image: Success]
> <https://builds.apache.org/view/M-R/view/Mesos/job/Mesos-Release/77/BUILDTOOL=autotools,COMPILER=gcc,CONFIGURATION=--verbose%20--disable-libtool-wrappers%20--disable-parallel-test-execution%20--enable-libevent%20--enable-ssl,ENVIRONMENT=GLOG_v=1%20MESOS_VERBOSE=1,OS=ubuntu%3A16.04,label_exp=(docker%7C%7CHadoop%7C%7Cbeam)&&(!ubuntu-us1)&&(!ubuntu

Subject: [VOTE] Release Apache Mesos 1.10.0 (rc1)

2020-05-18 Thread Andrei Sekretenko
Hi all,

Please vote on releasing the following candidate as Apache Mesos 1.10.0.

1.10.0 includes the following major improvements:

* support for resource bursting (setting task resource limits separately
from requests) on Linux
* ability for an executor to communicate with an agent via Unix domain
socket instead of TCP
* ability for operators to modify reservations via the RESERVE_RESOURCES
master API call
* performance improvements of V1 operator API read-only calls bringing them
on par with V0 HTTP endpoints
* ability for a scheduler to expect that effects of calls sent through the
same connection will not be reordered/interleaved by master

NOTE: 1.10.0 includes a breaking change for custom authorizer modules.
Now, `ObjectApprover`s may be stored by Mesos indefinitely and must be kept
up-to-date by an authorizer throughout their lifetime.
This allowed for several bugfixes and performance improvements.

The CHANGELOG for the release is available at:
https://gitbox.apache.org/repos/asf?p=mesos.git;a=blob_plain;f=CHANGELOG;hb=1.10.0-rc1


The candidate for Mesos 1.10.0 release is available at:
https://dist.apache.org/repos/dist/dev/mesos/1.10.0-rc1/mesos-1.10.0.tar.gz

The tag to be voted on is 1.10.0-rc1:
https://gitbox.apache.org/repos/asf?p=mesos.git;a=commit;h=1.10.0-rc1

The SHA512 checksum of the tarball can be found at:
https://dist.apache.org/repos/dist/dev/mesos/1.10.0-rc1/mesos-1.10.0.tar.gz.sha512

The signature of the tarball can be found at:
https://dist.apache.org/repos/dist/dev/mesos/1.10.0-rc1/mesos-1.10.0.tar.gz.asc

The PGP key used to sign the release is here:
https://dist.apache.org/repos/dist/release/mesos/KEYS

The JAR is in a staging repository here:
https://repository.apache.org/content/repositories/orgapachemesos-1259

Please vote on releasing this package as Apache Mesos 1.10.0!

The vote is open until Fri, May 21, 19:00 CEST  and passes if a majority of
at least 3 +1 PMC votes are cast.

[ ] +1 Release this package as Apache Mesos 1.10.0
[ ] -1 Do not release this package because ...

Thanks,
Andrei Sekretenko


Re: 1.10 release is nearing - please check `Target Version`s in JIRA

2020-05-08 Thread Andrei Sekretenko
Hi,
sorry, I probably should have kept things more transparent, here are the
recent updates:
 - 1.10.x was branched off yesterday.  All the work that lands on master
from now on goes into 1.11 unless backported into 1.10. The preliminary
1.10 changelog can be seen here:
https://github.com/apache/mesos/blob/1.10.x/CHANGELOG
 - I'm wrapping up the final checks before tagging 1.10.0-rc1 release
candidate and initiating a release vote (hope that nothing comes up in the
remaining checks).

On Thu, May 7, 2020 at 10:28 PM Charles-François Natali 
wrote:

> Hey,
>
> Sorry if I missed something, but I didn't see any follow up on the release
> - is there an ETA?
>
> Cheers,
>
>
> On Fri, 20 Mar 2020, 22:57 Andrei Sekretenko, 
> wrote:
>
> > Hi all,
> > as some of you probably know, Mesos 1.10 release is near; expect that
> next
> > week the release candidate will be prepared and voted.
> >
> > At this point, please make sure that the work that needs to be landed
> into
> > 1.10 release but has not landed in master branch yet, is labelled by
> > TargetVersion = 1.10.0 in the corresponding ASF JIRA ticket.
> > ---
> > There seem to be unresolved issues in JIRA that had recent activity, but
> > are not labelled with TargetVersion = 1.10.0.
> > It might be the case that some of them actually need to get to 1.10, but
> > are effectively orphaned.
> >
> > Manually browsing through all the project in a search for these "1.10
> > orphans", so I attempted to filter two lists of issues that have an
> > increased probability of containing the orphans:
> > https://jira.apache.org/jira/issues/?filter=12348429
> > https://jira.apache.org/jira/issues/?filter=12348426
> >
> > Most of these issues do not look like 1.10 blockers to me, but if you
> find
> > among them (or elsewhere!) some that absolutely need to block 1.10
> release,
> > please label them with a proper TargetVersion.
> >
> > Best,
> > Andrei Sekretenko
> >
>


-- 
--
Andrei Sekretenko


1.10 release is nearing - please check `Target Version`s in JIRA

2020-03-20 Thread Andrei Sekretenko
Hi all,
as some of you probably know, Mesos 1.10 release is near; expect that next
week the release candidate will be prepared and voted.

At this point, please make sure that the work that needs to be landed into
1.10 release but has not landed in master branch yet, is labelled by
TargetVersion = 1.10.0 in the corresponding ASF JIRA ticket.
---
There seem to be unresolved issues in JIRA that had recent activity, but
are not labelled with TargetVersion = 1.10.0.
It might be the case that some of them actually need to get to 1.10, but
are effectively orphaned.

Manually browsing through all the project in a search for these "1.10
orphans", so I attempted to filter two lists of issues that have an
increased probability of containing the orphans:
https://jira.apache.org/jira/issues/?filter=12348429
https://jira.apache.org/jira/issues/?filter=12348426

Most of these issues do not look like 1.10 blockers to me, but if you find
among them (or elsewhere!) some that absolutely need to block 1.10 release,
please label them with a proper TargetVersion.

Best,
Andrei Sekretenko


Design proposal: synchronous authorization in Mesos

2020-01-06 Thread Andrei Sekretenko
Hi all,
Probably you are aware that authorization in Mesos, as currently
implemented, always requires an asynchronous  step: after sending
authorization request to Authorizer (or requesting a short-lived
ObjectApprover from Authorizer), the caller has to wait (in one way or
another) for completion of a Future.

Recently, myself and a few of my colleagues started to explore an option to
implement synchronous authorization, which will allow to improve Operator
event API throughput, to simplify Master code and, most importantly, to get
rid of several bugs that are otherwise hard to fix cleanly.

The design proposal is here:
https://docs.google.com/document/d/12IVdCY668SGJTLhfEWciZfy2EreM_SJ-nzsTb33Tp44/edit?usp=sharing

Please take a look and comment!

Thanks,
Andrei Sekretenko


[custom Authorizers] changing ObjectApprover life cycle inside of Mesos

2019-12-19 Thread Andrei Sekretenko
Hi all,
as you probably know, asynchronous authorization step in Mesos master and
agent code causes a number of performance/design issues and outright bugs
(see https://issues.apache.org/jira/browse/MESOS-10056).
To deal with those issues, myself and a few of my colleagues are
considering a design which will reuse the existing ObjectApprover interface
for  synchronous authorization.

The main drawback of this design is that existing custom Authorizer
implementations will need to be adapted.

Currently, Mesos code uses the returned Approver for authorizing a single
API call/event, and disposes of the approver immediately after that. Note
that Authorizer interface documentation does not require Mesos to behave
this way; it states nothing about the lifetime/expiration of the returned
ObjectApprover inside of Mesos. The reference implementation
(LocalAuthorizer) does not rely on this behaviour, but custom
implementations might be implicitly expecting it (some of them indeed are).

Using ObjectApprover interface for synchronous authorization will change
the ObjectApprover life cycle in Mesos.

Mesos master, and probably also agent, will be storing ObjectApprovers for
long-lived principals for at least as long as the principal is subscribed.
Thus, custom Authorizers will be required to implement a method that
returns an ObjectApprover which never expires, with its state refreshed by
the Authorizer when needed, in a thread-safe way.

We intend to put part of the code needed for approver refresh alongside
Mesos, both to facilitate adding Mesos tests with non-idempotent approvers
and to simplify adapting custom authorizers.

A more detailed design proposal will follow, but before that we would like
to figure out:
how many custom Authorizer implementations there actually are and how hard
will it be for their maintainers to adapt them for this breaking change?

If you own (or are aware of) an existing actively used custom Authorizer
implementation, please let us know.

Thanks,

Andrei Sekretenko
Software Engineer at D2iQ (previously known as Mesosphere)


Changing behaviour of suppressOffers() to preserve suppressed state on transparent re-registration by the scheduler driver

2019-06-21 Thread Andrei Sekretenko
Hi all,

we are intending to change the behavior of the suppressOffers() method of
MesosSchedulerDriver with regard to the transparent re-registration.

Currently, when driver becomes disconnected from a master, it performs on
its own a re-registration with an empty set of suppressed roles. This
causes un-suppression
of all the suppressed roles of the framework.

The plan is to alter this behavior into preserving the suppression state on
this re-registration.

The required set of suppressed roles will be stored in the driver, which
will be now performing re-registration with this set (instead of an empty
one),
and updating the stored set whenever a call modifying the suppression state
of the roles in the allocator is performed.
Currently, the driver has two methods which perform such calls:
suppressOffers()  and reviveOffers().

Please feel free to raise any concerns or objections - especially if you
are aware of any V0 frameworks which (probably implicitly) depend on
un-suppression of the roles when this re-registration occurs.



Note that:
 - Frameworks which do not call suppressOffers() are, obviously, unaffected
by this change.

 - Frameworks that reliably prevent transparent-re-registration (for
example, by calling driver.abort() immediately from the disconnected()
callback), should also be not affected.

 - Storing the suppressed roles list for re-registration and clearing it in
reviveOffers() do not change anything for the existing frameworks. It is
setting this list in suppressOffers() which might be a cause of concerns.

 - I'm using the word "un-suppression" because re-registering with roles
removed from the suppressed roles list is NOT equivalent to performing
REVIVE call for these roles (unlike REVIVE, it does not clear offerFilters
in the allocator).

=
A bit of background on why this change is needed.

To properly support V0 frameworks with large number of roles, it is
necessary for the driver not to change the suppression state of the roles
on its own.
Therefore, due to the existence of the transparent re-registration in the
driver, we will need to store the required suppression state in the driver
and make it re-register using this state.

We could possibly avoid the proposed change of suppressOffers() by adding
to the driver new interface for changing the suppression state, leaving
suppressOffers() as it is, and marking it as deprecated.

However, this will leave the behaviour of suppressOffers() deeply
inconsistent with everything else.
Compare the following two sequences of events.
First one:
 - The framework creates and starts a driver with roles "role1", "role2"...
"role500", the driver registers
 - The framework calls a new method driver.suppressOffersForRoles({"role1",
..., "role500"}), the driver performs SUPPRESS call for these roles and
stores them in its suppressed roles set.
   (Alternative with the same result: the framework calls
driver.updateFramework(FrameworkInfo, suppressedRoles={"role1", ...,
"role500"}), the driver performs UPDATE_FRAMEWORK call with those
parameters and stores the new suppressed roles set).
 - The driver, due to some reason, disconnects and re-registers with the
same master, providing the stored suppressed roles set.
 - All the roles are still suppressed
Second one:
 - The framework creates and starts a driver with roles "role1", "role2"...
"role500", the driver registers
 - The framework calls driver.suppressOffers(), the driver performs
SUPPRESS call for all roles, but doesn't modify required suppression state.
 - The driver, due to some reason, disconnects and re-registers with the
same master, providing the stored suppressed roles set, which is empty.
 - Now, none of the roles are suppressed, allocator generates offers for
500 roles which will likely be declined by the framework.

This is one of the examples which makes us strongly consider altering the
interaction between suppressOffers() and the transparent re-registration
when we add storing the suppression state to the driver.

Regards,
Andrei Sekretenko


Bundled glog update from 0.3.3 to 0.4.0

2019-03-26 Thread Andrei Sekretenko
Hi all,
We are intending to update the bundled glog from 0.3.3 to 0.4.0.

If you have any objections/concerns, or know about any issues introduced
into glog between 0.3.3 and 0.4.0, please raise them.

Corresponding glog changelogs:
https://github.com/google/glog/releases/tag/v0.4.0
https://github.com/google/glog/releases/tag/v0.3.5
https://github.com/google/glog/releases/tag/v0.3.4

Regards,
Andrei Sekretenko