Re: Mesos loses track of Docker containers

2016-08-10 Thread Jeff Schroeder
Have you considered upgrading Mesos and Marathon? Those are quite old versions of both with some fairly glaring problems with the docker containerizer if memory serves. Also what version of docker? On Wednesday, August 10, 2016, Paul Bell wrote: > Hello, > > One of our

Re: [VOTE] Release Apache Mesos 1.0.0 (rc4)

2016-07-27 Thread Jeff Schroeder
Small nit but can you s/experimnental/experimental/ under the "Storage" header in the release post please? Great work otherwise everyone! On Wednesday, July 27, 2016, Vinod Kone wrote: > Hi all, > > The vote for Mesos 1.0.0 (rc4) has passed with the following votes. > > >

Re: Initial Design Document Apache Mesos Federation (JIRA 3548)

2016-07-14 Thread Jeff Schroeder
; > I would love to hear more from others too. > > Regards, > Dhilip > > On Wed, Jul 13, 2016 at 8:46 PM, Jeff Schroeder <jeffschroe...@computer.org> wrote: > >> Would this mean introdu

Re: Initial Design Document Apache Mesos Federation (JIRA 3548)

2016-07-13 Thread Jeff Schroeder
ted-mesos-clusters-for-global-data-center-designs > Video : > https://www.youtube.com/watch?v=kqyVQzwwD5E=17=PLGeM09tlguZQVL7ZsfNMffX9h1rGNVqnC > > -- Jeff Schroeder Don't drink and derive, alcohol and analysis don't mix. http://www.digitalprognosis.com

Re: Rack awareness support for Mesos

2016-06-07 Thread Jeff Schroeder
On Tuesday, June 7, 2016, Du, Fan wrote: > > > On 2016/6/6 23:48, Jörg Schad wrote: > >> Hi, >> thanks for your idea and design doc! >> Just a few thoughts: >> a) The scheduling part would be implemented in a framework scheduler and >> not the Mesos Core, or? >> > > I'm not

Re: 1.0 Release Candidate

2016-05-25 Thread Jeff Schroeder
Does this mean the work to deprecate the docker containerizer will be post-1.0, or have those plans changed? On Wednesday, May 25, 2016, Vinod Kone wrote: > Hi folks, > > As discussed in the previous community sync, we plan to cut a release > candidate for our next release

Re: Status of MESOS-2533?

2016-05-04 Thread Jeff Schroeder
heck? > > On Thu, May 5, 2016 at 1:16 AM, Benjamin Mahler <bmah...@apache.org> wrote: > >> +AlexR >> >> On Mon, May 2, 2016 at 2:31 PM, Jeff Schroeder <jeffschroe...@computer.org>

Re: How to use a complete host

2016-05-02 Thread Jeff Schroeder
In addition to actually using roles, you might also consider subscribing to MESOS-4138, which will implement this idea in a generic way (but won't help you today). On Monday, May 2, 2016, Christoph Heer wrote: > Hi everyone, > > sometimes in my Mesos use-case it's

Status of MESOS-2533?

2016-05-02 Thread Jeff Schroeder
Some frameworks like Aurora use custom executors to distribute the healthchecks with the tasks. This allows the task to survive a network partition without the scheduler setting it to TASK_LOST. Marathon uses mesos-health-check for command based health checks, but does TCP and HTTP healthchecks

Re: mesos docker vs native container

2016-04-26 Thread Jeff Schroeder
I think you might be a bit confused about how this all works. Docker by default uses bridged networking, which by default spins up a little crappy docker-proxy process for every port. You can disable docker-proxy and instead use hairpin routing mode if you have a modern kernel. However, I'm almost

Mesos on RHEL 5.X

2016-04-19 Thread Jeff Schroeder
ng Mesos in production on RHEL 5.4. You need >> devtoolset-2 to build Mesos. >> >> On Tue, Apr 19, 2016 at 10:50 AM, Jeff Schroeder < >> jeffschroe...@computer.org> wrote: >> >>> The RHEL5 kernel will not support the necessary bits for mesos.

Re: Mesos on RHEL 5.X

2016-04-19 Thread Jeff Schroeder
The RHEL5 kernel will not support the necessary bits for mesos. RHEL6 also lacks the overwhelming majority of support for namespaces and control groups. Try upgrading to RHEL7 and then giving Mesos a go. It doesn't support older kernels. On Tuesday, April 19, 2016, Manivannan

Re: Mesos interconnection among clusters project calico

2016-04-13 Thread Jeff Schroeder
Stefano, you might also follow the jira issue MESOS-3548, which is for mesos to support federation amongst multiple clusters natively. On Wednesday, April 13, 2016, Stefano Bianchi wrote: > Ah ok. > No problem. > See you and best regards!!! > Il 13/apr/2016 21:09, "June

Re: Mesos agents across a WAN?

2016-04-01 Thread Jeff Schroeder
On Thursday, March 31, 2016, Evan Krall <kr...@yelp.com> wrote: > On Wed, Mar 30, 2016 at 6:56 PM, Jeff Schroeder <jeffschroe...@computer.org> wrote: > >> Given regional bare metal M

Re: Mesos agents across a WAN?

2016-03-31 Thread Jeff Schroeder
/browse/MESOS-3548 > > On Thu, Mar 31, 2016 at 3:56 AM, Jeff Schroeder <jeffschroe...@computer.org> wrote: > >> Given regional bare metal Mesos clusters on multiple continents, are >> there

Mesos agents across a WAN?

2016-03-30 Thread Jeff Schroeder
es-cluster-federation -- Jeff Schroeder Don't drink and derive, alcohol and analysis don't mix. http://www.digitalprognosis.com

Re: How to make full version available in /version endpoint

2016-03-23 Thread Jeff Schroeder
wrote: > >> The extra "-2.0.16" portion of that version number is an artifact from >> Mesosphere's build system, and my understanding is they are going to get >> rid of it. So perhaps this will not be a problem in the

Re: How to make full version available in /version endpoint

2016-03-23 Thread Jeff Schroeder
Perhaps building your own version, with your own version string would be sufficient? A general purpose feature to override the stated version with an environment variable doesn't seem very applicable in many environments. Perhaps there is a different way you could accomplish the same ultimate

Re: marathon-lb at scale

2016-03-03 Thread Jeff Schroeder
Being able to set HAPROXY_0_VHOST to the mesos-dns name and having everything just magically work is a pretty fantastic user experience however. Especially for users who might need to talk to SysAdmin teams to manually change DNS. Any alternatives? We have marathon-lb running in a container with

Re: deploy mesos cluster on aws

2016-01-10 Thread Jeff Schroeder
On Sunday, January 10, 2016, Rodrick Brown wrote: > We run 100% on AWS and have been running Mesos in production since version > 0.19 > Our cluster consists of 3 dedicated zookeeper nodes (M3.2lx), 3 dedicated > masters (M3.2lx), 8 dedicated slaves (M4.4xl) and 2 haproxy

Re: Role-related configuration in Mesos

2015-12-28 Thread Jeff Schroeder
Perhaps we could also support HTTP PATCH so you could just update one small thing, versus PUT's get-and-set approach. On Thursday, December 17, 2015, Adam Bordelon wrote: > First off, if we're going to have a /reservations endpoint, we should > follow the same PUT+DELETE pattern

Re: Mesos at Moz

2015-12-07 Thread Jeff Schroeder
From a quick skim, this looks excellent. Are there any plans to try getting the bamboo changes back upstream, or is it going to be a permanent fork? On Monday, December 7, 2015, Arunabha Ghosh wrote: > Hi Folks, > We, at Moz have been working for a while on

Re: [VOTE] Release Apache Mesos 0.24.0 (rc1)

2015-08-28 Thread Jeff Schroeder
Also, if you want to Accept application/json, perhaps you could do something like: [recordLength, recordData], [record2Length, record2Data] That gives you the same semantics, and the record data can be anything, but also falls inline with the server returning Content-Type: application/json For
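A minimal sketch of the [recordLength, recordData] pairing suggested above, written in Python. The function names are hypothetical and not part of any Mesos API, and the sketch assumes the pairs are wrapped in one outer JSON array so the whole body stays valid application/json.

```python
import json


def encode_records(records):
    """Serialize records as a JSON array of [length, data] pairs.

    Hypothetical helper: each record is JSON-encoded to a string and
    paired with the byte length of that string.
    """
    pairs = []
    for record in records:
        data = json.dumps(record, separators=(",", ":"))
        pairs.append([len(data.encode("utf-8")), data])
    return json.dumps(pairs)


def decode_records(body):
    """Parse the body back into records, checking each declared length."""
    records = []
    for length, data in json.loads(body):
        if len(data.encode("utf-8")) != length:
            raise ValueError("record length mismatch")
        records.append(json.loads(data))
    return records
```

The length field is redundant once the body is valid JSON, but it keeps the same semantics as a length-prefixed record stream, which is the point made in the message.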

Re: Mesos/Marathon/HAProxy Logging

2015-08-25 Thread Jeff Schroeder
This is the header that should be passed: https://en.m.wikipedia.org/wiki/X-Forwarded-For Most of the modern internet routes through reverse proxies and this is how we log the actual source clients to solve similar auditing and compliance needs. On Tuesday, August 25, 2015, John Omernik
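A minimal sketch of reading that header on the application side, in Python. The function name and the plain-dict `headers` argument are assumptions for illustration; the sketch also assumes the header was set by a reverse proxy you control, since clients can send an arbitrary X-Forwarded-For value themselves.

```python
def client_ip(headers, remote_addr):
    """Return the original client address for logging.

    X-Forwarded-For is a comma-separated list; by convention the
    left-most entry is the original client and later entries are proxies.
    Falls back to the socket peer address when the header is absent.
    """
    value = headers.get("X-Forwarded-For", "")
    hops = [hop.strip() for hop in value.split(",") if hop.strip()]
    return hops[0] if hops else remote_addr
```

For example, `client_ip({"X-Forwarded-For": "203.0.113.7, 10.0.0.2"}, "10.0.0.3")` yields the left-most address rather than the proxy's.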

Re: No CRAM-MD5 support for libsasl2 when compiling

2015-08-16 Thread Jeff Schroeder
? On 17/08/2015 2:45 PM, Jeff Schroeder wrote: Yes, the same thing happened to me at first. You need both the headers (the dev package and .h files) and the actual libraries installed. On Sunday, August 16, 2015, F21 f21.gro...@gmail.com wrote

Re: No CRAM-MD5 support for libsasl2 when compiling

2015-08-16 Thread Jeff Schroeder
Yes, the same thing happened to me at first. You need both the headers (the dev package and .h files) and the actual libraries installed. On Sunday, August 16, 2015, F21 f21.gro...@gmail.com wrote: I am trying to build mesos 0.23. In my case, I am trying to build mesos and all of its

Re: SSL in Mesos 0.23

2015-08-07 Thread Jeff Schroeder
wrote: I've run ldd on /usr/sbin/mesos-master (on CentOS 7 using mesos 0.23 from mesosphere repo) and I see libssl.3.so and libssl.so.10 On Thu, Aug 6, 2015 at 12:20 PM, Jeff Schroeder jeffschroe...@computer.org wrote: Can you run ldd on the mesos-master or mesos

Re: Problem building 0.23.0 under mock for RHEL7

2015-08-04 Thread Jeff Schroeder
Will do, I'll be in around 9am CST. Thanks Tim On Tuesday, August 4, 2015, Tim St Clair tstcl...@redhat.com wrote: I'll take a look in the a.m. feel free to ping me on irc. Cheers, Tim -- *From: *Jeff Schroeder jeffschroe...@computer.org

Re: Problem building 0.23.0 under mock for RHEL7

2015-08-04 Thread Jeff Schroeder
free to ping me on irc. Cheers, Tim -- *From: *Jeff Schroeder jeffschroe...@computer.org *To: *Mesos Users user@mesos.apache.org *Sent: *Tuesday

Re: Problem building 0.23.0 under mock for RHEL7

2015-08-04 Thread Jeff Schroeder
' in namespace 'std' does not name a type const std::functionvoid(void) connected, ^ / ``` On Wed, Aug 5, 2015 at 10:21 AM, Jeff Schroeder jeffschroe...@computer.org wrote: Because gcc can compile mesos

Problem building 0.23.0 under mock for RHEL7

2015-08-04 Thread Jeff Schroeder
if it has been seen before. [1] http://pkgs.fedoraproject.org/cgit/mesos.git/tree/?h=f23 [2] https://gist.githubusercontent.com/SEJeff/7c01db2704a1352c8d7e/raw/b606fa77b73984cfa02d62cc1c90f4ae736fae0a/build.log -- Jeff Schroeder Don't drink and derive, alcohol and analysis don't mix. http

Re: Questions about framework development - (HA and reconciling state)

2015-07-25 Thread Jeff Schroeder
Not sure how much more difficult it would be, but Apache Aurora uses the native mesos replicated log construct for data persistence (where you store data in memory). It requires one manual setup to deploy the framework, but seems like it is worth it for what you get out of it. Here is how I just

Re: Cluster of Workstations type design for a Mesos cluster

2015-07-21 Thread Jeff Schroeder
As far as mesos is concerned, compute is a commodity. This should work just fine. Put Aurora or Marathon on top of mesos if you need a general purpose scheduler and you're good to go. The nice thing is that you can add additional slaves as you need. I believe heterogeneous clusters are best if

Re: [VOTE] Release Apache Mesos 0.23.0 (rc3)

2015-07-16 Thread Jeff Schroeder
What about MESOS-3055 in 0.23? Is that going to get passed up on even if we are going to cut another rc? On Thursday, July 16, 2015, Vinod Kone vinodk...@gmail.com wrote: -1 so that we can cherry pick MESOS-3055. The master crash bug is MESOS-3070

Re: [VOTE] Release Apache Mesos 0.23.0 (rc2)

2015-07-08 Thread Jeff Schroeder
-1 (non-binding) The Python test ExamplesTest.PythonFramework still fails on RHEL7, unless building the mesos native Python package isn't part of make check. It pops on an ImportError for mesos.native See here for more details: https://gist.github.com/SEJeff/05a31c1d2d10fecc97a1 On Wednesday,

Re: How to upgrade mesos version from a running mesos cluster

2015-06-18 Thread Jeff Schroeder
Hello Tommy, have you read the documentation? If not, please take a look and then follow up with any specific questions here: http://mesos.apache.org/documentation/latest/upgrades/ On Thursday, June 18, 2015, tommy xiao xia...@gmail.com wrote: Hi, I have a question on upgrade strategy: How

Re: slave work_dir filling up

2015-06-09 Thread Jeff Schroeder
http://mesos.apache.org/documentation/latest/configuration/#slave-options Look under Slave Options at --gc_delay and --gc_disk_headroom On Tuesday, June 9,
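A rough sketch of how those two flags interact, in Python. It assumes the formula given in the Mesos configuration documentation, where the maximum sandbox age is gc_delay * max(0.0, 1.0 - gc_disk_headroom - disk_usage); the function name is hypothetical and the formula should be checked against the documentation for the version in use.

```python
def max_sandbox_age_seconds(gc_delay_seconds, gc_disk_headroom, disk_usage):
    """Estimate how long sandbox directories are kept before garbage collection.

    gc_delay_seconds: value of --gc_delay converted to seconds
    gc_disk_headroom: value of --gc_disk_headroom, a fraction in [0, 1]
    disk_usage:       current disk usage of the work_dir volume, in [0, 1]

    Assumed formula (from the configuration docs, not verified here):
        gc_delay * max(0.0, 1.0 - gc_disk_headroom - disk_usage)
    """
    return gc_delay_seconds * max(0.0, 1.0 - gc_disk_headroom - disk_usage)
```

Under that assumption, a fuller disk shortens the retention window, and once usage plus headroom reaches 100% old sandboxes become eligible for removal immediately. Lowering --gc_delay or raising --gc_disk_headroom both make the slave clean up more aggressively.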

Re: Mesos Security Recommendations

2015-06-04 Thread Jeff Schroeder
For securing insecure network communication you can use something like stunnel, then point the app at the local stunnel. It would be a fair bit of hoops to configure it all with your config management system, but is totally doable. On Thursday, June 4, 2015, John Webb webbj1...@hotmail.com

Re: Batch Scheduler with dependency support

2015-05-13 Thread Jeff Schroeder
Apache Aurora does this and you can be explicit about the ordering On Wednesday, May 13, 2015, Aaron Carey aca...@ilm.com wrote: Hi All, I was just wondering if anyone out there knew of a good mesos batch scheduler which supports dependencies between tasks? (ie Task B cannot run until Task

Re: Docker 1.6 and Registry 2.0 support

2015-04-21 Thread Jeff Schroeder
On Tuesday, April 21, 2015, craig w codecr...@gmail.com wrote: That's good to know. I suppose Mesos/Marathon working with Docker Registry 2.0 is not ready, would like confirmation. It has to do with the docker client you have and nothing to do with the mesos / marathon version that I'm aware

Re: Current State of Service Discovery

2015-04-12 Thread Jeff Schroeder
This discussion reminds me of a few excellent blog posts on solving similar problems. Smart clients vs dumb load balancers: http://blog.lusis.org/blog/2013/05/13/smart-clients/ Then there is the local haproxy idea, which I think is a lot less applicable to a mesos environment, but still worth

Re: Using mesos-dns in an enterprise

2015-04-02 Thread Jeff Schroeder
You could also just use keepalived for a vip on each mesos-dns instance assuming they are in the same lan. On Thursday, April 2, 2015, Tom Arnfeld t...@duedil.com wrote: We're using a BGP based solution currently to solve the problem of highly available DNS resolvers. That might be a route

Re: Mesos Hadoop Framework 0.1.0

2015-03-28 Thread Jeff Schroeder
Arnfeld t...@duedil.com wrote: To follow up, this is also a decent solution to a nasty problem in the current framework detailed here, https://github.com/mesos/hadoop/issues/32 . -- Tom Arnfeld Developer // DueDil On Sat, Mar 28, 2015 at 2:40 PM, Jeff Schroeder jeffschroe

Re: Slave recovery not recovering tasks when using systemd

2015-03-26 Thread Jeff Schroeder
On Thursday, March 26, 2015, Joerg Schad jo...@mesosphere.io wrote: Dear Mesos Users, I just wanted to point out a solved issue ( https://issues.apache.org/jira/browse/MESOS-2419) where the *systemd* default behaviour prevents tasks from recovering. The problem is that the default KillMode

Re: [VOTE] Release Apache Mesos 0.22.0 (rc4)

2015-03-23 Thread Jeff Schroeder
How does everyone verify each release works? Obviously make check is important, but are there any sets of automated integration tests that anyone runs for a new release of mesos? If not, should there be? On Monday, March 23, 2015, Dario Rexin da...@mesosphere.io wrote: +1 (non-binding) tested

Re: mesos-collectd-plugin

2015-03-10 Thread Jeff Schroeder
.) So e.g for a hadoop job, when it's finished, we can collect the general metrics of it? Ideally although there are many jobs from different frameworks running at the same time on mesos, we still could get their metrics respectively. Cheers, Dan 2015-03-10 15:46 GMT-05:00 Jeff

Question on Monitoring a Mesos Cluster

2015-03-07 Thread Jeff Schroeder
utilization for capacity planning? We can have the NOC watch this and simply add more hardware when the number starts getting low. Thanks -- Jeff Schroeder Don't drink and derive, alcohol and analysis don't mix. http://www.digitalprognosis.com

Re: Question on Monitoring a Mesos Cluster

2015-03-07 Thread Jeff Schroeder
to the original questions: How does everyone do proper monitoring and capacity planning for large mesos clusters? I expect my cluster to grow beyond what it currently is by quite a bit. -- Jeff Schroeder Don't drink and derive, alcohol and analysis don't mix. http://www.digitalprognosis.com