Re: Mesos cluster resources utilization

2015-05-07 Thread Adam Bordelon
Yaron, I meant by comparing the available info. You could query Marathon's /v2/apps endpoint to get the list of pending tasks and the resources requested for each of them, and you could check the Mesos master and slave /statistics.json to see the total amount of unallocated resources to estimate
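The comparison Adam describes can be sketched roughly as follows. This is only an illustration under stated assumptions: it takes already-parsed JSON rather than making HTTP calls, it assumes each Marathon app entry carries `instances`, `tasksRunning`, `cpus`, and `mem` fields, and the `total`/`used` dictionaries stand in for whatever the Mesos statistics endpoints report. All function names are hypothetical.

```python
def pending_demand(apps):
    """Sum cpus/mem for instances that are requested but not yet running.

    `apps` is assumed to be the list under the "apps" key of a Marathon
    /v2/apps response; the field names used here are assumptions.
    """
    cpus = 0.0
    mem = 0.0
    for app in apps:
        pending = max(app.get("instances", 0) - app.get("tasksRunning", 0), 0)
        cpus += pending * app.get("cpus", 0.0)
        mem += pending * app.get("mem", 0.0)
    return {"cpus": cpus, "mem": mem}


def unallocated(total, used):
    """Resources not currently allocated, per resource name.

    `total` and `used` are hypothetical dicts built from Mesos statistics.
    """
    return {name: total[name] - used.get(name, 0.0) for name in total}


def shortfall(demand, free):
    """How much of the pending demand cannot be met by free resources."""
    return {name: max(demand[name] - free.get(name, 0.0), 0.0) for name in demand}


if __name__ == "__main__":
    apps = [{"id": "/example", "instances": 3, "tasksRunning": 1,
             "cpus": 0.5, "mem": 128}]
    free = unallocated({"cpus": 4, "mem": 1024}, {"cpus": 3.5, "mem": 512})
    print(shortfall(pending_demand(apps), free))
```

A non-zero shortfall for any resource suggests the pending tasks cannot all be placed with what is currently unallocated. It says nothing about per-slave fragmentation, which would need the per-slave figures.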

Re: Apache Mesos Community Sync

2015-05-07 Thread Vinod Kone
Friendly reminder that the community sync is happening today. Same time, same doc https://docs.google.com/document/d/153CUCj5LOJCFAVpdDZC7COJDwKh9RDjxaTA0S7lzwDA/edit#, same deal. On Wed, Apr 1, 2015 at 3:18 AM, Adam Bordelon a...@mesosphere.io wrote: Reminder: We're having another Mesos

preventing registry failures from happening in mesos-master?

2015-05-07 Thread Erik Weathers
I know we're supposed to run the mesos daemons under supervision (i.e., bring them back up automatically if they fail). But I'm interested in not having the mesos-master fail at all, especially a failure in the registry / replicated_log, which I am already a little scared of. Situation: -

Re: Debugging hadoop-mesos

2015-05-07 Thread Brian Topping
Thanks guys, this was helpful. I started the job tracker as a service, but apparently I never started the task tracker (or it failed to start and I didn't notice). I started it after Haosdent's message, but wasn't able to see any difference and I kept poking around. After making some changes

Re: Debugging hadoop-mesos

2015-05-07 Thread Brian Topping
Thanks Tom! I do see activity in the cluster: 1. mesos-master.WARNING log -- sequence of repeat messages being generated: W0507 18:10:21.794231 11729 master.cpp:2661] Cannot kill task Task_Tracker_34 of framework 20150507-164120-272093962-5050-11711-0003 (Hadoop: (RPC port: 9001, WebUI port

Re: cpu hard limit for docker containerizer?

2015-05-07 Thread Chengwei Yang
Thanks Tim, I'll take a look if I can help. -- Thanks, Chengwei On Thu, May 07, 2015 at 09:56:35PM -0700, Tim Chen wrote: Hi Chengwei, It's a known issue and there is an open JIRA (MESOS-2154) and also an open reviewboard that hasn't been updated for a while. I'd like this to go into

cpu hard limit for docker containerizer?

2015-05-07 Thread Chengwei Yang
Hi List, I see mesos-slave has a `--cgroups_enable_cfs` option to enable a CFS hard CPU limit, which may be really helpful for running online and offline jobs within a single Mesos cluster, since some offline jobs are very CPU-bound. However, after a short trip through the source code, I saw
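For readers unfamiliar with what a CFS hard limit means in practice, here is a minimal sketch of the arithmetic. It assumes a 100 ms CFS period, which is a common default but is an assumption here, and the function name is hypothetical. It does not show how Mesos itself computes the value.

```python
CFS_PERIOD_US = 100_000  # assumed scheduling period: 100 ms, in microseconds


def cfs_quota_us(cpus, period_us=CFS_PERIOD_US):
    """Return the CPU time (in microseconds) a task may use per period.

    With a hard limit, a task allotted `cpus` CPUs is throttled once it has
    consumed cpus * period_us of CPU time within one period.
    """
    if cpus <= 0:
        raise ValueError("cpus must be positive")
    return int(round(cpus * period_us))


if __name__ == "__main__":
    for cpus in (0.5, 1, 2):
        print(cpus, "cpus ->", cfs_quota_us(cpus), "us per period")
```

Under these assumptions, a CPU-bound offline job allotted 0.5 CPUs would be throttled after 50 ms of CPU time in every 100 ms window, instead of borrowing idle cycles as it could with shares alone.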

Re: cpu hard limit for docker containerizer?

2015-05-07 Thread Tim Chen
Hi Chengwei, It's a known issue and there is an open JIRA (MESOS-2154) and also an open reviewboard that hasn't been updated for a while. I'd like this to go into 0.23 if we can get to it; if you'd like to pick up the reviewboard, feel free to do so. Tim On Thu, May 7, 2015 at 7:21 PM, Chengwei

Brigade :: Powered By Mesos

2015-05-07 Thread John Miller
We're utilizing Mesos within our organization for multiple projects. Anyone with access please feel free to add us to the https://mesos.apache.org/documentation/latest/powered-by-mesos/ page. Cheers! John Miller Engineer | www.brigade.com