Re: Mesos 0.28 SSL in official packages

2016-04-11 Thread Zameer Manji
I have suggested this before and I will suggest it again here. I think the Apache Mesos project should build and distribute packages instead of relying on the generosity of a commercial vendor. The Apache Aurora project does this already with good success. As a user of Apache Mesos I don't care

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-11 Thread Justin Ryan
I’m now using /var/mesos as my work_dir, and don’t have any logs from when they’ve gone missing because getting them to start hasn’t happened for so long. :/ From: Greg Mann > Reply-To: "user@mesos.apache.org"

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-11 Thread Greg Mann
Hi Justin, Do you have master/agent logs from a time when these tasks would have gone missing from the Mesos UI? What location are you using for the work_dir on the agents? Cheers, Greg On Mon, Apr 11, 2016 at 1:41 PM, Justin Ryan wrote: > Update : I noticed one of the

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-11 Thread Justin Ryan
I have 3 hosts running zookeeper, mesos masters, and marathon, an HDFS namenode, and 10 worker nodes running mesos-slave and HDFS datanodes. I don’t remember having set LIBPROCESS_IP in the past, maybe it’s part of some slightly newer code, so I went ahead and did this and verified

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-11 Thread Justin Ryan
Update : I noticed one of the clusters had a framework registered before I cleared ZK, but is now seeing the same failure at scheduler start. When the brokers do launch, in recent times, they disappear from mesos within a day, although they keep running. I have another thread on this list

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-11 Thread Kevin Lu
What's your master/slave setup? What are your ports/firewall rules? In the past, when weird situations like these happen to me, it's usually because of some firewall rule, and at that point I'll ssh into the machine where my framework is running and see what ports it's trying to talk to via

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-11 Thread Justin Ryan
I have tried it without the &&, and i can ‘broker stop’, then ‘broker start’ with no change, though I’ll make sure on my next zk clear to be sure to try without the && I am, indeed, not seeing the framework at all, and when this happens, the last line of output running the scheduler is:

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-11 Thread Kevin Lu
Can you try it step-by-step without the "&&"? Also, IIRC, mesos creates separate tasks for the kafka framework and the broker. Are you not even seeing the framework in the mesos UI? On Mon, Apr 11, 2016 at 1:29 PM, Justin Ryan wrote: > Hi, folks! > > In pursuit of a

mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-11 Thread Justin Ryan
Hi, folks! In pursuit of a mesos-based solution for distributed logging and processing, I’ve been experimenting with the mesos/kafka project (github.com/mesos/kafka). I had great success for some weeks during initial testing, and am now having trouble getting brokers to launch at all. This

Re: [VOTE] Release Apache Mesos 0.28.1 (rc2)

2016-04-11 Thread Kapil Arya
+1 (binding) CI runs with: amd64/centos/6 amd64/centos/7 amd64/debian/jessie amd64/ubuntu/precise amd64/ubuntu/trusty amd64/ubuntu/vivid amd64/ubuntu/wily On Wed, Apr 6, 2016 at 11:51 PM, Vinod Kone wrote: > +1 (binding) > > Tested on ASF CI. There was one flaky test

Re: SharedFilesystemIsolator (filesystem/shared)

2016-04-11 Thread Jie Yu
Hi Stephan, Last time I asked, looks like you're the only one that are using filesystem/shared isolator. Have you switched to filesystem/linux isolator? Please let us know if you run into any issue when switching. We plan to retire filesystem/shared isolator in the next Mesos release. I'll send

Re: orphaned_tasks cleanup and prevention method

2016-04-11 Thread June Taylor
While I was waiting for more info the app finally did start up. I am trying to figure out why it took so long. Thanks, June Taylor System Administrator, Minnesota Population Center University of Minnesota On Mon, Apr 11, 2016 at 9:50 AM, haosdent wrote: > Could you find

Re: orphaned_tasks cleanup and prevention method

2016-04-11 Thread haosdent
Could you find marathon in http://${YOUR_MASTER_IP}:${YOUR_MASTER_PORT}/#/frameworks page? And >While deploying I am looking at mesos-master.WARNING, mesos-master.INFO and mesos-master.ERROR log files, but I never see anything show up that would indicate a problem, or even an attempt. When you

Re: orphaned_tasks cleanup and prevention method

2016-04-11 Thread June Taylor
Hello again. I am not sure this has been resolved yet, because I am still unable to get Marathon deployments to start. I have deleted the /marathon/ node from Zookeeper, and I now have the Marathon WebUI accessible again. I try to add a new task to deploy, and there seem to be available

Re: Backup a Mesos Cluster

2016-04-11 Thread haosdent
Hi, @Paul. Mesos support recovery well when your server crash. For Mesos Master, I suggest to set up multiple masters with zookeeper, so that the Mesos cluster would not be affected after some of Mesos Master down. For Mesos Agent, it would recover tasks information after restart. About backup, I

Re: SharedFilesystemIsolator (filesystem/shared)

2016-04-11 Thread Erb, Stephan
Given that the "filesystem/linux" isolator has landed, is it now considered to be a drop-in replacement for the "filesystem/shared" isolator? From: Erb, Stephan Sent: Wednesday, July 29, 2015 21:08 To: user@mesos.apache.org Subject:

Re: Backup a Mesos Cluster

2016-04-11 Thread Paul Bell
Piotr, Thank you for this link. I am looking at it now where I right away notice that Exhibitor is designed to monitor (and backup) Zookeeper (but not anything related to Mesos itself). Don't the Mesos master & agent nodes keep at least some state outside of the ZK znodes, e.g., under the default

Re: Backup a Mesos Cluster

2016-04-11 Thread Piotr Szwed
Do you know Exhibitor? https://github.com/Netflix/exhibitor This could be a good starting point as it has implemented sort of backup mechanism of Zookeeper cluster status to S3. Cheers, 2016-04-11 13:27 GMT+02:00 Paul Bell : > Hi All, > > As we get closer to shipping a

Backup a Mesos Cluster

2016-04-11 Thread Paul Bell
Hi All, As we get closer to shipping a Mesos-based version of our product, we've turned our attention to "protecting" (supporting backup & recovery) of not only our application databases, but the cluster as well. I'm not quite sure how to begin thinking about this, but I suppose the usual

Re: Mesos 0.28 SSL in official packages

2016-04-11 Thread Adam Bordelon
Hi Kamil, Technically, there are no "official" Apache-built packages for Apache Mesos. At least once company (Mesosphere) chooses to build and distribute Mesos packages, but does not currently offer SSL builds. It wouldn't be hard to add an SSL build to our regular builds, but it hasn't been