Re: Review Request 13187: Fixed slave to properly handle terminated tasks that have pending updates.

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13187/#review24419 --- Ship it! src/slave/slave.cpp

make check failure...

2013-08-01 Thread Kevin Lyda
Running make check on master from github (bfdfdce) and I'm getting a failure: [ OK ] AllocatorZooKeeperTest/0.SlaveReregistersFirst (2098 ms) [--] 2 tests from AllocatorZooKeeperTest/0 (4242 ms total) [--] Global test environment tear-down [==] 240 tests from 46

Re: Review Request 13045: cgroup_isolator: Isolate the exectuor and tasks in a pid namespace.

2013-08-01 Thread Matthew Farrellee
On July 31, 2013, 6:41 p.m., Matthew Farrellee wrote: Non-code comment from the peanut gallery - technically cgroups and namespaces are separate. You get the benefits of killing an entire group of processes w/ cgroups and get group stats (mem, cpu, blk), but you don't get the

Re: Review Request 13187: Fixed slave to properly handle terminated tasks that have pending updates.

2013-08-01 Thread Vinod Kone
On Aug. 1, 2013, 6:48 a.m., Benjamin Hindman wrote: src/slave/slave.cpp, line 865 https://reviews.apache.org/r/13187/diff/1/?file=332291#file332291line865 Remove framework framework. ;) Why the change? The reason I initially changed it was because I was writing a test where I

[jira] [Created] (MESOS-608) Move Jenkins code out of the mesos repo to Jenkins CI repo

2013-08-01 Thread Vinod Kone (JIRA)
Vinod Kone created MESOS-608: Summary: Move Jenkins code out of the mesos repo to Jenkins CI repo Key: MESOS-608 URL: https://issues.apache.org/jira/browse/MESOS-608 Project: Mesos Issue Type:

[jira] [Commented] (MESOS-608) Move Jenkins code out of the mesos repo to Jenkins CI repo

2013-08-01 Thread Chris Aniszczyk (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13726655#comment-13726655 ] Chris Aniszczyk commented on MESOS-608: --- For now we can put it here Vinod:

Re: Review Request 13045: cgroup_isolator: Isolate the exectuor and tasks in a pid namespace.

2013-08-01 Thread Timothy St. Clair
On July 31, 2013, 6:41 p.m., Matthew Farrellee wrote: Non-code comment from the peanut gallery - technically cgroups and namespaces are separate. You get the benefits of killing an entire group of processes w/ cgroups and get group stats (mem, cpu, blk), but you don't get the

[jira] [Commented] (MESOS-608) Move Jenkins code out of the mesos repo to Jenkins CI repo

2013-08-01 Thread Benjamin Hindman (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13726681#comment-13726681 ] Benjamin Hindman commented on MESOS-608: Any reason not to go to Jenkins CI github

[jira] [Commented] (MESOS-608) Move Jenkins code out of the mesos repo to Jenkins CI repo

2013-08-01 Thread Chris Aniszczyk (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13726682#comment-13726682 ] Chris Aniszczyk commented on MESOS-608: --- Let me see how quickly I can get an account

[jira] [Commented] (MESOS-608) Move Jenkins code out of the mesos repo to Jenkins CI repo

2013-08-01 Thread Chris Aniszczyk (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13726692#comment-13726692 ] Chris Aniszczyk commented on MESOS-608: --- I sent an email to jenkins dev asking for

Re: java7 jars

2013-08-01 Thread Matei Zaharia
BTW, out of curiosity, are the earlier Mesos JARS (like 0.11) built with Java 6? I'd like to switch back to one that depends on Java 6 temporarily in Spark, since we now switched to 0.12 and some people have trouble building it. There's no code change required to use an older version AFAIK so

[jira] [Updated] (MESOS-543) proper auto-tools dependency checking

2013-08-01 Thread Timothy St. Clair (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy St. Clair updated MESOS-543: Description: Currently mesos repo includes direct version dependencies in tarballs and does

Re: make check failure...

2013-08-01 Thread Ian Downes
The documentation is out of date and lxc has been deprecated; cgroups are used directly instead. We'll be updating the documentation shortly to reflect this. Installing lxc possibly mounted cgroups and led to the slew of busy errors. Could you please post the output of `mount` and, if you have

Re: java7 jars

2013-08-01 Thread Benjamin Hindman
Flo's stop-gap will get things building Matei (I was building Spark with 0.14.0 for the past week). The 0.12.1 vote will be out today, and hopefully released 72 hours after that! On Thu, Aug 1, 2013 at 12:00 PM, Florian Leibert f...@mesosphe.re wrote: We have a 0.14.0 jar in our public repo:

Re: Review Request 13194: Updated configure.ac to properly check for Python.h.

2013-08-01 Thread Vinod Kone
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13194/#review24463 --- Ship it! Ship It! - Vinod Kone On Aug. 1, 2013, 6:52 p.m.,

Re: Review Request 13006: Link to executor sandbox from framework template

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13006/#review24464 --- Ship it! LESS CLICKS! ;) - Benjamin Hindman On Aug. 1, 2013,

Re: make check failure...

2013-08-01 Thread Ian Downes
Looks like cgroups are mounted on /sys/fs/cgroup so I'd expect the tests to fail again. Are you running systemd? If the tests do fail, and you're not running systemd, you should try manually umount'ing cgroups and re-running the tests (no need to rebuild, just run `make check` again). Ian On

Re: rpm dependency packaging (stout libprocess)

2013-08-01 Thread Tim St Clair
Thanks vinod! - Original Message - From: Vinod Kone vinodk...@gmail.com To: dev dev@mesos.apache.org Sent: Thursday, August 1, 2013 2:10:46 PM Subject: Re: rpm dependency packaging (stout libprocess) Hey Tim. We are in the midst of releasing 0.12.1. I and other members of the

Re: make check failure...

2013-08-01 Thread Kevin Lyda
Not running systemd. Ubuntu uses upstart. cgroups are mounted by /etc/init/cgconfig.conf which is installed by cgroup-bin (which I need for lscgroup). I turned that off by doing: service cgconfig stop echo manual /etc/init/cgconfig.override The echo line turns it off permanently. OK, ran

Re: make check failure...

2013-08-01 Thread Benjamin Mahler
That test is broken on master currently, the ticket is here: MESOS-487https://issues.apache.org/jira/browse/MESOS-487 On Thu, Aug 1, 2013 at 1:18 PM, Kevin Lyda ke...@ie.suberic.net wrote: Not running systemd. Ubuntu uses upstart. cgroups are mounted by /etc/init/cgconfig.conf which is

Re: make check failure...

2013-08-01 Thread Ian Downes
The failing test is an integration test that seems to be triggering a kernel bug. All other tests are passing so you should just ignore it for now; Mesos will run. There is some discussion here resulting from this same test failing: https://lkml.org/lkml/2013/7/28/74 Does running the test

[jira] [Comment Edited] (MESOS-510) Multi-slave local runs broken due to non generated libprocess Process ID names.

2013-08-01 Thread Vinod Kone (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13726936#comment-13726936 ] Vinod Kone edited comment on MESOS-510 at 8/1/13 9:45 PM: -- Should

Re: Review Request 13176: Cleaned up the pre-installed wrappers.

2013-08-01 Thread Benjamin Hindman
On Aug. 1, 2013, 12:08 a.m., Vinod Kone wrote: bin/mesos-slave-flags.sh.in, lines 24-26 https://reviews.apache.org/r/13176/diff/1/?file=331631#file331631line24 s/libmesos/limesos-@{VERSION}/ to be consistent with how we do inside launcher and test environment? Done.

Re: Review Request 13182: Add '-fno-strict-aliasing' for gcc 4.4.*.

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13182/ --- (Updated Aug. 1, 2013, 9:47 p.m.) Review request for mesos, Ben Mahler and

Re: Review Request 13006: Link to executor sandbox from framework template

2013-08-01 Thread Ben Mahler
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13006/#review24478 --- Ship it! Ship It! - Ben Mahler On Aug. 1, 2013, 4:24 a.m., Ross

[jira] [Assigned] (MESOS-485) Latest trunk fails on strict aliasing on CentOS

2013-08-01 Thread Benjamin Hindman (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Hindman reassigned MESOS-485: -- Assignee: Benjamin Hindman Latest trunk fails on strict aliasing on CentOS

[jira] [Resolved] (MESOS-485) Latest trunk fails on strict aliasing on CentOS

2013-08-01 Thread Benjamin Hindman (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Hindman resolved MESOS-485. Resolution: Fixed Fix Version/s: 0.14.0 For now, we'll just add

[jira] [Commented] (MESOS-485) Latest trunk fails on strict aliasing on CentOS

2013-08-01 Thread Benjamin Hindman (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13726952#comment-13726952 ] Benjamin Hindman commented on MESOS-485: https://reviews.apache.org/r/13182/

Re: Review Request 13196: Improved StatusUpdateManager's API by flattening the return types.

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13196/#review24480 --- Ship it! src/slave/slave.cpp

Re: Review Request 13196: Improved StatusUpdateManager's API by flattening the return types.

2013-08-01 Thread Ben Mahler
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13196/#review24481 --- Ship it! Akin to stout/error.hpp, how do you both feel about

Re: Review Request 13196: Improved StatusUpdateManager's API by flattening the return types.

2013-08-01 Thread Benjamin Hindman
On Aug. 1, 2013, 10:04 p.m., Ben Mahler wrote: Akin to stout/error.hpp, how do you both feel about adding something like Failure or Failed that is implicitly cast-able to the right FutureT type? Sounds great, I'd be happy with Error = failed Future too, but maybe start with 'Failure'.

Re: java7 jars

2013-08-01 Thread Matei Zaharia
I've just reverted to 0.9 until there's another JAR in Maven Central that I can use. Note that it's fine if you build it with Java 7, just set the target to Java 6. Matei On Aug 1, 2013, at 12:04 PM, Benjamin Hindman b...@eecs.berkeley.edu wrote: Flo's stop-gap will get things building Matei

Re: Review Request 13194: Updated configure.ac to properly check for Python.h.

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13194/ --- (Updated Aug. 1, 2013, 10:50 p.m.) Review request for mesos, Ben Mahler,

Re: make check failure...

2013-08-01 Thread Eric W. Biederman
Benjamin Mahler benjamin.mah...@gmail.com writes: That test is broken on master currently, the ticket is here: MESOS-487https://issues.apache.org/jira/browse/MESOS-487 And the fix for the broken test is in: https://reviews.apache.org/r/13034/ Kevin Your first run of the tests is current

[jira] [Resolved] (MESOS-179) Need to check for Python development headers

2013-08-01 Thread Benjamin Hindman (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Hindman resolved MESOS-179. Resolution: Fixed Fix Version/s: 0.12.1 0.14.0 Assignee:

[jira] [Updated] (MESOS-179) Need to check for Python development headers

2013-08-01 Thread Benjamin Hindman (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Hindman updated MESOS-179: --- Affects Version/s: 0.10.0 0.11.0 Need to check for Python

[jira] [Resolved] (MESOS-597) Set MESOS_NATIVE_LIBRARY or (DY)LD_LIBRARY_PATH before launching an executor in order to enable JVM based executors to easily find libmesos.so.

2013-08-01 Thread Benjamin Hindman (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Hindman resolved MESOS-597. Resolution: Fixed Fix Version/s: 0.12.1 0.14.0

Re: java7 jars

2013-08-01 Thread Vinod Kone
yes, we updated our makefile to build all our jars to 6. all our releases from now on (starting from 0.12.1) will be built for java 6. On Thu, Aug 1, 2013 at 3:46 PM, Matei Zaharia matei.zaha...@gmail.comwrote: I've just reverted to 0.9 until there's another JAR in Maven Central that I can

[jira] [Updated] (MESOS-599) Make sure stderr/stdout get launcher output.

2013-08-01 Thread Benjamin Hindman (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Hindman updated MESOS-599: --- Affects Version/s: 0.9.0 0.10.0 0.11.0

[jira] [Resolved] (MESOS-599) Make sure stderr/stdout get launcher output.

2013-08-01 Thread Benjamin Hindman (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Hindman resolved MESOS-599. Resolution: Fixed Fix Version/s: 0.12.1 0.14.0 Make sure

[jira] [Created] (MESOS-609) Executor should remove the task from queuedTasks when it moves a queued task to terminatedTasks.

2013-08-01 Thread Vinod Kone (JIRA)
Vinod Kone created MESOS-609: Summary: Executor should remove the task from queuedTasks when it moves a queued task to terminatedTasks. Key: MESOS-609 URL: https://issues.apache.org/jira/browse/MESOS-609

Review Request 13205: Don't make the mesos-*-flags.sh executable.

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13205/ --- Review request for mesos and Vinod Kone. Repository: mesos-git Description

Review Request 13206: Use 'exec' to launch mesos-slave from wrapper.

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13206/ --- Review request for mesos and Vinod Kone. Repository: mesos-git Description

Re: Review Request 13206: Use 'exec' to launch mesos-slave from wrapper.

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13206/ --- (Updated Aug. 1, 2013, 11:47 p.m.) Review request for mesos and Vinod Kone.

Re: Review Request 13205: Don't make the mesos-*-flags.sh executable.

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13205/ --- (Updated Aug. 1, 2013, 11:46 p.m.) Review request for mesos and Vinod Kone.

Review Request 13207: Fixed slave to remove a queued task when the task is terminated.

2013-08-01 Thread Vinod Kone
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13207/ --- Review request for mesos, Benjamin Hindman and Ben Mahler. Bugs: MESOS-609

[jira] [Created] (MESOS-610) Split slave specific tests out of master_tests

2013-08-01 Thread Vinod Kone (JIRA)
Vinod Kone created MESOS-610: Summary: Split slave specific tests out of master_tests Key: MESOS-610 URL: https://issues.apache.org/jira/browse/MESOS-610 Project: Mesos Issue Type: Improvement

Review Request 13211: Fixed bug where active TaskTrackers are killed.

2013-08-01 Thread Brenden Matthews
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13211/ --- Review request for mesos. Repository: mesos-git Description --- Fixed

Re: Review Request 13077: Updated Mesos mapred-site.xml for Hadoop.

2013-08-01 Thread Brenden Matthews
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13077/ --- (Updated Aug. 2, 2013, 1:47 a.m.) Review request for mesos. Changes ---

Re: Review Request 12924: Refactor Mesos scheduler into fixed/variable policies.

2013-08-01 Thread Brenden Matthews
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/12924/#review24499 --- I'm abandoning this review in favour of:

Re: Review Request 13211: Fixed bug where active TaskTrackers are killed.

2013-08-01 Thread Brenden Matthews
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13211/ --- (Updated Aug. 2, 2013, 1:52 a.m.) Review request for mesos. Changes ---

Review Request 13213: Refactored JobTracker into fixed/variable policies.

2013-08-01 Thread Brenden Matthews
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13213/ --- Review request for mesos. Repository: mesos-git Description ---

Re: Review Request 13212: Allowed JobTracker to use unknown trackers.

2013-08-01 Thread Brenden Matthews
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13212/ --- (Updated Aug. 2, 2013, 1:52 a.m.) Review request for mesos. Changes ---

Re: Review Request 13207: Fixed slave to remove a queued task when the task is terminated.

2013-08-01 Thread Benjamin Hindman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13207/#review24506 --- Ship it! src/tests/master_tests.cpp

Review Request 13216: Refactored slave recovery so that slave recovers its state before the isolator and status update manager.

2013-08-01 Thread Vinod Kone
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13216/ --- Review request for mesos, Benjamin Hindman and Ben Mahler. Bugs: MESOS-611

[jira] [Commented] (MESOS-611) Refactor slave recovery to ensure slave recovers its state first

2013-08-01 Thread Vinod Kone (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13727351#comment-13727351 ] Vinod Kone commented on MESOS-611: -- https://reviews.apache.org/r/13216/

Re: Review Request 13216: Refactored slave recovery so that slave recovers its state before the isolator and status update manager.

2013-08-01 Thread Vinod Kone
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/13216/ --- (Updated Aug. 2, 2013, 5:27 a.m.) Review request for mesos, Benjamin Hindman

[jira] [Created] (MESOS-612) Slave should not send redundant ExecutorTerminated messages during recovery

2013-08-01 Thread Vinod Kone (JIRA)
Vinod Kone created MESOS-612: Summary: Slave should not send redundant ExecutorTerminated messages during recovery Key: MESOS-612 URL: https://issues.apache.org/jira/browse/MESOS-612 Project: Mesos