[jira] [Updated] (MESOS-790) Make recovering frameworks in the Slave asynchronous.

2014-05-16 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-790: -- Fix Version/s: (was: 0.19.0) Make recovering frameworks in the Slave asynchronous

[jira] [Updated] (MESOS-1285) metrics names shouldn't include problematic characters

2014-05-16 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1285: --- Fix Version/s: (was: 0.19.0) metrics names shouldn't include problematic characters

[jira] [Updated] (MESOS-666) Include CHANGELOG and docs/ in distributions.

2014-05-16 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-666: -- Fix Version/s: (was: 0.19.0) Include CHANGELOG and docs/ in distributions

[jira] [Updated] (MESOS-551) Scheduler driver should acknowledge duplicate status updates

2014-05-16 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-551: -- Fix Version/s: (was: 0.19.0) Scheduler driver should acknowledge duplicate status updates

[jira] [Updated] (MESOS-786) Update semantics of when framework registered()/reregistered() get called

2014-05-16 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-786: -- Fix Version/s: (was: 0.19.0) Update semantics of when framework registered()/reregistered

[jira] [Updated] (MESOS-784) slave should allocate all the rest resources even given a resources flag

2014-05-16 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-784: -- Fix Version/s: (was: 0.19.0) slave should allocate all the rest resources even given

[jira] [Updated] (MESOS-1283) reintroduce LIBPROCESS_STATISTICS_WINDOW

2014-05-16 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1283: --- Fix Version/s: (was: 0.19.0) reintroduce LIBPROCESS_STATISTICS_WINDOW

Re: 0.19.0

2014-05-16 Thread Benjamin Mahler
to 0.20.0 arbitrarily. Please let me know if there are any issues with this. On Fri, May 16, 2014 at 1:03 PM, Benjamin Mahler benjamin.mah...@gmail.comwrote: Update: Aiming to cut for next Friday or so. Will start wrangling 0.19.0 JIRA tickets as appropriate. On Tue, May 6, 2014 at 12:27 PM

[jira] [Updated] (MESOS-1332) Improve Master and Slave metric names

2014-05-15 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1332: --- Assignee: Dominic Hamon Improve Master and Slave metric names

[jira] [Commented] (MESOS-1321) Add killTask integration tests for a strict registry.

2014-05-15 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13993774#comment-13993774 ] Benjamin Mahler commented on MESOS-1321: Part 1 committed: {noformat} commit

Re: Question on resource offers and framework failover

2014-05-15 Thread Benjamin Mahler
Where as, a TASK_LOST will make me (unnecessarily, in this case) try to ensure that the task is actually lost, not running away on the slave that got disconnected from Mesos master. Not all environments may need the distinction, but at least some do. To be clear, are you still planning to

[jira] [Commented] (MESOS-1332) Improve Master and Slave metric names

2014-05-15 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13997807#comment-13997807 ] Benjamin Mahler commented on MESOS-1332: The namespacing idea actually came up

[jira] [Commented] (MESOS-1208) 3rdparty/libprocess/3rdparty/boost-1.53.0/boost/math/special_functions/sign.hpp:113:55: error: typedef 'fp_tag' locally defined but not used [-Werror=unused-local-typed

2014-05-15 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13998355#comment-13998355 ] Benjamin Mahler commented on MESOS-1208: This looks like a problem in Boost 1.53

[jira] [Commented] (MESOS-1132) Port master stats.json over to new metrics library

2014-05-15 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13997911#comment-13997911 ] Benjamin Mahler commented on MESOS-1132: First review is committed: {noformat

[jira] [Commented] (MESOS-1132) Port master stats.json over to new metrics library

2014-05-15 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13998243#comment-13998243 ] Benjamin Mahler commented on MESOS-1132: {noformat} commit

[jira] [Created] (MESOS-1336) Subprocess can lead to a blocked child process when input/output splicing is not performed by the caller.

2014-05-15 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1336: -- Summary: Subprocess can lead to a blocked child process when input/output splicing is not performed by the caller. Key: MESOS-1336 URL: https://issues.apache.org/jira/browse

[jira] [Created] (MESOS-1362) Flaky test: SlaveRecoveryTest/0.RemoveNonCheckpointingFramework

2014-05-14 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1362: -- Summary: Flaky test: SlaveRecoveryTest/0.RemoveNonCheckpointingFramework Key: MESOS-1362 URL: https://issues.apache.org/jira/browse/MESOS-1362 Project: Mesos

[jira] [Commented] (MESOS-1359) The SlaveTest.ROOT_RunTaskWithCommandInfoWithUser test is flaky / broken.

2014-05-14 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13997860#comment-13997860 ] Benjamin Mahler commented on MESOS-1359: My hunch would be that the following

[jira] [Created] (MESOS-1348) The SlaveRecoveryTest.GCExecutor test leaks child processes.

2014-05-13 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1348: -- Summary: The SlaveRecoveryTest.GCExecutor test leaks child processes. Key: MESOS-1348 URL: https://issues.apache.org/jira/browse/MESOS-1348 Project: Mesos

Re: Review Request 20850: Customize the configuration of logging level

2014-05-13 Thread Benjamin Mahler
Sorry for the delay, I've been bogged down quite a bit lately but I will take a look today! Sent from my iPhone On May 7, 2014, at 5:17 AM, Alexandra Sava alexandrasav...@gmail.com wrote: Hi guys, Any feedback for my changes would be welcomed. Thanks, Alexandra On 1 May 2014

[jira] [Created] (MESOS-1359) The SlaveTest.ROOT_RunTaskWithCommandInfoWithUser test is flaky / broken.

2014-05-13 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1359: -- Summary: The SlaveTest.ROOT_RunTaskWithCommandInfoWithUser test is flaky / broken. Key: MESOS-1359 URL: https://issues.apache.org/jira/browse/MESOS-1359 Project

[jira] [Created] (MESOS-1361) Flaky test: SlaveRecoveryTest/0.RecoverCompletedExecutor

2014-05-13 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1361: -- Summary: Flaky test: SlaveRecoveryTest/0.RecoverCompletedExecutor Key: MESOS-1361 URL: https://issues.apache.org/jira/browse/MESOS-1361 Project: Mesos

[jira] [Commented] (MESOS-1332) Improve Master and Slave metric names

2014-05-13 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13997123#comment-13997123 ] Benjamin Mahler commented on MESOS-1332: Great point [~adam-mesos], we definitely

Re: Build failed in Jenkins: Mesos-Trunk-Ubuntu-Build-Out-Of-Src-Disable-Java-Disable-Python-Disable-Webui #2120

2014-05-12 Thread Benjamin Mahler
Filed: https://issues.apache.org/jira/browse/MESOS-1348 On Fri, May 9, 2014 at 2:21 PM, Apache Jenkins Server jenk...@builds.apache.org wrote: See https://builds.apache.org/job/Mesos-Trunk-Ubuntu-Build-Out-Of-Src-Disable-Java-Disable-Python-Disable-Webui/2120/changes Changes:

Re: Build failed in Jenkins: Mesos-Ubuntu-distcheck #79

2014-05-12 Thread Benjamin Mahler
Filed https://issues.apache.org/jira/browse/MESOS-1347 On Tue, May 6, 2014 at 9:37 PM, Apache Jenkins Server jenk...@builds.apache.org wrote: See https://builds.apache.org/job/Mesos-Ubuntu-distcheck/79/changes Changes: [benjamin.hindman] Enabled unbundled usage of http-parser.

Re: Mesos Wire Protocol Documentation

2014-05-11 Thread Benjamin Mahler
Content-Type: application/x-protobuf Libprocess-From: scheduler(1)@127.0.0.1:8080 Accept-Encoding: gzip HTTP Response 2014/05/09 07:27:11 Response Body: HTTP/1.1 202 Accepted Content-Length: 0 Date: Fri, 09 May 2014 11:27:11 GMT On Tue, May 6, 2014 at 4:06 PM, Benjamin Mahler benjamin.mah

[jira] [Created] (MESOS-1331) SlaveRecoveryTest/0.NonCheckpointingFramework is flaky.

2014-05-10 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1331: -- Summary: SlaveRecoveryTest/0.NonCheckpointingFramework is flaky. Key: MESOS-1331 URL: https://issues.apache.org/jira/browse/MESOS-1331 Project: Mesos

[jira] [Comment Edited] (MESOS-1332) Improve Master and Slave metric names

2014-05-10 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13993287#comment-13993287 ] Benjamin Mahler edited comment on MESOS-1332 at 5/9/14 5:57 PM

[jira] [Created] (MESOS-1311) 0.19.0 Release

2014-05-06 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1311: -- Summary: 0.19.0 Release Key: MESOS-1311 URL: https://issues.apache.org/jira/browse/MESOS-1311 Project: Mesos Issue Type: Task Reporter

0.19.0

2014-05-06 Thread Benjamin Mahler
Hi all, In the coming weeks, we would like to cut 0.19.0 on the master branch. I would like to volunteer to be the release manager. I've added a ticket here to help track any dependencies: https://issues.apache.org/jira/browse/MESOS-1311 Namely, there are two major features going out: 1.

Re: 0.19.0

2014-05-06 Thread Benjamin Mahler
On second thought, keeping an accurate 'Fix Version' for 0.19.0 would be the better approach than linking things. I will start to update things that haven't made it in 0.19.0 over the coming weeks. On Tue, May 6, 2014 at 12:22 PM, Benjamin Mahler benjamin.mah...@gmail.comwrote: Hi all

Re: Mesos Wire Protocol Documentation

2014-05-06 Thread Benjamin Mahler
, Benjamin Mahler benjamin.mah...@gmail.com wrote: Brian, I'm curious whether most python HTTP libraries support using custom HTTP request headers. We'll land benh's patches to send 202 responses in 0.19.0: https://reviews.apache.org/r/20276/ https://reviews.apache.org/r/20277/ Once these get

[jira] [Created] (MESOS-1314) Update default registry to replicated_log.

2014-05-06 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1314: -- Summary: Update default registry to replicated_log. Key: MESOS-1314 URL: https://issues.apache.org/jira/browse/MESOS-1314 Project: Mesos Issue Type

[jira] [Created] (MESOS-1315) Update master to use a strict registry, by default.

2014-05-06 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1315: -- Summary: Update master to use a strict registry, by default. Key: MESOS-1315 URL: https://issues.apache.org/jira/browse/MESOS-1315 Project: Mesos Issue

[jira] [Created] (MESOS-1317) Add integration tests to enforce the semantics of a strict registry.

2014-05-06 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1317: -- Summary: Add integration tests to enforce the semantics of a strict registry. Key: MESOS-1317 URL: https://issues.apache.org/jira/browse/MESOS-1317 Project

[jira] [Created] (MESOS-1319) Add recovery integration tests for a strict registry.

2014-05-06 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1319: -- Summary: Add recovery integration tests for a strict registry. Key: MESOS-1319 URL: https://issues.apache.org/jira/browse/MESOS-1319 Project: Mesos

[jira] [Created] (MESOS-1320) Add reconciliation integration tests for a strict registry.

2014-05-06 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1320: -- Summary: Add reconciliation integration tests for a strict registry. Key: MESOS-1320 URL: https://issues.apache.org/jira/browse/MESOS-1320 Project: Mesos

[jira] [Updated] (MESOS-1320) Add reconciliation integration tests for a strict registry.

2014-05-06 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1320: --- Description: * Reconciliation: unknown slave - TASK_LOST * Reconciliation: transitioning

[jira] [Created] (MESOS-1321) Add killTask integration tests for a strict registry.

2014-05-06 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1321: -- Summary: Add killTask integration tests for a strict registry. Key: MESOS-1321 URL: https://issues.apache.org/jira/browse/MESOS-1321 Project: Mesos

[jira] [Updated] (MESOS-1322) Add failover integration tests for a strict registry.

2014-05-06 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1322: --- Description: * Once slave is removed, slave cannot re-register with a failed over master

[jira] [Updated] (MESOS-1321) Add killTask integration tests for a strict registry.

2014-05-06 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1321: --- Description: * killTask: unknown slave - TASK_LOST * killTask: transitioning slave

[jira] [Created] (MESOS-1322) Add failover integration tests for a strict registry.

2014-05-06 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1322: -- Summary: Add failover integration tests for a strict registry. Key: MESOS-1322 URL: https://issues.apache.org/jira/browse/MESOS-1322 Project: Mesos

Re: Build failed in Jenkins: Mesos-Ubuntu-distcheck #71

2014-05-02 Thread Benjamin Mahler
+adam, vinod Looks like this is re-authentication causing the master to call disconnect(Slave*) leading to TASK_LOST (MESOS-1264), can you guys take a look? On Thu, May 1, 2014 at 11:51 PM, Apache Jenkins Server jenk...@builds.apache.org wrote: See

Re: Build failed in Jenkins: Mesos-Ubuntu-distcheck #71

2014-05-02 Thread Benjamin Mahler
Should we disable this test until there's a fix for MESOS-1264? (Sorry, I initially thought this was a new flaky test). On Fri, May 2, 2014 at 10:13 AM, Benjamin Mahler benjamin.mah...@gmail.comwrote: +adam, vinod Looks like this is re-authentication causing the master to call disconnect

[jira] [Updated] (MESOS-1264) Slave authentication retries can trigger TASK_LOST for non-checkpointing frameworks.

2014-05-02 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1264: --- Fix Version/s: 0.19.0 Slave authentication retries can trigger TASK_LOST for non

Re: [VOTE] Release Apache Mesos 0.18.1 (rc2)

2014-05-02 Thread Benjamin Mahler
Looks like --without-cxx11 is broken for gcc-4.2.1, not sure if that should be a blocker for this because I don't believe there is a fix for this yet! [bmahler@smf1-aye-26-sr4 mesos-0.18.1]$ ./configure --disable-optimize --without-cxx11 make check -j8 ... libtool: compile: g++

[jira] [Commented] (MESOS-1274) Disallow further operations in the Registrar when a failure occurs.

2014-05-01 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13987203#comment-13987203 ] Benjamin Mahler commented on MESOS-1274: https://reviews.apache.org/r/20981

Re: Review Request 20971: Update process::reap to use os::exists(pid).

2014-05-01 Thread Benjamin Mahler
Looks like you are missing the dependency here? On Thu, May 1, 2014 at 6:01 PM, Vinod Kone vinodk...@gmail.com wrote: --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/20971/#review41994

Re: Mesos Wire Protocol Documentation

2014-04-30 Thread Benjamin Mahler
nikita.vetosh...@gmail.com wrote: BTW, there is also somehow related ticket https://issues.apache.org/jira/browse/MESOS-930 On Wed, Apr 9, 2014 at 9:54 PM, Benjamin Mahler benjamin.mah...@gmail.comwrote: I thought the low-level api being referred

Re: OPW Summer 2014

2014-04-30 Thread Benjamin Mahler
Welcome! Looking forward to seeing your contributions! On Tue, Apr 29, 2014 at 5:44 PM, Benjamin Hindman b...@eecs.berkeley.eduwrote: Welcome Isabel! On Mon, Apr 28, 2014 at 3:35 PM, Vinod Kone vinodk...@gmail.com wrote: Welcome and good luck with your internship! Also, see here

Re: Build failed in Jenkins: Mesos-Trunk-Ubuntu-Build-In-Src-Set-JAVA_HOME #1821

2014-04-30 Thread Benjamin Mahler
Is this being tracked? I've found another issue perhaps: [ RUN ] ExternalContainerizerTest.Launch ../../src/tests/cluster.hpp:462: Failure (wait).failure(): External containerizer failed (status: 1) F0430 15:44:34.437077 13360 logging.cpp:55] RAW: Pure virtual method called @

[jira] [Commented] (MESOS-1250) ReservationAllocatorTest.ResourcesReturned flaky - repeated_field.h:824 CHECK failed: (index) (size())

2014-04-30 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13985667#comment-13985667 ] Benjamin Mahler commented on MESOS-1250: [~xujyan] Is this assigned to you

Re: Build failed in Jenkins: Mesos-Trunk-Ubuntu-Build-Out-Of-Src-Disable-Java-Disable-Python-Disable-Webui #2101

2014-04-30 Thread Benjamin Mahler
Just pushed the fix. On Wed, Apr 30, 2014 at 8:31 AM, Benjamin Mahler benjamin.mah...@gmail.comwrote: Thanks for digging in Adam! I will get this fixed. On Wed, Apr 30, 2014 at 12:29 AM, Adam Bordelon a...@mesosphere.iowrote: This is dying inside the first line

[jira] [Created] (MESOS-1274) Disallow further operations in the Registrar when a failure occurs.

2014-04-30 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1274: -- Summary: Disallow further operations in the Registrar when a failure occurs. Key: MESOS-1274 URL: https://issues.apache.org/jira/browse/MESOS-1274 Project: Mesos

Re: Review Request 20807: [MESOS-1259]: Enrich the Java Docs in the src/java files.

2014-04-29 Thread Benjamin Mahler
Thanks for the javadoc pointer, your indentation changes SGTM! On Mon, Apr 28, 2014 at 11:03 PM, Bernardo Gomez Palacio bernardo.gomezpala...@gmail.com wrote: This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/20807/ On April 29th, 2014, 1:03 a.m.

[jira] [Commented] (MESOS-1253) Make HTTP endpoint browsable

2014-04-28 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13983505#comment-13983505 ] Benjamin Mahler commented on MESOS-1253: Which part do you disagree with? Can you

[jira] [Commented] (MESOS-1258) 0.18.0-rc3: F0427 02:48:30.603756 62192 group.cpp:326] Check failed: state == CONNECTED || state == AUTHENTICATED || state == READY 1

2014-04-28 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13983648#comment-13983648 ] Benjamin Mahler commented on MESOS-1258: It's rare, even in large scale production

[jira] [Commented] (MESOS-1264) Slave authentication retries can trigger TASK_LOST for non-checkpointing frameworks.

2014-04-28 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13983751#comment-13983751 ] Benjamin Mahler commented on MESOS-1264: [~vinodkone] and [~adam-mesos], can one

[jira] [Created] (MESOS-1264) Slave authentication retries can trigger TASK_LOST for non-checkpointing frameworks.

2014-04-28 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1264: -- Summary: Slave authentication retries can trigger TASK_LOST for non-checkpointing frameworks. Key: MESOS-1264 URL: https://issues.apache.org/jira/browse/MESOS-1264

[jira] [Commented] (MESOS-361) Restrict the character space of user provided TaskIDs.

2014-04-25 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13981291#comment-13981291 ] Benjamin Mahler commented on MESOS-361: --- [~jalaziz] is (2) a bug in Mesos or Chronos

Re: Build failed in Jenkins: Mesos-Ubuntu-distcheck #51

2014-04-24 Thread Benjamin Mahler
This issue should be resolved now with 397dc81. On Thu, Apr 24, 2014 at 10:48 AM, Apache Jenkins Server jenk...@builds.apache.org wrote: See https://builds.apache.org/job/Mesos-Ubuntu-distcheck/51/changes Changes: [benjamin.hindman] Cleanups in configure.ac for libjvm detection.

Re: Review Request 19176: Eliminated Framework.id in favor of Framework.info.id.

2014-04-24 Thread Benjamin Mahler
I will take a look, thanks Adam! On Wed, Apr 23, 2014 at 6:55 PM, Adam B a...@mesosphere.io wrote: --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/19176/

Re: Build failed in Jenkins: Mesos-Trunk-Ubuntu-Build-Out-Of-Src-Set-JAVA_HOME #2081

2014-04-23 Thread Benjamin Mahler
https://issues.apache.org/jira/browse/MESOS-1152 On Wed, Apr 23, 2014 at 1:08 AM, Apache Jenkins Server jenk...@builds.apache.org wrote: See https://builds.apache.org/job/Mesos-Trunk-Ubuntu-Build-Out-Of-Src-Set-JAVA_HOME/2081/ -- [...truncated

[jira] [Created] (MESOS-1226) Add flags for replicated log backed registry.

2014-04-21 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1226: -- Summary: Add flags for replicated log backed registry. Key: MESOS-1226 URL: https://issues.apache.org/jira/browse/MESOS-1226 Project: Mesos Issue Type

[jira] [Updated] (MESOS-1226) Add flags for replicated log backed registry.

2014-04-21 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1226: --- Description: As we're planning to add a non-strict replicated log backed registry in 0.19.0

[jira] [Updated] (MESOS-1226) Add flags for replicated log backed registry.

2014-04-21 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1226: --- Description: As we're planning to add a non-strict replicated log backed registry in 0.19.0

Re: Trying to get task reconciliation to work

2014-04-18 Thread Benjamin Mahler
unknown, lost, or will nothing respond? On Thursday, April 17, 2014, Sharma Podila spod...@netflix.com wrote: No problem, I have a better understanding now. And it was useful to see the three items you listed explicitly. On Thu, Apr 17, 2014 at 2:39 PM, Benjamin Mahler

[jira] [Updated] (MESOS-1221) Slave should update the containerizers with executor resources after recovery

2014-04-18 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1221: --- Issue Type: Bug (was: Improvement) Moving this to a bug since this is a regression

[jira] [Comment Edited] (MESOS-1221) Slave should update the containerizers with executor resources after recovery

2014-04-18 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13974322#comment-13974322 ] Benjamin Mahler edited comment on MESOS-1221 at 4/18/14 6:22 PM

Re: Trying to get task reconciliation to work

2014-04-18 Thread Benjamin Mahler
for frameworks that fail to failover in a timely manner, and then ask for tasks that belonged to their previous FrameworkID. On Fri, Apr 18, 2014 at 1:55 PM, Benjamin Mahler benjamin.mah...@gmail.comwrote: Vinod, David is asking about tasks that belong to the framework in that they were

Re: Review Request 20502: Introduced exponential backoff for slave registration retries.

2014-04-18 Thread Benjamin Mahler
Vinod can you review this one? On Fri, Apr 18, 2014 at 3:09 PM, Jiang Yan Xu y...@jxu.me wrote: --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/20502/

[jira] [Assigned] (MESOS-1220) Make check failure on OSX - IO error: Too many open files

2014-04-17 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler reassigned MESOS-1220: -- Assignee: Benjamin Mahler I'll take a look! Make check failure on OSX - IO error

[jira] [Commented] (MESOS-1152) ProcTest.MultipleThreads is flaky

2014-04-17 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13973498#comment-13973498 ] Benjamin Mahler commented on MESOS-1152: Looks like it was flaky on Jenkins

[jira] [Commented] (MESOS-1213) --enable-gprof

2014-04-16 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13971823#comment-13971823 ] Benjamin Mahler commented on MESOS-1213: Have you ever used {{perf}}? We also

Re: Review Request 20431: Used LogStorage for all tests.

2014-04-16 Thread Benjamin Mahler
Actually, I will update this to include the RegistrarTests as well. On Wed, Apr 16, 2014 at 3:59 PM, Ben Mahler benjamin.mah...@gmail.comwrote: This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/20431/ Review request for mesos, Benjamin Hindman and

[jira] [Closed] (MESOS-1210) OsTest.children test is flaky

2014-04-16 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler closed MESOS-1210. -- Resolution: Not a Problem Yes, let's close this as it expects a clean environment (no child

[jira] [Commented] (MESOS-982) Relax slave (re-)registration retries and add a backoff mechanism.

2014-04-14 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13969052#comment-13969052 ] Benjamin Mahler commented on MESOS-982: --- [~xujyan] Yes. Relax slave (re

[jira] [Updated] (MESOS-1160) Support flattening from Try into Future.

2014-04-09 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1160: --- Description: We should consider adding support for constructing a FutureT from a TryT

Re: Mesos Wire Protocol Documentation

2014-04-09 Thread Benjamin Mahler
for scheduling. But, it's really administrative. I thought that would have been an opportunity for a Go binding that did not require the C++ libraries. Thanks anyway. On Tue, Apr 8, 2014 at 4:52 PM, Benjamin Mahler benjamin.mah...@gmail.comwrote: Sorry, I was not referring to implementing

[jira] [Resolved] (MESOS-682) Master should properly consolidate slaves and deactivated maps

2014-04-09 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler resolved MESOS-682. --- Resolution: Fixed Ok, reading closely through what happened here, this situation

[jira] [Commented] (MESOS-1199) Subprocess is slow - gated by process::reap poll interval

2014-04-09 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13964755#comment-13964755 ] Benjamin Mahler commented on MESOS-1199: [~tknaup] can you run {{perf top

[jira] [Resolved] (MESOS-1160) Support flattening from Try into Future.

2014-04-09 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler resolved MESOS-1160. Resolution: Fixed Fix Version/s: 0.19.0 Assignee: Ritwik Yadav {noformat

Re: [jira] [Commented] (MESOS-1199) Subprocess is slow - gated by process::reap poll interval

2014-04-09 Thread Benjamin Mahler
Ian, process::reap always uses waitpid() as we didn't implement the thread-per-pid wait() optimization. On Wed, Apr 9, 2014 at 4:41 PM, Ian Downes (JIRA) j...@apache.org wrote: [

Re: Mesos Wire Protocol Documentation

2014-04-08 Thread Benjamin Mahler
. Sender is encoded in User-Agent header, e.g: libprocess/slave(1)@127.0.0.1:5051. Body contains protobuf message, Transfer-Encoding is always chunked. On Mon, Apr 7, 2014 at 10:42 PM, Benjamin Mahler benjamin.mah...@gmail.comwrote: Unfortunately you will need to learn

[jira] [Commented] (MESOS-1199) Subprocess is slow - gated by process::reap poll interval

2014-04-08 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13963326#comment-13963326 ] Benjamin Mahler commented on MESOS-1199: [~t.st.clair]: {code} // TODO(bmahler

Re: Mesos Wire Protocol Documentation

2014-04-08 Thread Benjamin Mahler
Vivien vladimir.viv...@gmail.comwrote: Ben, That is exactly what I am asking. Is that something coming up soon, is there a JIRA I can look at? I wanna get early start on a native json Go api or even help out if possible. On Tue, Apr 8, 2014 at 3:25 PM, Benjamin Mahler benjamin.mah

[jira] [Resolved] (MESOS-841) Enforce only leading master can write to the Registrar.

2014-04-08 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler resolved MESOS-841. --- Resolution: Fixed This was done by having a newly elected master write its MasterInfo

[jira] [Created] (MESOS-1200) Add SlaveID to KillTaskMessage to provide feedback for unknown slaves.

2014-04-08 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1200: -- Summary: Add SlaveID to KillTaskMessage to provide feedback for unknown slaves. Key: MESOS-1200 URL: https://issues.apache.org/jira/browse/MESOS-1200 Project

[jira] [Commented] (MESOS-783) Master::killTask must not answer with TASK_LOST when the task is unknown.

2014-04-08 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13963643#comment-13963643 ] Benjamin Mahler commented on MESOS-783: --- This was fixed as part of the Registrar

Re: Mesos Wire Protocol Documentation

2014-04-07 Thread Benjamin Mahler
Unfortunately you will need to learn this by looking at the code in libprocess, as the message passing format is not explicitly documented at the current time. Start with calls like ProtobufProcess::send() and dig your way down. On Sat, Apr 5, 2014 at 7:52 AM, Vladimir Vivien

Re: Load simulator/benchmark tool

2014-04-07 Thread Benjamin Mahler
Jie recently pointed me to the Sparrow talk: http://www.youtube.com/watch?v=A4k0WqjUY9A In light of the concerns over the latency penalty of centralized scheduler systems, it would be awesome to measure task / update / message latencies when dealing with very large clusters. Does mesosaurus aim

Re: Question about LOST status on custom executor

2014-04-07 Thread Benjamin Mahler
should inform RUNNING, FINISHED, and FAILED? On Mon, Apr 7, 2014 at 4:54 PM, Benjamin Mahler benjamin.mah...@gmail.comwrote: Why is your executor failing? When you say failing, is your executor crashing or simply exiting after doing the required work? You will need to manage the task

[jira] [Comment Edited] (MESOS-982) Relax slave (re-)registration retries and add a backoff mechanism.

2014-04-07 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13962279#comment-13962279 ] Benjamin Mahler edited comment on MESOS-982 at 4/7/14 9:25 PM

[jira] [Reopened] (MESOS-982) Relax slave (re-)registration retries and add a backoff mechanism.

2014-04-07 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler reopened MESOS-982: --- [~adam-mesos] thanks for bringing that up, we should indeed be prudent and add the retry logic

Re: Review Request 20106: Added support for flattening from Try and Result objects into Future objects as suggested in the issue MESOS-1160.

2014-04-07 Thread Benjamin Mahler
Hi Ritwik, please add 'mesos' to the 'Groups' field on the review as well, this ensures it is sent out on the dev@ list. On Mon, Apr 7, 2014 at 2:33 PM, Ritwik ritwik.ya...@gmail.com wrote: +dev@mesos.apache.org -- Forwarded message -- From: Ritwik Yadav

[jira] [Commented] (MESOS-1160) Support flattening from {Try,Result} into Future.

2014-04-04 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13960555#comment-13960555 ] Benjamin Mahler commented on MESOS-1160: I was also thinking this would be done

[jira] [Assigned] (MESOS-1156) make check-local fail on OEL6

2014-03-28 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler reassigned MESOS-1156: -- Assignee: Vinod Kone Passing the torch on to [~vinodkone] for possible updates to his

[jira] [Created] (MESOS-1160) Support flattening from {Try,Result} into Future.

2014-03-28 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1160: -- Summary: Support flattening from {Try,Result} into Future. Key: MESOS-1160 URL: https://issues.apache.org/jira/browse/MESOS-1160 Project: Mesos Issue

[jira] [Updated] (MESOS-1160) Support flattening from {Try,Result} into Future.

2014-03-28 Thread Benjamin Mahler (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Mahler updated MESOS-1160: --- Description: We should consider adding support for constructing a FutureT from a TryT

[jira] [Created] (MESOS-1162) Add a 'Percentage' abstraction.

2014-03-28 Thread Benjamin Mahler (JIRA)
Benjamin Mahler created MESOS-1162: -- Summary: Add a 'Percentage' abstraction. Key: MESOS-1162 URL: https://issues.apache.org/jira/browse/MESOS-1162 Project: Mesos Issue Type: Improvement

<    4   5   6   7   8   9   10   11   12   13   >