date:20161101

[jira] [Commented] (MESOS-6213) Build failure on macOS Sierra: Protobuf atomics deprecated.

2016-11-01 Thread Tim Harper (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627854#comment-15627854
 ] 

Tim Harper commented on MESOS-6213:
---

As a workaround, you can run {{make CPPFLAGS="-Wno-deprecated-declarations"}}

> Build failure on macOS Sierra: Protobuf atomics deprecated.
> ---
>
> Key: MESOS-6213
> URL: https://issues.apache.org/jira/browse/MESOS-6213
> Project: Mesos
>  Issue Type: Bug
>  Components: build
>Reporter: Charles Allen
>
> Building on OSX is giving the following error.
> {code}
> In file included from 
> ../3rdparty/protobuf-2.6.1/src/google/protobuf/stubs/atomicops.h:184:
> ../3rdparty/protobuf-2.6.1/src/google/protobuf/stubs/atomicops_internals_macosx.h:173:9:
>  error: 'OSAtomicCompareAndSwap64Barrier' is deprecated: first
>   deprecated in macOS 10.12 - Use std::atomic_compare_exchange_strong() 
> from  instead [-Werror,-Wdeprecated-declarations]
> if (OSAtomicCompareAndSwap64Barrier(
> ^
> /Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX10.12.sdk/usr/include/libkern/OSAtomicDeprecated.h:645:9:
>  note:
>   'OSAtomicCompareAndSwap64Barrier' has been explicitly marked deprecated 
> here
> boolOSAtomicCompareAndSwap64Barrier( int64_t __oldValue, int64_t 
> __newValue,
> ^
> {code}
> Protobuf is not listed as a component so I just set it as {{build}}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-5662) Call parent class `SetUpTestCase` function in our test fixtures.

2016-11-01 Thread Manuwela Kanade (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-5662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627820#comment-15627820
 ] 

Manuwela Kanade commented on MESOS-5662:


Hi [~anandmazumdar]: Created a pull request for this:
https://reviews.apache.org/r/53372/

> Call parent class `SetUpTestCase` function in our test fixtures.
> 
>
> Key: MESOS-5662
> URL: https://issues.apache.org/jira/browse/MESOS-5662
> Project: Mesos
>  Issue Type: Bug
>  Components: test
>Reporter: Anand Mazumdar
>Assignee: Manuwela Kanade
>  Labels: mesosphere, newbie
>
> There are some occurrences in our code where we don't invoke the parent's 
> {{SetUpTestCase}} method from a child test fixture. This can be a bit 
> problematic if someone adds the method in the parent class sometime in the 
> future. It would be good to do a sweep across the code and explicitly invoke 
> the parent class's method.
> Some examples (there are more):
> https://github.com/apache/mesos/blob/master/src/tests/mesos.cpp#L80
> https://github.com/apache/mesos/blob/master/src/tests/module_tests.cpp#L59



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Assigned] (MESOS-6489) Better support for containers that want to manage their own cgroup.

2016-11-01 Thread Anindya Sinha (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anindya Sinha reassigned MESOS-6489:


Assignee: Anindya Sinha

> Better support for containers that want to manage their own cgroup.
> ---
>
> Key: MESOS-6489
> URL: https://issues.apache.org/jira/browse/MESOS-6489
> Project: Mesos
>  Issue Type: Improvement
>  Components: cgroups
>Reporter: Jie Yu
>Assignee: Anindya Sinha
>  Labels: cgroups
>
> Some containers want to manage their cgroup by sub-dividing the cgroup that 
> Mesos allocates to them into multiple sub-cgroups and put subprocess into the 
> corresponding sub-cgroups.
> For instance, someone wants to run Docker daemon in a Mesos container. Docker 
> daemon will manage the cgroup assigned to it by Mesos (with the help , for 
> example, cgroups namespace).
> Problems arise during the teardown of the container because two entities 
> might be manipulating the same cgroup simultaneously. For example, the Mesos 
> cgroups::destroy might fail if the task running inside is trying to delete 
> the same nested cgroup at the same time.
> To support that case, we should consider kill all the processes in the Mesos 
> cgroup first, making sure that no one will be creating sub-cgroups and moving 
> new processes into sub-cgroups. And then, destroy the cgroups recursively.
> And we need freezer because we want to make sure all processes are stopped 
> while we are sending kill signals to avoid TOCTTOU race problem. I think it 
> makes more sense to freezer the cgroups (and sub-cgroups) from top down 
> (rather than bottom up because typically, processes in the parent cgroup 
> manipulate sub-cgroups).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6523) Agent cgroup assignment should precede agent initialization.

2016-11-01 Thread Anindya Sinha (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anindya Sinha updated MESOS-6523:
-
Shepherd: Yan Xu

> Agent cgroup assignment should precede agent initialization.
> 
>
> Key: MESOS-6523
> URL: https://issues.apache.org/jira/browse/MESOS-6523
> Project: Mesos
>  Issue Type: Improvement
>  Components: cgroups
>Reporter: Anindya Sinha
>Assignee: Anindya Sinha
>Priority: Minor
>  Labels: cgroups, initialization
>
> If mesos-slave is launched with agent_subsystems flag, it would be preferred 
> to move the agent process to its own cgroup for all the subsystems before the 
> mesos-slave starts accepting traffic, ie. before port 5051 is opened up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-2952) Provide user namespaces for privileged access inside containers

2016-11-01 Thread Aaron Wood (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-2952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627630#comment-15627630
 ] 

Aaron Wood commented on MESOS-2952:
---

This would be great to have. If there's anything I can help with please let me 
know!

> Provide user namespaces for privileged access inside containers
> ---
>
> Key: MESOS-2952
> URL: https://issues.apache.org/jira/browse/MESOS-2952
> Project: Mesos
>  Issue Type: Epic
>Reporter: Paul Brett
>Assignee: Vaibhav Khanduja
>
> User namespaces allow per-namespace mappings of user and group IDs. This 
> means that a process's user and group IDs inside a user namespace can be 
> different from its IDs outside of the namespace. Most notably, a process can 
> have a nonzero user ID outside a namespace while at the same time having a 
> user ID of zero inside the namespace; in other words, the process is 
> unprivileged for operations outside the user namespace but has root 
> privileges inside the namespace.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6518) Mesos dashboard: Symlinks not displayed in sandbox view

2016-11-01 Thread haosdent (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627612#comment-15627612
 ] 

haosdent commented on MESOS-6518:
-

Hi, [~Makman2] It is expected behaviour. Because you execute symbol links in 
the docker container. Those symbol links only valid in the docker container. 
Although those symbol links exist in the host, but their target files are 
invalid. So Mesos would filter them out and don't display them in WebUI.

> Mesos dashboard: Symlinks not displayed in sandbox view
> ---
>
> Key: MESOS-6518
> URL: https://issues.apache.org/jira/browse/MESOS-6518
> Project: Mesos
>  Issue Type: Improvement
>  Components: webui
>Reporter: Mischa Krüger
>Assignee: haosdent
>Priority: Minor
> Attachments: sandbox.png, tmp.png
>
>
> Soft symbolic links do not appear in the sandbox, though they exist.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Assigned] (MESOS-6530) Add support for incremental gzip decompression.

2016-11-01 Thread Benjamin Mahler (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Mahler reassigned MESOS-6530:
--

Assignee: Benjamin Mahler

> Add support for incremental gzip decompression.
> ---
>
> Key: MESOS-6530
> URL: https://issues.apache.org/jira/browse/MESOS-6530
> Project: Mesos
>  Issue Type: Improvement
>  Components: stout
>Reporter: Benjamin Mahler
>Assignee: Benjamin Mahler
>
> We currently only support compressing and decompressing based on the entire 
> input being available at once. We can add a {{gzip::Decompressor}} to support 
> incremental decompression.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (MESOS-6531) Add support for incremental gzip compression.

2016-11-01 Thread Benjamin Mahler (JIRA)

Benjamin Mahler created MESOS-6531:
--

 Summary: Add support for incremental gzip compression.
 Key: MESOS-6531
 URL: https://issues.apache.org/jira/browse/MESOS-6531
 Project: Mesos
  Issue Type: Improvement
  Components: stout
Reporter: Benjamin Mahler


We currently only support compression / decompression assuming the entire input 
is available at once. We can add a {{gzip::Compressor}} to support incremental 
compression.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (MESOS-6530) Add support for incremental gzip decompression.

2016-11-01 Thread Benjamin Mahler (JIRA)

Benjamin Mahler created MESOS-6530:
--

 Summary: Add support for incremental gzip decompression.
 Key: MESOS-6530
 URL: https://issues.apache.org/jira/browse/MESOS-6530
 Project: Mesos
  Issue Type: Improvement
  Components: stout
Reporter: Benjamin Mahler


We currently only support compressing and decompressing based on the entire 
input being available at once. We can add a {{gzip::Decompressor}} to support 
incremental decompression.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6522) Ability to set global maximum executor runtime for an agent

2016-11-01 Thread Benjamin Mahler (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627454#comment-15627454
 ] 

Benjamin Mahler commented on MESOS-6522:


The current thinking w.r.t. to maintenance is that we ask for resources back 
from the schedulers (via inverse offers). Schedulers know the deadline and 
should co-operate with these requests. Once the maintenance window begins, the 
operator could force the draining of the agent but this will potentially cause 
SLA violations or data loss if the maintenance is destructive. Because of this, 
we'd like the operator to make this call. Also worth noting that it is to be 
expected that some attempts to do maintenance do not succeed since they would 
have led to SLA violations for the frameworks, or data loss in the case of 
destructive maintenance. In these cases the operator can follow up on the 
"stragglers" with a more suitable maintenance plan.

A maximum executor lifetime is interesting in that it forces churn in the 
cluster, but it would make it very difficult to implement certain classes of 
workloads (e.g. data storage) and I suspect it would frustrate framework 
developers since they have no control over it. In general we try to give 
control to the frameworks, since only they understand the workload.

> Ability to set global maximum executor runtime for an agent
> ---
>
> Key: MESOS-6522
> URL: https://issues.apache.org/jira/browse/MESOS-6522
> Project: Mesos
>  Issue Type: Improvement
>  Components: slave
>Reporter: Will Rouesnel
>Priority: Minor
>
> With the developing concept of agent maintenance mode, it would be nice to 
> have some blunt-force ability to reason about the behavior of agents on 
> uncooperative frameworks.
> Ideally there would be a new parameter --executor_maximum_lifetime which 
> would specify a maximum duration for which *any* executor on an agent can run 
> before being terminated.
> Even when using persistent schedulers such as Marathon, the ability to 
> enforce reasonable gurantees about when an agent's tasks definitely must end 
> can help contribute to keeping the cluster turning over and prevent nodes 
> becoming "special" or jammed up with jobs which will not end.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6526) `mesos-containerizer launch --environment` exposes executor env vars in `ps`.

2016-11-01 Thread Jie Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627299#comment-15627299
 ] 

Jie Yu commented on MESOS-6526:
---

That will be great! I can help  review the code. Thanks!

> `mesos-containerizer launch --environment` exposes executor env vars in `ps`.
> -
>
> Key: MESOS-6526
> URL: https://issues.apache.org/jira/browse/MESOS-6526
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 1.1.0
>Reporter: Yan Xu
>Priority: Critical
>
> With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
> `--environment` flag for the env vars used by the executor. This is 
> unpleasant because its a common practice that people use env vars to hide 
> configs that are sensitive and now it's visible to non-root users on the host 
> with a {{ps}} command.
> Given that we want to separate the environments of {{mesos-containerizer 
> launch}} and the executor itself, perhaps we can just package and serialize 
> the executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass 
> that to {{mesos-containerizer launch}} which could then get it through a flag 
> the usual way. 
> In general Mesos should do more to protect env vars but I'll file separate 
> issues for them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Comment Edited] (MESOS-6526) `mesos-containerizer launch --environment` exposes executor env vars in `ps`.

2016-11-01 Thread Yan Xu (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627289#comment-15627289
 ] 

Yan Xu edited comment on MESOS-6526 at 11/2/16 1:24 AM:


Will you guys be able to get to this one soon? If not we can pick it up.


was (Author: xujyan):
Will you guys able to get to this one soon? If not we can pick it up.

> `mesos-containerizer launch --environment` exposes executor env vars in `ps`.
> -
>
> Key: MESOS-6526
> URL: https://issues.apache.org/jira/browse/MESOS-6526
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 1.1.0
>Reporter: Yan Xu
>Priority: Critical
>
> With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
> `--environment` flag for the env vars used by the executor. This is 
> unpleasant because its a common practice that people use env vars to hide 
> configs that are sensitive and now it's visible to non-root users on the host 
> with a {{ps}} command.
> Given that we want to separate the environments of {{mesos-containerizer 
> launch}} and the executor itself, perhaps we can just package and serialize 
> the executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass 
> that to {{mesos-containerizer launch}} which could then get it through a flag 
> the usual way. 
> In general Mesos should do more to protect env vars but I'll file separate 
> issues for them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6526) `mesos-containerizer launch --environment` exposes executor env vars in `ps`.

2016-11-01 Thread Yan Xu (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627289#comment-15627289
 ] 

Yan Xu commented on MESOS-6526:
---

Will you guys able to get to this one soon? If not we can pick it up.

> `mesos-containerizer launch --environment` exposes executor env vars in `ps`.
> -
>
> Key: MESOS-6526
> URL: https://issues.apache.org/jira/browse/MESOS-6526
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 1.1.0
>Reporter: Yan Xu
>Priority: Critical
>
> With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
> `--environment` flag for the env vars used by the executor. This is 
> unpleasant because its a common practice that people use env vars to hide 
> configs that are sensitive and now it's visible to non-root users on the host 
> with a {{ps}} command.
> Given that we want to separate the environments of {{mesos-containerizer 
> launch}} and the executor itself, perhaps we can just package and serialize 
> the executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass 
> that to {{mesos-containerizer launch}} which could then get it through a flag 
> the usual way. 
> In general Mesos should do more to protect env vars but I'll file separate 
> issues for them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6526) `mesos-containerizer launch --environment` exposes executor env vars in `ps`.

2016-11-01 Thread Yan Xu (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627278#comment-15627278
 ] 

Yan Xu commented on MESOS-6526:
---

Cool. Yeah you are right, what would you suggest? 
{{MESOS_COMMAND_ENVIRONMENT}}? (Since it's all commands and there's a --command 
flag to go with it)

> `mesos-containerizer launch --environment` exposes executor env vars in `ps`.
> -
>
> Key: MESOS-6526
> URL: https://issues.apache.org/jira/browse/MESOS-6526
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 1.1.0
>Reporter: Yan Xu
>Priority: Critical
>
> With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
> `--environment` flag for the env vars used by the executor. This is 
> unpleasant because its a common practice that people use env vars to hide 
> configs that are sensitive and now it's visible to non-root users on the host 
> with a {{ps}} command.
> Given that we want to separate the environments of {{mesos-containerizer 
> launch}} and the executor itself, perhaps we can just package and serialize 
> the executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass 
> that to {{mesos-containerizer launch}} which could then get it through a flag 
> the usual way. 
> In general Mesos should do more to protect env vars but I'll file separate 
> issues for them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (MESOS-6529) Support ZFS sandbox isolation

2016-11-01 Thread Will Rouesnel (JIRA)

Will Rouesnel created MESOS-6529:


 Summary: Support ZFS sandbox isolation
 Key: MESOS-6529
 URL: https://issues.apache.org/jira/browse/MESOS-6529
 Project: Mesos
  Issue Type: Improvement
  Components: containerization
Reporter: Will Rouesnel
Priority: Minor


ZFS has excellent support for filesystem quotas under Linux/BSD operating 
systems. It would be good to support to support isolating workloads on separate 
ZFS filesystems that are dynamically created when the task is executed, and 
which could have disk quotas properly and invidually enforced.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6527) Memory leak in the libprocess request decoder.

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-6527:
---
Target Version/s: 0.28.3, 1.0.2, 1.1.0  (was: 0.28.3, 1.0.2, 1.1.1)

> Memory leak in the libprocess request decoder.
> --
>
> Key: MESOS-6527
> URL: https://issues.apache.org/jira/browse/MESOS-6527
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Anand Mazumdar
>Assignee: Anand Mazumdar
>Priority: Blocker
>  Labels: mesosphere
>
> The libprocess decoder can leak a {{Request}} object in cases when a client 
> disconnects while the request is in progress. In such cases, the decoder's 
> destructor won't delete the active {{Request}} object that it had allocated 
> on the heap.
> https://github.com/apache/mesos/blob/master/3rdparty/libprocess/src/decoder.hpp#L271



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6280) Task group executor should support command health checks.

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-6280:
---
Issue Type: Improvement  (was: Bug)

> Task group executor should support command health checks.
> -
>
> Key: MESOS-6280
> URL: https://issues.apache.org/jira/browse/MESOS-6280
> Project: Mesos
>  Issue Type: Improvement
>Reporter: Alexander Rukletsov
>Priority: Critical
>  Labels: mesosphere
>
> Currently, the default (aka pod) executor supports only HTTP and TCP health 
> checks. We should also support command health checks as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-3039) Allow executors binding IP to be different than Slave binding IP.

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-3039:
---
Issue Type: Improvement  (was: Bug)
   Summary: Allow executors binding IP to be different than Slave binding 
IP.  (was: Allow executors binding IP to be different than Slave binding IP)

> Allow executors binding IP to be different than Slave binding IP.
> -
>
> Key: MESOS-3039
> URL: https://issues.apache.org/jira/browse/MESOS-3039
> Project: Mesos
>  Issue Type: Improvement
>Reporter: Kapil Arya
>Assignee: Kapil Arya
>Priority: Critical
>  Labels: mesosphere
>
> Currently, the Slave will bind either to the loopback IP (127.0.0.1) or to 
> the IP passed via the '--ip' flag. When it launches a containerized executor 
> (e.g, via Mesos Containerizer), the executor inherits the binding IP of the 
> Slave. This is due to the fact that the '--ip' flags sets the environment 
> variable `LIBPROCESS_IP` to the passed IP. The executor then inherits this 
> environment variable and is forced to bind to the Slave IP.
> If an executor is running in its own containerized environment, with a 
> separate IP than that of the Slave, currently there is no way of forcing it 
> to bind to its own IP. A potential solution is to use the executor 
> environment decorator hooks to update LIBPROCESS_IP environment variable for 
> the executor.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-3000) Failing test - NsTest.ROOT_setns

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-3000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-3000:
---
Priority: Major  (was: Blocker)

> Failing test - NsTest.ROOT_setns
> 
>
> Key: MESOS-3000
> URL: https://issues.apache.org/jira/browse/MESOS-3000
> Project: Mesos
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.23.0
>Reporter: Ian Downes
>Assignee: Chris Lambert
>
> Appears to be the same issue plaguing MESOS-2199
> {noformat}
> [root@hostname build]# MESOS_VERBOSE=1 ./bin/mesos-tests.sh 
> --gtest_filter=NsTest.ROOT_setns
> ...
> [==] Running 1 test from 1 test case.
> [--] Global test environment set-up.
> [--] 1 test from NsTest
> [ RUN  ] NsTest.ROOT_setns
> ABORT: (../../../3rdparty/libprocess/src/subprocess.cpp:163): Failed to 
> os::execvpe in childMain: Permission denied*** Aborted at 1436292540 (unix 
> time) try "date -d @1436292540" if you are using GNU date ***
> PC: @ 0x7f7a1229e625 __GI_raise
> *** SIGABRT (@0xfffe0001) received by PID 1 (TID 0x7f7a19afc820) from PID 
> 1; stack trace: ***
> @ 0x7f7a13421710 (unknown)
> @ 0x7f7a1229e625 __GI_raise
> @ 0x7f7a1229fe05 __GI_abort
> @   0x860ba1 (unknown)
> @   0x860bcf (unknown)
> @ 0x7f7a1826f118 (unknown)
> @ 0x7f7a18274594 (unknown)
> @ 0x7f7a18273b88 (unknown)
> @ 0x7f7a18273098 (unknown)
> @  0x1180720 (unknown)
> @  0x117a5d7 (unknown)
> @ 0x7f7a123548fd clone
> ../../src/tests/ns_tests.cpp:121: Failure
> Failed to wait 15secs for status
> [  FAILED  ] NsTest.ROOT_setns (15004 ms)
> [--] 1 test from NsTest (15004 ms total)
> [--] Global test environment tear-down
> ../../src/tests/environment.cpp:441: Failure
> Failed
> Tests completed with child processes remaining:
> -+- 40531 /home/idownes/workspace/mesos/build/src/.libs/lt-mesos-tests 
> --gtest_filter=NsTest.ROOT_setns
>  \--- 40565 /home/idownes/workspace/mesos/build/src/.libs/lt-mesos-tests 
> --gtest_filter=NsTest.ROOT_setns
> [==] 1 test from 1 test case ran. (15034 ms total)
> [  PASSED  ] 0 tests.
> [  FAILED  ] 1 test, listed below:
> [  FAILED  ] NsTest.ROOT_setns
> {noformat}
> Relevant strace for the forked child:
> {noformat}
> ...
> getpid()= 1
> dup2(6, 0) = 0
> dup2(7, 1) = 1
> dup2(8, 2) = 2
> close(6) = 0
> close(7) = 0
> close(8) = 0
> execve("/home/idownes/workspace/mesos/build/src/setns-test-helper", 
> ["setns-test-helper", "SetnsTestHelper"], [/* 24 vars */]) = -1 EACCES 
> (Permission denied)
> write(2, "ABORT: (../../../3rdparty/libpro"..., 62) = 62
> write(2, "Failed to os::execvpe in childMa"..., 53) = 53
> ...
> {noformat}
> Binary that it's trying to exec:
> {noformat}
> [root@hostname build]# stat 
> /home/idownes/workspace/mesos/build/src/setns-test-helper
>   File: `/home/idownes/workspace/mesos/build/src/setns-test-helper'
>   Size: 7948Blocks: 16 IO Block: 4096   regular file
> Device: 801h/2049d  Inode: 22949249Links: 1
> Access: (0755/-rwxr-xr-x)  Uid: (13118/ idownes)   Gid: ( 1500/employee)
> Access: 2015-07-07 17:58:09.569861237 +
> Modify: 2015-07-07 17:58:09.573861290 +
> Change: 2015-07-07 17:58:09.573861290 +
> [root@hostname build]# 
> /home/idownes/workspace/mesos/build/src/setns-test-helper
> Usage: /home/idownes/workspace/mesos/build/src/.libs/lt-setns-test-helper 
>  [OPTIONS]
> Available subcommands:
> help
> SetnsTestHelper
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-2759) Could not create MesosContainerizer: Could not create isolator network/port_mapping: Routing library check failed: Capability ROUTE_LINK_VETH_GET_PEER_OWN_REFERENCE is no

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-2759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-2759:
---
Target Version/s: 0.22.1, 0.22.0  (was: 0.22.0, 0.22.1)
Priority: Major  (was: Critical)

> Could not create MesosContainerizer: Could not create isolator 
> network/port_mapping: Routing library check failed: Capability 
> ROUTE_LINK_VETH_GET_PEER_OWN_REFERENCE is not available
> -
>
> Key: MESOS-2759
> URL: https://issues.apache.org/jira/browse/MESOS-2759
> Project: Mesos
>  Issue Type: Bug
>Reporter: Anuj Gupta
>
> [root@localhost ~]# mesos-slave --master=127.0.0.1:5050 
> --log_dir=/var/log/mesos --work_dir=/var/lib/mesos 
> --isolation=cgroups/cpu,cgroups/mem,network/port_mapping 
> --resources=ephemeral_ports:[32768-57344] --ephemeral_ports_per_container=1024
> mesos-slave: /usr/lib64/libnl-3.so.200: no version information available 
> (required by /usr/local/lib/libmesos-0.22.0.so)
> mesos-slave: /usr/lib64/libnl-route-3.so.200: no version information 
> available (required by /usr/local/lib/libmesos-0.22.0.so)
> mesos-slave: /usr/lib64/libnl-3.so.200: no version information available 
> (required by /lib/libnl-idiag-3.so.200)
> I0521 14:10:16.727126 13214 logging.cpp:172] INFO level logging started!
> I0521 14:10:16.727409 13214 main.cpp:156] Build: 2015-05-21 13:21:45 by root
> I0521 14:10:16.727432 13214 main.cpp:158] Version: 0.22.0
> I0521 14:10:16.727727 13214 containerizer.cpp:110] Using isolation: 
> cgroups/cpu,cgroups/mem,network/port_mapping
> Failed to create a containerizer: Could not create MesosContainerizer: Could 
> not create isolator network/port_mapping: Routing library check failed: 
> Capability ROUTE_LINK_VETH_GET_PEER_OWN_REFERENCE is not available



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-2828) Refactor 'model' functions and use JSON::Protobuf instead

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-2828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-2828:
---
  Priority: Major  (was: Critical)
Issue Type: Improvement  (was: Bug)

> Refactor 'model' functions and use JSON::Protobuf instead
> -
>
> Key: MESOS-2828
> URL: https://issues.apache.org/jira/browse/MESOS-2828
> Project: Mesos
>  Issue Type: Improvement
>Reporter: Alexander Rojas
>  Labels: mesosphere
>
> The current patter used to serialize objects returned by endpoints requieres 
> the use of a {{model}} function which receives the object to be serialized 
> and returns a JSON object. (See slave.cpp and master.cpp).
> If possible, it would be better to use JSON::Protobuf to serialize protocol 
> buffer objects, and if possible to write a jsonfy function similar to 
> stringify.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-1547) option --cgroups_enable_cfs - not working on Oracle Linux

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-1547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-1547:
---
Priority: Major  (was: Critical)

> option --cgroups_enable_cfs - not working on Oracle Linux 
> --
>
> Key: MESOS-1547
> URL: https://issues.apache.org/jira/browse/MESOS-1547
> Project: Mesos
>  Issue Type: Bug
>  Components: isolation
>Affects Versions: 0.19.0
> Environment: Oracle Enterprise Linux - version 
> 2.6.39-400.215.3.el6uek.x86_64
>Reporter: Umesh Batra
>
> I am not able to make option "cgroups_enable_cfs" work with OEL? 
>  
> Couple of posts that I came across talks about, rebuilding the kernel with a 
> different configuration. 
>  
> Appreciate, if you can confirm or guide me on the same. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-1256) Enable running all existing containerizer tests with different containerizer implementations.

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-1256:
---
  Priority: Major  (was: Critical)
Issue Type: Improvement  (was: Bug)

> Enable running all existing containerizer tests with different containerizer 
> implementations.
> -
>
> Key: MESOS-1256
> URL: https://issues.apache.org/jira/browse/MESOS-1256
> Project: Mesos
>  Issue Type: Improvement
>  Components: slave, test
>Reporter: Benjamin Hindman
>
> Right now the tests get run with the MesosContainerizer but they should 
> really be pluggable so that new containerizer implementations can be plugged 
> in directly. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Assigned] (MESOS-6528) Container status of a task in a pod is not correct.

2016-11-01 Thread Jie Yu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jie Yu reassigned MESOS-6528:
-

Assignee: Jie Yu

> Container status of a task in a pod is not correct.
> ---
>
> Key: MESOS-6528
> URL: https://issues.apache.org/jira/browse/MESOS-6528
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization, slave
>Affects Versions: 1.1.0
>Reporter: Jie Yu
>Assignee: Jie Yu
>  Labels: mesosphere
>
> Currently, the container status is for the top level executor container. This 
> is not ideal. Ideally, we should get the container status for the 
> corresponding nested container and report that with the task status update.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6528) Container status of a task in a pod is not correct.

2016-11-01 Thread Jie Yu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jie Yu updated MESOS-6528:
--
Component/s: slave
 containerization

> Container status of a task in a pod is not correct.
> ---
>
> Key: MESOS-6528
> URL: https://issues.apache.org/jira/browse/MESOS-6528
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization, slave
>Affects Versions: 1.1.0
>Reporter: Jie Yu
>  Labels: mesosphere
>
> Currently, the container status is for the top level executor container. This 
> is not ideal. Ideally, we should get the container status for the 
> corresponding nested container and report that with the task status update.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6528) Container status of a task in a pod is not correct.

2016-11-01 Thread Jie Yu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jie Yu updated MESOS-6528:
--
Affects Version/s: 1.1.0

> Container status of a task in a pod is not correct.
> ---
>
> Key: MESOS-6528
> URL: https://issues.apache.org/jira/browse/MESOS-6528
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization, slave
>Affects Versions: 1.1.0
>Reporter: Jie Yu
>  Labels: mesosphere
>
> Currently, the container status is for the top level executor container. This 
> is not ideal. Ideally, we should get the container status for the 
> corresponding nested container and report that with the task status update.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (MESOS-6528) Container status of a task in a pod is not correct.

2016-11-01 Thread Jie Yu (JIRA)

Jie Yu created MESOS-6528:
-

 Summary: Container status of a task in a pod is not correct.
 Key: MESOS-6528
 URL: https://issues.apache.org/jira/browse/MESOS-6528
 Project: Mesos
  Issue Type: Bug
Reporter: Jie Yu


Currently, the container status is for the top level executor container. This 
is not ideal. Ideally, we should get the container status for the corresponding 
nested container and report that with the task status update.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6528) Container status of a task in a pod is not correct.

2016-11-01 Thread Jie Yu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jie Yu updated MESOS-6528:
--
Labels: mesosphere  (was: )

> Container status of a task in a pod is not correct.
> ---
>
> Key: MESOS-6528
> URL: https://issues.apache.org/jira/browse/MESOS-6528
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization, slave
>Affects Versions: 1.1.0
>Reporter: Jie Yu
>  Labels: mesosphere
>
> Currently, the container status is for the top level executor container. This 
> is not ideal. Ideally, we should get the container status for the 
> corresponding nested container and report that with the task status update.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6526) `mesos-containerizer launch --environment` exposes executor env vars in `ps`.

2016-11-01 Thread Jie Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627103#comment-15627103
 ] 

Jie Yu commented on MESOS-6526:
---

Yeah, i think we can use environment variable to pass the environment 
variables. I think we should not call it `MESOS_EXECUTOR_ENVIRONMENT` because 
it can be used for launching nested containers as well. I would use a different 
name.

> `mesos-containerizer launch --environment` exposes executor env vars in `ps`.
> -
>
> Key: MESOS-6526
> URL: https://issues.apache.org/jira/browse/MESOS-6526
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 1.1.0
>Reporter: Yan Xu
>Priority: Critical
>
> With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
> `--environment` flag for the env vars used by the executor. This is 
> unpleasant because its a common practice that people use env vars to hide 
> configs that are sensitive and now it's visible to non-root users on the host 
> with a {{ps}} command.
> Given that we want to separate the environments of {{mesos-containerizer 
> launch}} and the executor itself, perhaps we can just package and serialize 
> the executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass 
> that to {{mesos-containerizer launch}} which could then get it through a flag 
> the usual way. 
> In general Mesos should do more to protect env vars but I'll file separate 
> issues for them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6247) Enable Framework to set weight.

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-6247:
---
  Priority: Major  (was: Critical)
Issue Type: Wish  (was: Bug)
   Summary: Enable Framework to set weight.  (was: Enable Framework to set 
weight)

> Enable Framework to set weight.
> ---
>
> Key: MESOS-6247
> URL: https://issues.apache.org/jira/browse/MESOS-6247
> Project: Mesos
>  Issue Type: Wish
>  Components: allocation
> Environment: all
>Reporter: Klaus Ma
>
> We'd like to enable framework's weight when it register. So the framework can 
> share resources based on weight within the same role.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-2018) Dynamic Reservation

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-2018:
---
Priority: Major  (was: Critical)

> Dynamic Reservation
> ---
>
> Key: MESOS-2018
> URL: https://issues.apache.org/jira/browse/MESOS-2018
> Project: Mesos
>  Issue Type: Epic
>  Components: allocation, framework, master, slave
>Reporter: Adam B
>Assignee: Michael Park
>  Labels: mesosphere, offer, persistence, reservations, resource, 
> stateful, storage
>
> h3. Overview
> This is a feature to provide better support for running stateful services on 
> Mesos such as HDFS (Distributed Filesystem), Cassandra (Distributed 
> Database), or MySQL (Local Database).
> Current resource reservations (henceforth called "static" reservations) are 
> statically determined by the slave operator at slave start time, and 
> individual frameworks have no authority to reserve resources themselves.
> Dynamic reservations allow a framework to dynamically reserve offered 
> resources, such that those resources will only be re-offered to the same 
> framework (or other frameworks with the same role).
> This is especially useful if the framework's task stored some state on the 
> slave, and needs a guaranteed set of resources reserved so that it can 
> re-launch a task on the same slave to recover that state.
> h3. Planned Stages
> 1. MESOS-2489: Enable a framework to perform reservation operations.
> The goal of this stage is to allow the framework to send back a 
> Reserve/Unreserve operation which gets validated by the master and updates 
> the allocator resources. The allocator's {{allocate}} logic is left unchanged 
> and the resources get offered back to the framework's role as desired.
> 2. MESOS-2491: Persist the reservation state on the slave.
> The goal of this stage is to persist the reservation state on the slave. 
> Currently the master knows to store the persistent volumes in the 
> {{checkpointedResources}} data structure which gets sent to individual slaves 
> to be checkpointed. We will update the master such that dynamically reserved 
> resources are stored in the {{checkpointedResources}} as well. This stage 
> also involves subtasks such as updating the slave re(register) logic to 
> support slave re-starts.
> 3. MESOS-2600: Introduce reservation HTTP endpoints on the master.
> The goal of this stage is to enable operators to perform reservation 
> operations via HTTP endpoints on the master.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (MESOS-6527) Memory leak in the libprocess request decoder.

2016-11-01 Thread Anand Mazumdar (JIRA)

Anand Mazumdar created MESOS-6527:
-

 Summary: Memory leak in the libprocess request decoder.
 Key: MESOS-6527
 URL: https://issues.apache.org/jira/browse/MESOS-6527
 Project: Mesos
  Issue Type: Bug
  Components: libprocess
Reporter: Anand Mazumdar
Assignee: Anand Mazumdar
Priority: Blocker


The libprocess decoder can leak a {{Request}} object in cases when a client 
disconnects while the request is in progress. In such cases, the decoder's 
destructor won't delete the active {{Request}} object that it had allocated on 
the heap.

https://github.com/apache/mesos/blob/master/3rdparty/libprocess/src/decoder.hpp#L271



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6526) `mesos-containerizer launch --environment` exposes executor env vars in `ps`.

2016-11-01 Thread Yan Xu (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626946#comment-15626946
 ] 

Yan Xu commented on MESOS-6526:
---

Fetcher does it in a similar way: 
https://github.com/apache/mesos/blob/fdd9304c414cbd8cfc47f8d575315c539e55843f/src/slave/containerizer/fetcher.cpp#L826

> `mesos-containerizer launch --environment` exposes executor env vars in `ps`.
> -
>
> Key: MESOS-6526
> URL: https://issues.apache.org/jira/browse/MESOS-6526
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 1.1.0
>Reporter: Yan Xu
>Priority: Critical
>
> With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
> `--environment` flag for the env vars used by the executor. This is 
> unpleasant because its a common practice that people use env vars to hide 
> configs that are sensitive and now it's visible to non-root users on the host 
> with a {{ps}} command.
> Given that we want to separate the environments of {{mesos-containerizer 
> launch}} and the executor itself, perhaps we can just package and serialize 
> the executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass 
> that to {{mesos-containerizer launch}} which could then get it through a flag 
> the usual way. 
> In general Mesos should do more to protect env vars but I'll file separate 
> issues for them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6526) `mesos-containerizer launch --environment` exposes executor env vars in `ps`.

2016-11-01 Thread Yan Xu (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626939#comment-15626939
 ] 

Yan Xu commented on MESOS-6526:
---

I mean we can have one {{MESOS_EXECUTOR_ENVIRONMENT}} to contain the 
{{JSON::Object}} that has all the env vars the executor is supposed to use.

If it's passed over from {{mesos-agent}} to {{mesos-containerizer}} as an 
environment variable, it's not shown in {{ps}}. Since Mesos flags can load 
flags from env vars, the code in {{launch.cpp}} doesn't have to change much: it 
can still get it as if it was passed in {{argv}}.

> `mesos-containerizer launch --environment` exposes executor env vars in `ps`.
> -
>
> Key: MESOS-6526
> URL: https://issues.apache.org/jira/browse/MESOS-6526
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 1.1.0
>Reporter: Yan Xu
>Priority: Critical
>
> With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
> `--environment` flag for the env vars used by the executor. This is 
> unpleasant because its a common practice that people use env vars to hide 
> configs that are sensitive and now it's visible to non-root users on the host 
> with a {{ps}} command.
> Given that we want to separate the environments of {{mesos-containerizer 
> launch}} and the executor itself, perhaps we can just package and serialize 
> the executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass 
> that to {{mesos-containerizer launch}} which could then get it through a flag 
> the usual way. 
> In general Mesos should do more to protect env vars but I'll file separate 
> issues for them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6526) `mesos-containerizer launch --environment` exposes executor env vars in `ps`.

2016-11-01 Thread Gilbert Song (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626929#comment-15626929
 ] 

Gilbert Song commented on MESOS-6526:
-

Thanks [~xujyan], could you elaborate a little bit about serializing the 
executor env var into {MESOS_EXECUTOR_ENVIRONMENT}? Isn't it still exposed to 
'ps'?

> `mesos-containerizer launch --environment` exposes executor env vars in `ps`.
> -
>
> Key: MESOS-6526
> URL: https://issues.apache.org/jira/browse/MESOS-6526
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 1.1.0
>Reporter: Yan Xu
>Priority: Critical
>
> With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
> `--environment` flag for the env vars used by the executor. This is 
> unpleasant because its a common practice that people use env vars to hide 
> configs that are sensitive and now it's visible to non-root users on the host 
> with a {{ps}} command.
> Given that we want to separate the environments of {{mesos-containerizer 
> launch}} and the executor itself, perhaps we can just package and serialize 
> the executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass 
> that to {{mesos-containerizer launch}} which could then get it through a flag 
> the usual way. 
> In general Mesos should do more to protect env vars but I'll file separate 
> issues for them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6526) `mesos-containerizer launch --environment` exposes executor env vars in `ps`.

2016-11-01 Thread Yan Xu (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yan Xu updated MESOS-6526:
--
Description: 
With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
`--environment` flag for the env vars used by the executor. This is unpleasant 
because its a common practice that people use env vars to hide configs that are 
sensitive and now it's visible to non-root users on the host with a {{ps}} 
command.

Given that we want to separate the environments of {{mesos-containerizer 
launch}} and the executor itself, perhaps we can just package and serialize the 
executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass that 
to {{mesos-containerizer launch}} which could then get it through a flag the 
usual way. 

In general Mesos should do more to protect env vars but I'll file separate 
issues for them.

  was:
With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
`--environment` flag for the env vars used by the executor. This is unpleasant 
because its a common practice that people use env vars to hide configs that are 
sensitive and not it's visible to non-root users on the host with a {{ps}} 
command.

Given that we want to separate the environments of {{mesos-containerizer 
launch}} and the executor itself, perhaps we can just package and serialize the 
executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass that 
to {{mesos-containerizer launch}} which could then get it through a flag the 
usual way. 

In general Mesos should do more to protect env vars but I'll file separate 
issues for them.


> `mesos-containerizer launch --environment` exposes executor env vars in `ps`.
> -
>
> Key: MESOS-6526
> URL: https://issues.apache.org/jira/browse/MESOS-6526
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 1.1.0
>Reporter: Yan Xu
>Priority: Critical
>
> With MESOS-6323, the helper {{mesos-containerizer launch}} takes a 
> `--environment` flag for the env vars used by the executor. This is 
> unpleasant because its a common practice that people use env vars to hide 
> configs that are sensitive and now it's visible to non-root users on the host 
> with a {{ps}} command.
> Given that we want to separate the environments of {{mesos-containerizer 
> launch}} and the executor itself, perhaps we can just package and serialize 
> the executor env vars in one env var {{MESOS_EXECUTOR_ENVIRONMENT}} and pass 
> that to {{mesos-containerizer launch}} which could then get it through a flag 
> the usual way. 
> In general Mesos should do more to protect env vars but I'll file separate 
> issues for them.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6477) Build a standalone python client for connecting to our Mock HTTP Server that implements the new Debug APIs

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6477:
---
Assignee: Steven Locke  (was: Kevin Klues)

> Build a standalone python client for connecting to our Mock HTTP Server that 
> implements the new Debug APIs
> --
>
> Key: MESOS-6477
> URL: https://issues.apache.org/jira/browse/MESOS-6477
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Steven Locke
>  Labels: debugging, mesosphere
>
> This client prototype should have a similar CLI to what we eventually want to 
> build into the DC/OS CLI.
> {noformat}
> Streaming HTTP Client
> Usage:
>   client task exec [--tty] [--interactive]   [...]
>   client task attach [--tty] [--interactive] 
> Options:
>   --tty  Allocate a tty on the server before
>  attaching to the container.
>   --interactive  Connect the stdin of the client to
>  the stdin of the container.
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6476) Build a Mock HTTP Server that implements the new Debugging API calls

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6476:
---
Assignee: Steven Locke  (was: Kevin Klues)

> Build a Mock HTTP Server that implements the new Debugging API calls
> 
>
> Key: MESOS-6476
> URL: https://issues.apache.org/jira/browse/MESOS-6476
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Steven Locke
>  Labels: debugging, mesosphere
>
> The mock server should simply launch a process to run whatever command is 
> passed to it, rather than attempt to launch an actual nested container in 
> mesos. However, it should do everything necessary to deal with attaching a 
> {{pty}}  / redirecting {{stdin/stdout/stderr}} properly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (MESOS-6525) Add API protos for managing debug containers

2016-11-01 Thread Vinod Kone (JIRA)

Vinod Kone created MESOS-6525:
-

 Summary: Add API protos for managing debug containers
 Key: MESOS-6525
 URL: https://issues.apache.org/jira/browse/MESOS-6525
 Project: Mesos
  Issue Type: Task
Reporter: Vinod Kone
Assignee: Vinod Kone


The API calls that we should add are:

LAUNCH_NESTED_CONTAINER_SESSION
ATTACH_CONTAINER_INPUT
ATTACH_CONTAINER_OUTPUT



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6471) Build support for LAUNCH_NESTED_CONTAINER_SESSION call into the Agent API in Mesos

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6471:
---
Sprint:   (was: Mesosphere Sprint 46)

> Build support for LAUNCH_NESTED_CONTAINER_SESSION call into the Agent API in 
> Mesos
> --
>
> Key: MESOS-6471
> URL: https://issues.apache.org/jira/browse/MESOS-6471
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Kevin Klues
>  Labels: debugging, mesosphere
>
> This HTTP API call will launch a nested container whose life-cycle is tied to 
> the lifetime of the connection used to make this call. Once the agent 
> receives the request, it will hold onto it until the container runs to 
> completion or there is an error. Upon success, a 200 response will be 
> initiated with an “infinite” chunked response (but no data will ever be sent 
> over this connection). On error, an appropriate 400 error will be returned. 
> If the connection is ever broken by the client, the container will be 
> destroyed. 
> This will likely involve modifications to some existing existing protobuf 
> messages. It will also involve changes to {{launch.cpp}} to satisfy the new 
> namespace requirements.
> We will create subtickets as we figure out the details for this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6472) Build support for ATTACH_CONTAINER_INPUT into the Agent API in Mesos

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6472:
---
Sprint:   (was: Mesosphere Sprint 46)

> Build support for ATTACH_CONTAINER_INPUT into the Agent API in Mesos
> 
>
> Key: MESOS-6472
> URL: https://issues.apache.org/jira/browse/MESOS-6472
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Kevin Klues
>  Labels: debugging, mesosphere
>
> Coupled with the ATTACH_CONTAINER_OUTPUT call, this call will attach a remote 
> client to the the input/output of the entrypoint of a container. All 
> input/output data will be packed into I/O messages and interleaved with 
> control messages sent between a client and the agent. A single chunked 
> request will be used to stream messages to the agent over the input stream, 
> and a single chunked response will be used to stream messages to the  client 
> over the output stream.
> This call will integrate with the Mesos internal support for "attaching" to 
> an already running container through the new logger interfaces.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6476) Build a Mock HTTP Server that implements the new Debugging API calls

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6476:
---
Sprint: Mesosphere Sprint 46

> Build a Mock HTTP Server that implements the new Debugging API calls
> 
>
> Key: MESOS-6476
> URL: https://issues.apache.org/jira/browse/MESOS-6476
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Kevin Klues
>  Labels: debugging, mesosphere
>
> The mock server should simply launch a process to run whatever command is 
> passed to it, rather than attempt to launch an actual nested container in 
> mesos. However, it should do everything necessary to deal with attaching a 
> {{pty}}  / redirecting {{stdin/stdout/stderr}} properly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6471) Build support for LAUNCH_NESTED_CONTAINER_SESSION call into the Agent API in Mesos

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6471:
---
Sprint: Mesosphere Sprint 46

> Build support for LAUNCH_NESTED_CONTAINER_SESSION call into the Agent API in 
> Mesos
> --
>
> Key: MESOS-6471
> URL: https://issues.apache.org/jira/browse/MESOS-6471
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Kevin Klues
>  Labels: debugging, mesosphere
>
> This HTTP API call will launch a nested container whose life-cycle is tied to 
> the lifetime of the connection used to make this call. Once the agent 
> receives the request, it will hold onto it until the container runs to 
> completion or there is an error. Upon success, a 200 response will be 
> initiated with an “infinite” chunked response (but no data will ever be sent 
> over this connection). On error, an appropriate 400 error will be returned. 
> If the connection is ever broken by the client, the container will be 
> destroyed. 
> This will likely involve modifications to some existing existing protobuf 
> messages. It will also involve changes to {{launch.cpp}} to satisfy the new 
> namespace requirements.
> We will create subtickets as we figure out the details for this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6472) Build support for ATTACH_CONTAINER_INPUT into the Agent API in Mesos

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6472:
---
Sprint: Mesosphere Sprint 46

> Build support for ATTACH_CONTAINER_INPUT into the Agent API in Mesos
> 
>
> Key: MESOS-6472
> URL: https://issues.apache.org/jira/browse/MESOS-6472
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Kevin Klues
>  Labels: debugging, mesosphere
>
> Coupled with the ATTACH_CONTAINER_OUTPUT call, this call will attach a remote 
> client to the the input/output of the entrypoint of a container. All 
> input/output data will be packed into I/O messages and interleaved with 
> control messages sent between a client and the agent. A single chunked 
> request will be used to stream messages to the agent over the input stream, 
> and a single chunked response will be used to stream messages to the  client 
> over the output stream.
> This call will integrate with the Mesos internal support for "attaching" to 
> an already running container through the new logger interfaces.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6465) Add a task_id -> container_id mapping in state.json

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6465:
---
Sprint: Mesosphere Sprint 46

> Add a task_id -> container_id mapping in state.json
> ---
>
> Key: MESOS-6465
> URL: https://issues.apache.org/jira/browse/MESOS-6465
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Jie Yu
>  Labels: debugging, mesosphere
>
> Currently, there is no way to get the {{container-id}} of a task from hitting 
> the mesos master alone.  You must first hit the master to get the {{task_id 
> -> agent_id}} and {{task_id -> executor_id}} mappings, then hit the 
> corresponding agent with {{agent_id}} to get the {{executor_id -> 
> container_id}} mapping.
> It would simplify things alot if the {{container_id}} information was 
> immediately available in the {{/tasks}} and {{/state}} endpoints of the 
> master itself.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6466) Add support for streaming HTTP requests in Mesos

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6466:
---
Sprint: Mesosphere Sprint 46

> Add support for streaming HTTP requests in Mesos
> 
>
> Key: MESOS-6466
> URL: https://issues.apache.org/jira/browse/MESOS-6466
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Anand Mazumdar
>  Labels: debugging, mesosphere
>
> We already have support for streaming HTTP responses in Mesos. We now also 
> need to add support for streaming HTTP requests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-5900) Consider supporting Unix domain socket connections in libprocess

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-5900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-5900:
---
Assignee: Benjamin Hindman

> Consider supporting Unix domain socket connections in libprocess
> 
>
> Key: MESOS-5900
> URL: https://issues.apache.org/jira/browse/MESOS-5900
> Project: Mesos
>  Issue Type: Improvement
>  Components: libprocess
>Reporter: Neil Conway
>Assignee: Benjamin Hindman
>  Labels: mesosphere
>
> We should consider allowing two programs on the same host using libprocess to 
> communicate via Unix domain sockets rather than TCP. This has a few 
> advantages:
> * Security: remote hosts cannot connect to the Unix socket. Domain sockets 
> also offer additional support for 
> [authentication|https://docs.fedoraproject.org/en-US/Fedora_Security_Team/1/html/Defensive_Coding/sect-Defensive_Coding-Authentication-UNIX_Domain.html].
> * Performance: domain sockets are marginally faster than localhost TCP.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6474) Add fine-grained ACLs for authorization with the new debugging APIs

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6474:
---
Assignee: Alexander Rojas  (was: Kevin Klues)

> Add fine-grained ACLs for authorization with the new debugging APIs
> ---
>
> Key: MESOS-6474
> URL: https://issues.apache.org/jira/browse/MESOS-6474
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Alexander Rojas
>  Labels: debugging, mesosphere, security
>
> We already have ACLs in place for determining if a user has access to see a 
> certain task when querying {{state.json}} on the master/agent, or 
> browse/download a task's sandbox. However, we will have to add similar ACLs 
> for making sure they have the correct permissions to execute the new 
> Debugging APs on behalf of those tasks.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6465) Add a task_id -> container_id mapping in state.json

2016-11-01 Thread Kevin Klues (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kevin Klues updated MESOS-6465:
---
Assignee: Jie Yu  (was: Kevin Klues)

> Add a task_id -> container_id mapping in state.json
> ---
>
> Key: MESOS-6465
> URL: https://issues.apache.org/jira/browse/MESOS-6465
> Project: Mesos
>  Issue Type: Task
>Reporter: Kevin Klues
>Assignee: Jie Yu
>  Labels: debugging, mesosphere
>
> Currently, there is no way to get the {{container-id}} of a task from hitting 
> the mesos master alone.  You must first hit the master to get the {{task_id 
> -> agent_id}} and {{task_id -> executor_id}} mappings, then hit the 
> corresponding agent with {{agent_id}} to get the {{executor_id -> 
> container_id}} mapping.
> It would simplify things alot if the {{container_id}} information was 
> immediately available in the {{/tasks}} and {{/state}} endpoints of the 
> master itself.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (MESOS-6524) Command health checks ignore some fields from CommandInfo.

2016-11-01 Thread Alexander Rukletsov (JIRA)

Alexander Rukletsov created MESOS-6524:
--

 Summary: Command health checks ignore some fields from CommandInfo.
 Key: MESOS-6524
 URL: https://issues.apache.org/jira/browse/MESOS-6524
 Project: Mesos
  Issue Type: Improvement
Reporter: Alexander Rukletsov
Priority: Minor


Command health are specified via {{CommandInfo}}, but ignore several fields: 
{{user}} and {{uris}}. We should either respect these fields or advice users 
not to use them, i.e. fail validation and/or print warnings.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6229) Default to using hardened compilation flags

2016-11-01 Thread Aaron Wood (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Wood updated MESOS-6229:
--
Description: 
Provide a default set of hardened compilation flags to help protect against 
overflows and other attacks. Apply to libprocess and stout as well. Current set 
of flags that were discussed on slack to implement:

-Wformat-security
-Wstack-protector
-fstack-protector-strong (-fstack-protector-all might be overkill, it could be 
more effective to use this. Requires gcc >= 4.9 which should be reasonable. 
Detect compiler support and use what we can but prefer -fstack-protector-strong)
-pie
-fPIE 
-fPIC
-D_FORTIFY_SOURCE=2
-Wl,-z,relro,-z,now (currently not a part of the patch, this should be another 
JIRA)
-fno-omit-frame-pointer

https://reviews.apache.org/r/52645/
https://reviews.apache.org/r/52695/
https://reviews.apache.org/r/52696/


  was:
Provide a default set of hardened compilation flags to help protect against 
overflows and other attacks. Apply to libprocess and stout as well. Current set 
of flags that were discussed on slack to implement:

-Wformat-security
-Wstack-protector
-fstack-protector-strong (-fstack-protector-all might be overkill, it could be 
more effective to use this. Requires gcc >= 4.9 which should be reasonable)
-pie
-fPIE 
-fPIC
-D_FORTIFY_SOURCE=2
-Wl,-z,relro,-z,now (currently not a part of the patch)
-fno-omit-frame-pointer

https://reviews.apache.org/r/52645/
https://reviews.apache.org/r/52695/
https://reviews.apache.org/r/52696/



> Default to using hardened compilation flags
> ---
>
> Key: MESOS-6229
> URL: https://issues.apache.org/jira/browse/MESOS-6229
> Project: Mesos
>  Issue Type: Improvement
>Reporter: Aaron Wood
>Assignee: Aaron Wood
>Priority: Minor
>  Labels: c++, clang, gcc, security
>
> Provide a default set of hardened compilation flags to help protect against 
> overflows and other attacks. Apply to libprocess and stout as well. Current 
> set of flags that were discussed on slack to implement:
> -Wformat-security
> -Wstack-protector
> -fstack-protector-strong (-fstack-protector-all might be overkill, it could 
> be more effective to use this. Requires gcc >= 4.9 which should be 
> reasonable. Detect compiler support and use what we can but prefer 
> -fstack-protector-strong)
> -pie
> -fPIE 
> -fPIC
> -D_FORTIFY_SOURCE=2
> -Wl,-z,relro,-z,now (currently not a part of the patch, this should be 
> another JIRA)
> -fno-omit-frame-pointer
> https://reviews.apache.org/r/52645/
> https://reviews.apache.org/r/52695/
> https://reviews.apache.org/r/52696/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (MESOS-6523) Agent cgroup assignment should precede agent initialization.

2016-11-01 Thread Anindya Sinha (JIRA)

Anindya Sinha created MESOS-6523:


 Summary: Agent cgroup assignment should precede agent 
initialization.
 Key: MESOS-6523
 URL: https://issues.apache.org/jira/browse/MESOS-6523
 Project: Mesos
  Issue Type: Improvement
  Components: cgroups
Reporter: Anindya Sinha
Assignee: Anindya Sinha
Priority: Minor


If mesos-slave is launched with agent_subsystems flag, it would be preferred to 
move the agent process to its own cgroup for all the subsystems before the 
mesos-slave starts accepting traffic, ie. before port 5051 is opened up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Comment Edited] (MESOS-6518) Mesos dashboard: Symlinks not displayed in sandbox view

2016-11-01 Thread JIRA


[ 
https://issues.apache.org/jira/browse/MESOS-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626356#comment-15626356
 ] 

Mischa Krüger edited comment on MESOS-6518 at 11/1/16 7:04 PM:
---

I'm using DC/OS to deploy a service. In my case I'm using a docker image to 
spawn a flink instance:
https://github.com/Makman2/dcos-flink-docker/blob/d754767152e44ad81bff890ced9fba97d5ffd99c/Dockerfile

Also I'm using a custom command, in this case it contains the same `ln` command 
like in the Dockerfile:
{code}
echo "";
ls $MESOS_SANDBOX;
echo "";
ln -s $(pwd)/mesos_sandbox/* $MESOS_SANDBOX/;
echo "ln -s $(pwd)/mesos_sandbox/* $MESOS_SANDBOX/";
ls $MESOS_SANDBOX;
sleep 60; 
{code}

which write following into the stdout file:

{code}
--container="mesos-90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0.4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --docker="docker" --docker_socket="/var/run/docker.sock" --help="false" 
--initialize_driver_logging="true" 
--launcher_dir="/opt/mesosphere/packages/mesos--55e36b7783f1549d26b7567b11090ff93b89487a/libexec/mesos"
 --logbufsecs="0" --logging_level="INFO" 
--mapped_directory="/mnt/mesos/sandbox" --quiet="false" 
--sandbox_directory="/var/lib/mesos/slave/slaves/90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0/frameworks/90e534a9-de1b-4a32-ac01-d7d9d54337f2-/executors/container-test.24183a01-a064-11e6-a8e8-9278bb0b38e1/runs/4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --stop_timeout="20secs"
--container="mesos-90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0.4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --docker="docker" --docker_socket="/var/run/docker.sock" --help="false" 
--initialize_driver_logging="true" 
--launcher_dir="/opt/mesosphere/packages/mesos--55e36b7783f1549d26b7567b11090ff93b89487a/libexec/mesos"
 --logbufsecs="0" --logging_level="INFO" 
--mapped_directory="/mnt/mesos/sandbox" --quiet="false" 
--sandbox_directory="/var/lib/mesos/slave/slaves/90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0/frameworks/90e534a9-de1b-4a32-ac01-d7d9d54337f2-/executors/container-test.24183a01-a064-11e6-a8e8-9278bb0b38e1/runs/4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --stop_timeout="20secs"
Registered docker executor on 10.0.2.185
Starting task container-test.24183a01-a064-11e6-a8e8-9278bb0b38e1

stderr
stderr.logrotate.conf
stdout
stdout.logrotate.conf

ln -s /flink-1.2-SNAPSHOT/mesos_sandbox/* /mnt/mesos/sandbox/
flink-conf.yaml
flink-python_2.10-1.2-SNAPSHOT.jar
flink.jar
log4j-1.2.17.jar
log4j.properties
slf4j-log4j12-1.7.7.jar
stderr
stderr.logrotate.conf
stdout
stdout.logrotate.conf
{code}
But only following files are displayed in the sandbox view:
{code}
stderr
stderr.logrotate.conf
stdout
stdout.logrotate.conf
{code}
so like no linking ever happened. But as you see in the stdout file, the links 
are there! :/

PS:
When I try to spawn a service with following command:
{code}
touch x.txt;
ln -s x.txt my-symlink;
{code}
I can indeed see the symlink appearing in the sandbox view. Very strange...


was (Author: makman2):
I'm using DC/OS to deploy a service. In my case I'm using a docker image to 
spawn a flink instance:
https://github.com/Makman2/dcos-flink-docker/blob/d754767152e44ad81bff890ced9fba97d5ffd99c/Dockerfile

Also I'm using a custom command, in this case it contains the same `ln` command 
like in the Dockerfile:
{code}
echo "";
ls $MESOS_SANDBOX;
echo "";
ln -s $(pwd)/mesos_sandbox/* $MESOS_SANDBOX/;
echo "ln -s $(pwd)/mesos_sandbox/* $MESOS_SANDBOX/";
ls $MESOS_SANDBOX;
sleep 60; 
{code}

which write following into the stdout file:

{code}
--container="mesos-90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0.4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --docker="docker" --docker_socket="/var/run/docker.sock" --help="false" 
--initialize_driver_logging="true" 
--launcher_dir="/opt/mesosphere/packages/mesos--55e36b7783f1549d26b7567b11090ff93b89487a/libexec/mesos"
 --logbufsecs="0" --logging_level="INFO" 
--mapped_directory="/mnt/mesos/sandbox" --quiet="false" 
--sandbox_directory="/var/lib/mesos/slave/slaves/90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0/frameworks/90e534a9-de1b-4a32-ac01-d7d9d54337f2-/executors/container-test.24183a01-a064-11e6-a8e8-9278bb0b38e1/runs/4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --stop_timeout="20secs"
--container="mesos-90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0.4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --docker="docker" --docker_socket="/var/run/docker.sock" --help="false" 
--initialize_driver_logging="true" 
--launcher_dir="/opt/mesosphere/packages/mesos--55e36b7783f1549d26b7567b11090ff93b89487a/libexec/mesos"
 --logbufsecs="0" --logging_level="INFO" 
--mapped_directory="/mnt/mesos/sandbox" --quiet="false"

[jira] [Updated] (MESOS-6518) Mesos dashboard: Symlinks not displayed in sandbox view

2016-11-01 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/MESOS-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mischa Krüger updated MESOS-6518:
-
Attachment: tmp.png

> Mesos dashboard: Symlinks not displayed in sandbox view
> ---
>
> Key: MESOS-6518
> URL: https://issues.apache.org/jira/browse/MESOS-6518
> Project: Mesos
>  Issue Type: Improvement
>  Components: webui
>Reporter: Mischa Krüger
>Assignee: haosdent
>Priority: Minor
> Attachments: sandbox.png, tmp.png
>
>
> Soft symbolic links do not appear in the sandbox, though they exist.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6518) Mesos dashboard: Symlinks not displayed in sandbox view

2016-11-01 Thread JIRA


[ 
https://issues.apache.org/jira/browse/MESOS-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626356#comment-15626356
 ] 

Mischa Krüger commented on MESOS-6518:
--

I'm using DC/OS to deploy a service. In my case I'm using a docker image to 
spawn a flink instance:
https://github.com/Makman2/dcos-flink-docker/blob/d754767152e44ad81bff890ced9fba97d5ffd99c/Dockerfile

Also I'm using a custom command, in this case it contains the same `ln` command 
like in the Dockerfile:
{code}
echo "";
ls $MESOS_SANDBOX;
echo "";
ln -s $(pwd)/mesos_sandbox/* $MESOS_SANDBOX/;
echo "ln -s $(pwd)/mesos_sandbox/* $MESOS_SANDBOX/";
ls $MESOS_SANDBOX;
sleep 60; 
{code}

which write following into the stdout file:

{code}
--container="mesos-90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0.4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --docker="docker" --docker_socket="/var/run/docker.sock" --help="false" 
--initialize_driver_logging="true" 
--launcher_dir="/opt/mesosphere/packages/mesos--55e36b7783f1549d26b7567b11090ff93b89487a/libexec/mesos"
 --logbufsecs="0" --logging_level="INFO" 
--mapped_directory="/mnt/mesos/sandbox" --quiet="false" 
--sandbox_directory="/var/lib/mesos/slave/slaves/90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0/frameworks/90e534a9-de1b-4a32-ac01-d7d9d54337f2-/executors/container-test.24183a01-a064-11e6-a8e8-9278bb0b38e1/runs/4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --stop_timeout="20secs"
--container="mesos-90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0.4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --docker="docker" --docker_socket="/var/run/docker.sock" --help="false" 
--initialize_driver_logging="true" 
--launcher_dir="/opt/mesosphere/packages/mesos--55e36b7783f1549d26b7567b11090ff93b89487a/libexec/mesos"
 --logbufsecs="0" --logging_level="INFO" 
--mapped_directory="/mnt/mesos/sandbox" --quiet="false" 
--sandbox_directory="/var/lib/mesos/slave/slaves/90e534a9-de1b-4a32-ac01-d7d9d54337f2-S0/frameworks/90e534a9-de1b-4a32-ac01-d7d9d54337f2-/executors/container-test.24183a01-a064-11e6-a8e8-9278bb0b38e1/runs/4515a673-3baa-4af3-83b7-b0755fe4ad94"
 --stop_timeout="20secs"
Registered docker executor on 10.0.2.185

Starting task container-test.24183a01-a064-11e6-a8e8-9278bb0b38e1
stderr
stderr.logrotate.conf
stdout
stdout.logrotate.conf

ln -s /flink-1.2-SNAPSHOT/mesos_sandbox/* /mnt/mesos/sandbox/
flink-conf.yaml
flink-python_2.10-1.2-SNAPSHOT.jar
flink.jar
log4j-1.2.17.jar
log4j.properties
slf4j-log4j12-1.7.7.jar
stderr
stderr.logrotate.conf
stdout
stdout.logrotate.conf
{code}
But only following files are displayed in the sandbox view:
{code}
stderr
stderr.logrotate.conf
stdout
stdout.logrotate.conf
{code}
so like no linking ever happened. But as you see in the stdout file, the links 
are there! :/

PS:
When I try to spawn a service with following command:
{code}
touch x.txt;
ln -s x.txt my-symlink;
{code}
I can indeed see the symlink appearing in the sandbox view. Very strange...

> Mesos dashboard: Symlinks not displayed in sandbox view
> ---
>
> Key: MESOS-6518
> URL: https://issues.apache.org/jira/browse/MESOS-6518
> Project: Mesos
>  Issue Type: Improvement
>  Components: webui
>Reporter: Mischa Krüger
>Assignee: haosdent
>Priority: Minor
> Attachments: sandbox.png
>
>
> Soft symbolic links do not appear in the sandbox, though they exist.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-5597) Document Mesos "health check" feature.

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-5597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-5597:
---
Shepherd: Till Toenshoff
Assignee: Alexander Rukletsov
  Sprint: Mesosphere Sprint 46
Target Version/s: 1.2.0
  Issue Type: Documentation  (was: Bug)

> Document Mesos "health check" feature.
> --
>
> Key: MESOS-5597
> URL: https://issues.apache.org/jira/browse/MESOS-5597
> Project: Mesos
>  Issue Type: Documentation
>  Components: documentation
>Reporter: Neil Conway
>Assignee: Alexander Rukletsov
>  Labels: documentation, health-check, mesosphere
>
> We don't talk about this feature at all.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-5597) Document Mesos "health check" feature.

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-5597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-5597:
---
Summary: Document Mesos "health check" feature.  (was: Document Mesos 
"health check" feature)

> Document Mesos "health check" feature.
> --
>
> Key: MESOS-5597
> URL: https://issues.apache.org/jira/browse/MESOS-5597
> Project: Mesos
>  Issue Type: Bug
>  Components: documentation
>Reporter: Neil Conway
>  Labels: documentation, health-check, mesosphere
>
> We don't talk about this feature at all.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Comment Edited] (MESOS-6518) Mesos dashboard: Symlinks not displayed in sandbox view

2016-11-01 Thread haosdent (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626219#comment-15626219
 ] 

haosdent edited comment on MESOS-6518 at 11/1/16 6:15 PM:
--

!sandbox.png|width=200!


was (Author: haosd...@gmail.com):
!sandbox.png!

> Mesos dashboard: Symlinks not displayed in sandbox view
> ---
>
> Key: MESOS-6518
> URL: https://issues.apache.org/jira/browse/MESOS-6518
> Project: Mesos
>  Issue Type: Improvement
>  Components: webui
>Reporter: Mischa Krüger
>Assignee: haosdent
>Priority: Minor
> Attachments: sandbox.png
>
>
> Soft symbolic links do not appear in the sandbox, though they exist.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Comment Edited] (MESOS-6518) Mesos dashboard: Symlinks not displayed in sandbox view

2016-11-01 Thread haosdent (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626219#comment-15626219
 ] 

haosdent edited comment on MESOS-6518 at 11/1/16 6:15 PM:
--

!sandbox.png|width=600px!


was (Author: haosd...@gmail.com):
!sandbox.png|width=200!

> Mesos dashboard: Symlinks not displayed in sandbox view
> ---
>
> Key: MESOS-6518
> URL: https://issues.apache.org/jira/browse/MESOS-6518
> Project: Mesos
>  Issue Type: Improvement
>  Components: webui
>Reporter: Mischa Krüger
>Assignee: haosdent
>Priority: Minor
> Attachments: sandbox.png
>
>
> Soft symbolic links do not appear in the sandbox, though they exist.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6457) Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-6457:
---
Affects Version/s: 0.28.2
   1.0.1

> Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.
> -
>
> Key: MESOS-6457
> URL: https://issues.apache.org/jira/browse/MESOS-6457
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.28.2, 1.0.1
>Reporter: Gastón Kleiman
>Assignee: Gastón Kleiman
>Priority: Blocker
>
> A task can currently transition from {{TASK_KILLING}} to {{TASK_RUNNING}}, if 
> for example it starts/stops passing a health check once it got into the 
> {{TASK_KILLING}} state.
> I think that this behaviour is counterintuitive. It also makes the life of 
> framework/tools developers harder, since they have to keep track of the 
> complete task status history in order to know if a task is being killed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6518) Mesos dashboard: Symlinks not displayed in sandbox view

2016-11-01 Thread haosdent (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent updated MESOS-6518:

Attachment: sandbox.png

!sandbox.png!

> Mesos dashboard: Symlinks not displayed in sandbox view
> ---
>
> Key: MESOS-6518
> URL: https://issues.apache.org/jira/browse/MESOS-6518
> Project: Mesos
>  Issue Type: Improvement
>  Components: webui
>Reporter: Mischa Krüger
>Assignee: haosdent
>Priority: Minor
> Attachments: sandbox.png
>
>
> Soft symbolic links do not appear in the sandbox, though they exist.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6457) Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.

2016-11-01 Thread Alexander Rukletsov (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Rukletsov updated MESOS-6457:
---
Target Version/s: 0.28.3, 1.0.2, 1.1.0  (was: 1.0.2, 1.1.0)

> Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.
> -
>
> Key: MESOS-6457
> URL: https://issues.apache.org/jira/browse/MESOS-6457
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.28.2, 1.0.1
>Reporter: Gastón Kleiman
>Assignee: Gastón Kleiman
>Priority: Blocker
>
> A task can currently transition from {{TASK_KILLING}} to {{TASK_RUNNING}}, if 
> for example it starts/stops passing a health check once it got into the 
> {{TASK_KILLING}} state.
> I think that this behaviour is counterintuitive. It also makes the life of 
> framework/tools developers harder, since they have to keep track of the 
> complete task status history in order to know if a task is being killed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6518) Mesos dashboard: Symlinks not displayed in sandbox view

2016-11-01 Thread haosdent (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626210#comment-15626210
 ] 

haosdent commented on MESOS-6518:
-

Hi, [~Makman2] Symbol links looks work for me. Would you elaborate how you 
create symbol links in the sandbox?

The sandbox contains symbol links which works in in my environment
{code}
bash-3.2$ ls -l
total 24
drwxr-xr-x  3 haosdent  staff  102 Nov  2 02:09 a
lrwxr-xr-x  1 haosdent  staff1 Nov  2 02:09 b -> a
-rw-r--r--  1 haosdent  staff  171 Nov  2 02:08 stderr
-rw-r--r--  1 haosdent  staff  245 Nov  2 02:08 stdout
{code}

> Mesos dashboard: Symlinks not displayed in sandbox view
> ---
>
> Key: MESOS-6518
> URL: https://issues.apache.org/jira/browse/MESOS-6518
> Project: Mesos
>  Issue Type: Improvement
>  Components: webui
>Reporter: Mischa Krüger
>Assignee: haosdent
>Priority: Minor
>
> Soft symbolic links do not appear in the sandbox, though they exist.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6457) Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.

2016-11-01 Thread Joris Van Remoortere (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joris Van Remoortere updated MESOS-6457:

Target Version/s: 1.0.2, 1.1.0  (was: 1.1.0)

> Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.
> -
>
> Key: MESOS-6457
> URL: https://issues.apache.org/jira/browse/MESOS-6457
> Project: Mesos
>  Issue Type: Bug
>Reporter: Gastón Kleiman
>Assignee: Gastón Kleiman
>Priority: Blocker
>
> A task can currently transition from {{TASK_KILLING}} to {{TASK_RUNNING}}, if 
> for example it starts/stops passing a health check once it got into the 
> {{TASK_KILLING}} state.
> I think that this behaviour is counterintuitive. It also makes the life of 
> framework/tools developers harder, since they have to keep track of the 
> complete task status history in order to know if a task is being killed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6457) Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.

2016-11-01 Thread Joris Van Remoortere (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joris Van Remoortere updated MESOS-6457:

Target Version/s: 1.1.0  (was: 1.2.0)

> Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.
> -
>
> Key: MESOS-6457
> URL: https://issues.apache.org/jira/browse/MESOS-6457
> Project: Mesos
>  Issue Type: Bug
>Reporter: Gastón Kleiman
>Assignee: Gastón Kleiman
>Priority: Blocker
>
> A task can currently transition from {{TASK_KILLING}} to {{TASK_RUNNING}}, if 
> for example it starts/stops passing a health check once it got into the 
> {{TASK_KILLING}} state.
> I think that this behaviour is counterintuitive. It also makes the life of 
> framework/tools developers harder, since they have to keep track of the 
> complete task status history in order to know if a task is being killed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6457) Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.

2016-11-01 Thread Neil Conway (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626095#comment-15626095
 ] 

Neil Conway commented on MESOS-6457:


{{TASK_KILLING}} to {{TASK_LOST}} is definitely possible.

{{TASK_KILLING}} => {{TASK_LOST}} => {{TASK_RUNNING}} should be possible if 
{{TASK_KILLING}} => {{TASK_RUNNING}} is possible (I haven't looked into whether 
allowing the latter transition is a good idea or not...)

> Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.
> -
>
> Key: MESOS-6457
> URL: https://issues.apache.org/jira/browse/MESOS-6457
> Project: Mesos
>  Issue Type: Bug
>Reporter: Gastón Kleiman
>Assignee: Gastón Kleiman
>Priority: Blocker
>
> A task can currently transition from {{TASK_KILLING}} to {{TASK_RUNNING}}, if 
> for example it starts/stops passing a health check once it got into the 
> {{TASK_KILLING}} state.
> I think that this behaviour is counterintuitive. It also makes the life of 
> framework/tools developers harder, since they have to keep track of the 
> complete task status history in order to know if a task is being killed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6457) Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.

2016-11-01 Thread JIRA


[ 
https://issues.apache.org/jira/browse/MESOS-6457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626057#comment-15626057
 ] 

Gastón Kleiman commented on MESOS-6457:
---

[~zhitao] yes, this behaviour only affect the default and the command executors.

I believe that it is still possible for a task in {{TASK_KILLING}} to 
transition to {{TASK_LOST}}. Given that {{TASK_LOST}} is not a terminal state, 
I don't know if it would then be possible for the task to transition to 
{{TASK_RUNNING}} instead of to {{TASK_KILLING}}/{{TASK_KILLED}}. Maybe [~neilc] 
knows.

> Tasks shouldn't transition from TASK_KILLING to TASK_RUNNING.
> -
>
> Key: MESOS-6457
> URL: https://issues.apache.org/jira/browse/MESOS-6457
> Project: Mesos
>  Issue Type: Bug
>Reporter: Gastón Kleiman
>Assignee: Gastón Kleiman
>Priority: Blocker
>
> A task can currently transition from {{TASK_KILLING}} to {{TASK_RUNNING}}, if 
> for example it starts/stops passing a health check once it got into the 
> {{TASK_KILLING}} state.
> I think that this behaviour is counterintuitive. It also makes the life of 
> framework/tools developers harder, since they have to keep track of the 
> complete task status history in order to know if a task is being killed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (MESOS-6522) Ability to set global maximum executor runtime for an agent

2016-11-01 Thread Will Rouesnel (JIRA)

Will Rouesnel created MESOS-6522:


 Summary: Ability to set global maximum executor runtime for an 
agent
 Key: MESOS-6522
 URL: https://issues.apache.org/jira/browse/MESOS-6522
 Project: Mesos
  Issue Type: Improvement
  Components: slave
Reporter: Will Rouesnel
Priority: Minor


With the developing concept of agent maintenance mode, it would be nice to have 
some blunt-force ability to reason about the behavior of agents on 
uncooperative frameworks.

Ideally there would be a new parameter --executor_maximum_lifetime which would 
specify a maximum duration for which *any* executor on an agent can run before 
being terminated.

Even when using persistent schedulers such as Marathon, the ability to enforce 
reasonable gurantees about when an agent's tasks definitely must end can help 
contribute to keeping the cluster turning over and prevent nodes becoming 
"special" or jammed up with jobs which will not end.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (MESOS-6494) Clean up the flags parsing in the executors.

2016-11-01 Thread Alexander Rukletsov (JIRA)


[ 
https://issues.apache.org/jira/browse/MESOS-6494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15625362#comment-15625362
 ] 

Alexander Rukletsov commented on MESOS-6494:


{noformat}
commit e802760c932e766775eba481d890d43232693815
Author: Gastón Kleiman gas...@mesosphere.com
Date:   Mon Oct 31 16:07:42 2016 -0700

Removed outdated TODO in stout::flags.

Review: https://reviews.apache.org/r/52878/
{noformat}
{noformat}
commit 0d116e099b6568dd0b99a973df57f3581b81a8a4
Author: Gastón Kleiman gas...@mesosphere.com
Date:   Mon Oct 31 16:09:04 2016 -0700

Added parsers for 'SlaveID', 'ExecutorID' and 'FrameworkID'.

Review: https://reviews.apache.org/r/53197/
{noformat}

> Clean up the flags parsing in the executors.
> 
>
> Key: MESOS-6494
> URL: https://issues.apache.org/jira/browse/MESOS-6494
> Project: Mesos
>  Issue Type: Improvement
>Reporter: Gastón Kleiman
>Assignee: Gastón Kleiman
>  Labels: mesosphere
>
> The current executors and the executor libraries use a mix of `stout::flags` 
> and `os::getenv` to parse flags, leading to a lot of unnecessary and 
> sometimes duplicated code.
> This should be cleaned up, using only {{stout::flags}} to parse flags.
> Environment variables should be used for the flags that are common to ALL the 
> executors (listed in the Executor HTTP API doc).
> Command line parameters should be used for flags that apply only to 
> individual executors.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6459) PosixRLimitsIsolatorTest.TaskExceedingLimit fails on OS X

2016-11-01 Thread Benjamin Bannier (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Bannier updated MESOS-6459:

Sprint: Mesosphere Sprint 46

> PosixRLimitsIsolatorTest.TaskExceedingLimit fails on OS X
> -
>
> Key: MESOS-6459
> URL: https://issues.apache.org/jira/browse/MESOS-6459
> Project: Mesos
>  Issue Type: Bug
>Reporter: Gastón Kleiman
>Assignee: Benjamin Bannier
>  Labels: mesosphere
> Fix For: 1.2.0
>
>
> This test consistently fails on OS X:
> {code}
> 31-7e9c-4248-acfd-21634150a657@172.28.128.1:64864 on agent 
> 52cc4957-1a39-4d66-ace6-5622fac3b85e-S0
> ../../src/tests/containerizer/posix_rlimits_isolator_tests.cpp:120: Failure
> Value of: statusFailed->state()
>   Actual: TASK_FINISHED
> Expected: TASK_FAILED
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (MESOS-6516) Parallel test running does not respect GTEST_FILTER

2016-11-01 Thread Benjamin Bannier (JIRA)


 [ 
https://issues.apache.org/jira/browse/MESOS-6516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Bannier updated MESOS-6516:

Sprint: Mesosphere Sprint 46

> Parallel test running does not respect GTEST_FILTER
> ---
>
> Key: MESOS-6516
> URL: https://issues.apache.org/jira/browse/MESOS-6516
> Project: Mesos
>  Issue Type: Bug
>  Components: tests
>Reporter: Neil Conway
>Assignee: Benjamin Bannier
>  Labels: mesosphere
>
> Normally, you can use {{GTEST_FILTER}} to control which tests will be run by 
> {{make check}}. However, this doesn't currently work if Mesos is configured 
> with {{--enable-parallel-test-execution}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

71 matches

Mail list logo