How can I enable forwarding in a mesos container

2020-12-31 Thread Marc Roos
If I run it as root I get: sysctl: error setting key 'net.ipv4.ip_forward': Read-only file system. I am looking for something like this: docker run --sysctl net.ipv4.ip_forward=1 someimage

Problems getting the new mvp csi working

2020-12-21 Thread Marc Roos
I have been looking forward to the update of mesos offering this mvp csi, mainly to finally be able to use ceph. But unfortunately I am still not able to get a simple rbd image attached to a container. I am able to use the csilvm by adding the volume like this[2], but the cephcsi keeps

new mvp how to use block csi lvm

2020-12-18 Thread Marc Roos
If I use the csilvm driver, I am able to use a published volume with this task[1] with an xfs fs. However, when I try to add the volume as a block device, the task[3] fails to deploy. The log[4], however, seems ok and shows a mount and unmount. Should I change more than just accessType and fsType? The

csi specification handles volumeid(?)

2020-12-17 Thread Marc Roos
I hope nobody minds putting this here since the csi mailing list is on invitation only, and Jie Yu seems to be everywhere ;) I am having some problems understanding how the cephcsi plugin works. I am using the csc[1] from the rexray people which I believe to have quite some history with the

csi volume

2020-12-15 Thread Marc Roos
When I create this task[2], I am getting the error message: "There was a problem with your configuration general: App creation unsuccessful. Check your app settings and try again." I have the csi managed plugin running and can mount with the command line csc[1]. What should I look at to

virtual memory task on mesos ~40GB while on docker ~3G

2020-12-13 Thread Marc Roos
When I launch a task via docker with: docker run --memory 2G --memory-swappiness 0 -v /dev/log:/dev/log -it --network host marathon:1.11.24 this task seems to use ~400MB resident and 2.8GB virtual. When I launch the same task on mesos, it seems to use ~900MB resident,

marathon plugin interface for mesos 1.11

2020-12-12 Thread Marc Roos
I wanted to test csi in mesos 1.11, but noticed that I am using a plugin for marathon that does not load any more. It has this in the "build.sbt" file: libraryDependencies += "mesosphere.marathon" %% "plugin-interface" % "1.6.325" % "provided" I assume this needs to be changed to a newer

RE: hostname in task

2020-12-08 Thread Marc Roos
Hi James, Sorry to bring this up again. But I have marathon constantly logging because of using the host name from the host networking, instead of using its own task name marathon.xxx.xxx.xxx.mesos as a host name for which there is a certificate. Do you have an example of setting the

Marathon shutdown after master connection lost

2020-11-29 Thread Marc Roos
I hope nobody minds that I am crossposting this to mesos, since there is not much activity on the marathon mailing list. Is there an option to keep marathon running, having it try to reconnect to the mesos-master after it lost connection? Currently I am running sort of a test cluster with

Package mesos-1.11.0-2.0.1.el7.x86_64.rpm is not signed

2020-11-28 Thread Marc Roos
Package mesos-1.11.0-2.0.1.el7.x86_64.rpm is not signed

RE: Suddenly all tasks gone, framework at completed, cannot start framework

2020-11-11 Thread Marc Roos
Is there a way to change this failover_timeout after the framework is running? Via the api or so? I see it is changed when the leader is changing. -Original Message- To: user Cc: cf.natali; janiszt Subject: RE: Suddenly all tasks gone, framework at completed, cannot start framework

Changing logging timestamp

2020-09-20 Thread Marc Roos
I have a default remote syslog setup on centos. All applications and servers log the same timestamp (zone), except mesos and marathon tasks. I assume UTC times are sent from them. How can I set this back to the 'hosts default'?

RE: Paid help for getting csi ceph working

2020-09-14 Thread Marc Roos
is not available yet. We are currently working on an alternative way to get external storage into Mesos instead of using SLRP. Please watch the progress here: https://issues.apache.org/jira/browse/MESOS-10141 . MVP support will land in the upcoming release of Mesos. On Mon, Sep 7, 2020 at 2:08 PM Marc Roos

Paid help for getting csi ceph working

2020-09-07 Thread Marc Roos
Is there anyone interested in giving some paid help to get me up and running with an slrp with ceph? I assume this slrp is still not available?

slrp csi ceph rbd static volume possible with mesos 1.10

2020-08-26 Thread Marc Roos
I would like to map a ceph rbd device to a task as a static/pre-existing volume. Is there any guide on how to do this?

recommended ceph csi plugin?

2020-08-26 Thread Marc Roos
Is there a recommended csi ceph plugin? I found this one[1] but I think it is only usable for kubernetes since it requires secrets to be stored in some kubernetes property. [1] https://github.com/ceph/ceph-csi

RE: marathon (or java) container constantly oom

2020-08-26 Thread Marc Roos
, Tomas On Wed, 26 Aug 2020 at 11:11, Marc Roos wrote: Recently I enabled the cpu and memory isolators on my test cluster. And since then I have been seeing the marathon containers (when becoming leader) increase memory usage from ~400MB until they oom at 850MB

marathon (or java) container constantly oom

2020-08-26 Thread Marc Roos
Recently I enabled the cpu and memory isolators on my test cluster. And since then I have been seeing the marathon containers (when becoming leader) increase memory usage from ~400MB until they oom at 850MB (checking via systemd-cgtop). Now I am testing with these settings from this page[1]

FW: How to configure a pre-existing slrp volume/disk

2020-08-25 Thread Marc Roos
On this dcos manual[1] only the use of a profile from an slrp is listed. Does anyone know how to change this to a pre-existing (lvm) volume? (A mesos example is also welcome ;) ) cat > app2.json <> data/foo && cat data/foo && sleep 5000", "container": { "docker": { "image":

RE: Suddenly all tasks gone, framework at completed, cannot start framework

2020-08-25 Thread Marc Roos
Message- From: Tomek Janiszewski [mailto:jani...@gmail.com] Sent: Tuesday 25 August 2020 16:55 To: user Subject: Re: Suddenly all tasks gone, framework at completed, cannot start framework See: https://stackoverflow.com/a/42544023/1387612 Tue, 25 Aug 2020 at 15:07 Marc Roos wrote

RE: Suddenly all tasks gone, framework at completed, cannot start framework -

2020-08-25 Thread Marc Roos
I assume this was because something happened with zookeeper, and it restarted loading the wrong configuration file without the quorum=1. Because I was testing with different zookeeper rpms (mesos rpm conf is not standard location) Question: Is this by design that all tasks are terminated

Suddenly all tasks gone, framework at completed, cannot start framework

2020-08-25 Thread Marc Roos
Today all my tasks are down and framework marathon is at completed. Any idea how this can happen? ed.cpp:520] Successfully authenticated with master master@192.168.10.151:5050 I0825 13:03:27.961248 108 sched.cpp:1188] Got error 'Framework has been removed'

mesosphere csilvm doesn't have socket after startup

2020-08-22 Thread Marc Roos
I am not sure if the csi standard requires that CSI_ENDPOINT should be set, in any case. - csilvm does not work without specifically setting -unix-addr-env CSI_ENDPOINT. So either document this or make it default. - I could not test with csc on csilvm master branch only this update csi

csi drivers endpoint errors, maybe update slrp page with info on how to configure these csi endpoints

2020-08-22 Thread Marc Roos
E0815 18:43:38.774154 1073 service_manager.cpp:751] Container daemon for 'org-apache-mesos-rp-local-storage-local_blockdevices--nfs-csi-k8s-io-csi_blockdevices--CONTROLLER_SERVICE-NODE_SERVICE' failed: Timed out waiting for endpoint 'unix:///tmp/mesos-csi-iJusqh/endpoint.sock' E0815

cni chaining, bandwidth plugin

2020-08-21 Thread Marc Roos
You should reconsider supporting cni 0.3.0, so people can use this cni bandwidth plugin[1] [1] https://github.com/containernetworking/plugins/tree/master/plugins/meta/bandwidth

RE: How to test if slrp is working correctly

2020-08-20 Thread Marc Roos
No one able to help? ;) -Original Message- To: user Subject: How to test if slrp is working correctly I am testing with slrp and csi drivers after watching this video[1] of mesosphere. I would like to know how I can verify that the slrp is properly configured and working. 1. Can

How to test if slrp is working correctly

2020-08-17 Thread Marc Roos
I am testing with slrp and csi drivers after watching this video[1] of mesosphere. I would like to know how I can verify that the slrp is properly configured and working. 1. Can I use an api endpoint to query controller/list-volumes or do a controller/create-volume. I found this csc tool

RE: mesos csi test plugin slrp 401 Unauthorized

2020-08-15 Thread Marc Roos
If I disable authenticate_http_readwrite authenticate_http_readonly. My test slrp's are indeed loaded and I see tasks running. Launching these tasks as described on the manual page via curl[1] also fails. The task is not running, but I see that curl commands json is being put in the

A more practical guide on how to configure and get csi working (preferably with ceph)

2020-08-14 Thread Marc Roos
Can anyone point me to a more practical guide on how to configure and get csi working (preferably with ceph)?

test-csi-plugin should work?

2020-08-14 Thread Marc Roos
This option has no effect when using the HTTP scheduler/executor APIs. By default, this option is true. (default: true) --log_dir=VALUE Location to put log files. By default, nothing is

mesos csi test plugin slrp 401 Unauthorized

2020-08-14 Thread Marc Roos
I am testing with this Failed to recover resource provider with type 'org.apache.mesos.rp.local.storage' and name 'test_slrp': Failed to get containers: Unexpected response '401 Unauthorized' (401 Unauthorized.) Is this because I am having authentication on, and the standalone container

RE: srv port lookups on tasks with cni networks

2020-08-12 Thread Marc Roos
"container": { "type": "MESOS", "portMappings": [ {"hostPort": 0, "name": "https", "protocol": "tcp", "networkNames": ["cni-apps"]}, {"hostPort": 0, "name": "metrics", "protocol": "tcp", "networkNames": ["cni-apps"]} ], -Original Message- To: user

Are cni networks launched in the sequence in which they are configured?

2020-08-11 Thread Marc Roos
I was wondering if cni networks were always applied in sequence. I am seeing the same order of eth0, eth1 etc. But is it true that the 2nd network is only created when the first was successfully completed/attached?

RE: Asymmetric route possible between agent and container?

2020-08-08 Thread Marc Roos
ll provide a unix domain socket that the executor can use to connect to the agent, instead of relying on a TCP connection. Regards, Qian Zhang On Sat, Aug 8, 2020 at 4:59 PM Marc Roos wrote: "it is imperative that the Agent IP is reachable from the container IP

Asymmetric route possible between agent and container?

2020-08-08 Thread Marc Roos
"it is imperative that the Agent IP is reachable from the container IP and vice versa." Anyone know/tested if this can be an asymmetric route when you are having multiple networks? [1] http://mesos.apache.org/documentation/latest/cni/

Error "networkNames must be a single item list when hostPort is specified and more than 1 container network is defined"

2020-08-05 Thread Marc Roos
I am getting this error message when launching a task with portMappings and two container networks. What is the proper way to configure this? general: networkNames must be a single item list when hostPort is specified and more than 1 container network is defined "networks": [ {

srv port lookups on tasks with cni networks

2020-08-05 Thread Marc Roos
How can I assign random ports to a cni network and read these back from srv? What is the equivalent of portDefinitions at network/host for network/container? -Original Message- To: user Subject: health check not working after changing host network If I change a task from:

health check not working after changing host network

2020-08-04 Thread Marc Roos
If I change a task from: "networks": [ { "mode": "host" } ], "portDefinitions": [ {"port": 0, "name": "health", "protocol": "tcp"}, {"port": 0, "name": "metrics", "protocol": "tcp"} ], To: "networks": [ { "mode": "container", "name": "cni-storage" } ],

Ceph support planned?

2020-07-31 Thread Marc Roos
Is native ceph support in the planning? Libvirt supports ceph with librbd[1]. What is currently the best practice to use ceph storage? [1] https://docs.ceph.com/docs/master/rbd/libvirt/

mesos master default drop acl

2020-07-30 Thread Marc Roos
Currently I am running on a testing environment with some default acl I found[1]. I have configured mesos-credentials, and afaik everything agents/marathon framework is authenticating. So I thought about converting the acl to default drop/deny. However I see there are quite a few options.

Fyi: nginx+ srv lookups also now available in basic nginx

2020-07-29 Thread Marc Roos
For anyone who is interested. I was surprised that nginx was not offering srv lookups in their free version. I found a module that offered this, however it did not work because of syntax differences in srv lookups on mesos. I adapted this module to force sending a whole srv domain, and tests

RE: random string in task groups hostname

2020-07-28 Thread Marc Roos
2nd, I have the impression that SRV records are not correctly implemented: should ._tcp not be at the front (after the service) instead of in the middle? Or do I have something incorrect in my mesos configuration that makes these groups act as part of the task name? [@]$ dig +short

random string in task groups hostname

2020-07-28 Thread Marc Roos
I cannot remember seeing this before. I wonder whether this is common and whether it is supposed to be like this. In srv lookups I am getting a random string in the group. Why is test appended with '-grxx9-s0'? [@~]$ dig +short @192.168.10.14 server.temp.test.marathon.mesos 192.168.10.151 [@~]$ dig +short

RE: getting correct metrics port from SRV records.

2020-07-27 Thread Marc Roos
Oops ;) [@test2 image-synapse]$ dig +short @192.168.10.14 _metrics._synapse.dev._tcp.marathon.mesos SRV 0 1 31032 synapse.dev-nppzf-s0.marathon.mesos. -Original Message- To: user Subject: getting correct metrics port from SRV records. Is there a way to identify the correct port
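The SRV answer quoted above consists of four fields: priority, weight, port and target. A minimal Python sketch, assuming the `dig +short` answer format shown in the message, that extracts the port from such a line. The names `SrvRecord` and `parse_srv` are hypothetical and not part of mesos-dns or marathon:

```python
from typing import NamedTuple


class SrvRecord(NamedTuple):
    """One SRV answer: priority, weight, port, target (hypothetical helper type)."""
    priority: int
    weight: int
    port: int
    target: str


def parse_srv(line: str) -> SrvRecord:
    """Parse one `dig +short ... SRV` answer line of the form
    'priority weight port target.'"""
    priority, weight, port, target = line.split()
    # Drop the trailing dot of the fully qualified target name.
    return SrvRecord(int(priority), int(weight), int(port), target.rstrip("."))


# Answer line taken from the message above.
record = parse_srv("0 1 31032 synapse.dev-nppzf-s0.marathon.mesos.")
print(record.port, record.target)
```

This only reads the port out of an answer that was already looked up by port name (here `_metrics`); it does not recover port names from a record that lacks them.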

getting correct metrics port from SRV records.

2020-07-27 Thread Marc Roos
Is there a way to identify the correct port via dns? I have created a task with two ports[1]. But a dns srv query does not show anything different than the port number. How can I identify the correct port? Mesos-master tasks endpoint[3] shows the port names, is there a way to get these from

RE: fyi: mesos-dns is not registering all ip addresses

2020-07-27 Thread Marc Roos
n": true, "listener": "0.0.0.0", "SOAMname": "ns1.mesos", "SOARname": "root.ns1.mesos", "SOARefresh": 5, "SOARetry": 600, "SOAExpire": 86400, "SOAMinttl": 5, "IPSources"

fyi: mesos-dns is not registering all ip addresses

2020-07-27 Thread Marc Roos
I am not sure if mesos-dns is discontinued. But for the ones still using it: in some cases it does not register all of a task's ip addresses. The default[2] works, but if you have this setup[1] it will only register one ip address, 192.168.122.140, and not the 2nd. I filed an issue a year ago or so[3]

Mesos syslog logging to error level instead of info?

2020-07-24 Thread Marc Roos
I have my test cluster of mesos on again, and I am having mesos-master logs end up in the wrong logs. I think mesos is not logging to correct levels/facility. (using mesos-1.10.0-2.0.1.el7.x86_64) Eg. I have got this on level error: Jul 24 12:25:16 m01 mesos-master[28922]: I0724

RE: Advice on alternative for marathon framework

2020-07-15 Thread Marc Roos
Thanks Tomek, have it running, giving it a try. -Original Message- To: user Subject: Re: Advice on alternative for marathon framework You can try https://github.com/HubSpot/Singularity Aurora was moved to attic <https://attic.apache.org/> Wed, 15 Jul 2020 at 16:29 Mar

Advice on alternative for marathon framework

2020-07-15 Thread Marc Roos
I am having problems[1] getting marathon to run since march (can only run 1.7) and the only emails I receive from d2iq is how to rate their support. I wonder if this Marathon is still best to be used with mesos. I have aurora running, but it looks to have less options. What I like about the

problems running marathon >=1.8 on mesos

2020-06-07 Thread Marc Roos
I am cross posting this to mesos-users, hoping someone has come across this issue and can help me resolve it. There are several JIRA issues open with similar symptoms. All of a sudden I am having problems with marathon ui getting stuck at 'loading' and end points like

RE: No offers are being made -- how to debug Mesos?

2020-06-06 Thread Marc Roos
Did you already put these on debug? [@ ]# cat /etc/mesos-master/logging_level WARNING [@ ]# cat /etc/mesos-slave/logging_level WARNING -Original Message- From: Benjamin Wulff [mailto:benjamin.wulff...@ieee.org] Sent: Saturday 6 June 2020 13:36 To: user@mesos.apache.org Subject: No

RE: Subject: [VOTE] Release Apache Mesos 1.10.0 (rc1)

2020-05-28 Thread Marc Roos
* ability for an executor to communicate with an agent via Unix domain socket instead of TCP I think this will solve my problem with tasks running on different ip which I was doing via a local route. But somehow this route was not being used in mesos. While ping to the netspace were ok.

RE: Found no roles suitable for revive repetition.

2020-03-18 Thread Marc Roos
contact the marathon mailing list or slack channel. Also, if there is a question here or some more context, please include that so they know what you need help with. On Wed, Mar 18, 2020 at 9:46 AM Marc Roos wrote: Marathon is stuck on 'loading applications

registered in SERVER runtime does not implement any provider interfaces applicable in the SERVER runtime.

2020-03-18 Thread Marc Roos
I am getting these; they were already reported on Jira a long time ago. How can I fix them? der mesosphere.marathon.api.v2.PodsResource will be ignored. (org.glassfish.jersey.internal.inject.Providers:MarathonHttpService STARTING) Mar 18 16:38:21 m01 marathon: [2020-03-18 16:38:21,785] WARN A

Found no roles suitable for revive repetition.

2020-03-18 Thread Marc Roos
Marathon is stuck on 'loading applications' Mar 18 14:43:48 m01 marathon: [2020-03-18 14:43:48,646] INFO Received fake heartbeat task-status update (mesosphere.marathon.core.heartbeat.MesosHeartbeatMonitor:Thread-30) Mar 18 14:43:53 m01 marathon: [2020-03-18 14:43:53,321] INFO Found no

Failed to send 'mesos.internal.FrameworkErrorMessage'

2020-02-22 Thread Marc Roos
I am getting these on a test setup, where marathon and mesos-master running on the same node and iptables is not even configured. W0222 23:03:48.829741 1112 process.cpp:1917] Failed to send 'mesos.internal.FrameworkErrorMessage' to '192.168.10.151:35530', connect: Failed connect, connection

RE: cni iptables best practice

2020-02-05 Thread Marc Roos
it adheres to the project's quality standards). On Wed, Feb 5, 2020 at 8:57 AM Marc Roos wrote: Is this possible? I would like to start using mesos in production to be honest. -Original Message- Sent: 30 January 2020 18

RE: cni iptables best practice

2020-02-05 Thread Marc Roos
practice I do not think we plan to do it in short term. Regards, Qian Zhang On Tue, Jan 28, 2020 at 1:54 AM Marc Roos wrote: Hi Qian, Any idea on when this cni 0.3 is going to be implemented? I saw the issue priority is Major, can't remember

Kill task, but not restarted

2020-02-02 Thread Marc Roos
Because the instance was not showing in the marathon gui, I killed a task with kill -KILL, assuming it would restart, yet it did not. I think it has to do with these messages. Why do I even get these, when I can just ping them? W0202 14:46:51.215673 359364 process.cpp:1480] Failed to

RE: cni iptables best practice

2020-01-30 Thread Marc Roos
. -Original Message- Cc: user Subject: Re: cni iptables best practice I do not think we plan to do it in short term. Regards, Qian Zhang On Tue, Jan 28, 2020 at 1:54 AM Marc Roos wrote: Hi Qian, Any idea on when this cni 0.3 is going to be implemented? I

RE: cni iptables best practice

2020-01-27 Thread Marc Roos
think there is a ticket to trace it: https://issues.apache.org/jira/browse/MESOS-7079. Regards, Qian Zhang On Sat, Dec 14, 2019 at 7:08 AM Marc Roos wrote: Is anyone applying iptables rules in their cni networking, and how? I wrote a iptables chaining plugin but cannot

RE: cni iptables best practice

2019-12-14 Thread Marc Roos
think there is a ticket to trace it: https://issues.apache.org/jira/browse/MESOS-7079. Regards, Qian Zhang On Sat, Dec 14, 2019 at 7:08 AM Marc Roos wrote: Is anyone applying iptables rules in their cni networking, and how? I wrote a iptables chaining plugin but cannot

cni iptables best practice

2019-12-13 Thread Marc Roos
Is anyone applying iptables rules in their cni networking, and how? I wrote an iptables chaining plugin but cannot use it because this cni 0.3.0 is still not supported in mesos 1.9. I wondered how this is done currently

Iptables

2019-11-03 Thread Marc Roos
How to set iptable rules inside a container? I am getting these Fatal: can't open lock file /run/xtables.lock: Permission denied Fatal: can't open lock file /run/xtables.lock: Permission denied Fatal: can't open lock file /run/xtables.lock: Permission denied Fatal: can't open lock file

Degraded performance container vs vm (-80% !!!)

2019-10-22 Thread Marc Roos
I still have degraded performance with mesos 1.9; any help to sort this out would be nice. It also makes me wonder whether others have bothered testing this. I am still testing with mesos and thus have a mostly default setup. Previously when I opened this thread, there was questioning about

changing /etc/hosts in container

2019-10-21 Thread Marc Roos
What are my options for adding a host entry to /etc/hosts in a container that is not running as root?

RE: Is chained cni networks supported in mesos 1.7

2019-10-21 Thread Marc Roos
the CNI chain support? -Gilbert On Wed, Jul 24, 2019 at 12:52 PM Marc Roos wrote: Hmm, I guess I should not get my hopes up this will be there soon? [0] https://issues.apache.org/jira/browse/MESOS-7178 -Original Message

RE: Mesos task example json

2019-10-14 Thread Marc Roos
Thanks Benjamin, I will bookmark these. -Original Message- To: user@mesos.apache.org Subject: Re: Mesos task example json Hi Marc, > You also know how/where to put the capabilities? I am struggling with > that. Have a look at the protobufs which define this API: * `TaskInfo`

Don't understand how to use mesos capabilities

2019-10-14 Thread Marc Roos
Don't understand how to use mesos capabilities as described here[0] 1. removed caps from ping with setcap 'cap_net_raw=-p' /usr/bin/ping 2. linux/capabilities in the isolators, 3. mesos-slave running as root, 4. did not set effective_capabilities nor bounding_capabilities 5. Running kernel

RE: Mesos task example json

2019-10-14 Thread Marc Roos
type": "SCALAR", "scalar": {"value": 128}} ], "command": { "value": "sleep 10" }, "container": { "type": "MESOS", "mesos": { "image": { "type": "DOCKER", "docker": { "name": "busybox" } } } } } Regards, Qian Zhang On Sat, Oct 12, 2019 at 6:26 AM Marc Roos wrote: Is there some example json available with all options for use with 'mesos-execute --task='
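The reply above is cut off at the start, so only the tail of the task definition is visible. A minimal Python sketch, assuming the usual `TaskInfo` JSON field names, that builds a comparable document for `mesos-execute --task=`. The task name, the task id, the cpus value and the choice of "mem" for the 128 resource are assumptions, since that part of the message is elided; the command and container sections follow the visible part of the snippet:

```python
import json

# Assumed values (not visible in the truncated message): name, task_id,
# the cpus resource, and that the scalar 128 belongs to "mem".
task = {
    "name": "test",
    "task_id": {"value": "test"},
    "agent_id": {"value": ""},
    "resources": [
        {"name": "cpus", "type": "SCALAR", "scalar": {"value": 0.1}},
        {"name": "mem", "type": "SCALAR", "scalar": {"value": 128}},
    ],
    # The parts below follow the visible tail of the reply.
    "command": {"value": "sleep 10"},
    "container": {
        "type": "MESOS",
        "mesos": {
            "image": {"type": "DOCKER", "docker": {"name": "busybox"}}
        },
    },
}

# Serialize to JSON; the result could be saved to a file for mesos-execute.
task_json = json.dumps(task, indent=2)
print(task_json)
```

Building the document in code and serializing it avoids hand-editing nested braces, which is where the flattened one-line form is easy to get wrong.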

Mesos task example json

2019-10-11 Thread Marc Roos
Is there some example json available with all options for use with 'mesos-execute --task='

mesos 1.9 should have mesos task not?

2019-10-11 Thread Marc Roos
[@~]# mesos help Usage: mesos [OPTIONS] Available commands: help dns daemon.sh agent start-cluster.sh master start-agents.sh start-masters.sh start-slaves.sh stop-agents.sh stop-cluster.sh stop-masters.sh stop-slaves.sh tail cat

NET_ADMIN permission equivalent for mesos

2019-10-10 Thread Marc Roos
I have a docker image that requires NET_ADMIN, I have found this[0] (for the docker containerizer?), but what is the syntax for the mesos containerizer. [0] { "cpus": 0.1, "mem": 50, "id": "/openvpn", "instances": 1, "container": { "docker": { "image":

Kernel module restrictions on launched task?

2019-10-08 Thread Marc Roos
Are there any restrictions on a launched task that could block access to ipsec in the kernel? I am getting this in the launched task Oct 8 16:05:19 c02 ipsec_starter[695921]: no netkey IPsec stack detected Oct 8 16:05:19 c02 ipsec_starter[695921]: no KLIPS IPsec stack detected Oct 8

RE: Task list node

2019-10-01 Thread Marc Roos
.name) \(.state) \(.slave_id)" ' -Original Message- To: user Subject: Re: Task list node You can just mimic UI behaviour and use /state endpoint and filter it with jq. Tue, 1 Oct 2019 at 13:56 Marc Roos wrote: Hmmm, if do something like this[0

RE: Task list node

2019-10-01 Thread Marc Roos
and filter them locally with jq http://mesos.apache.org/documentation/latest/endpoints/master/tasks/ Thu, 26 Sep 2019 at 22:09 Marc Roos wrote: What would be the easiest way to list running tasks on a node/agent/slave?
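The local filtering suggested above can also be done without jq. A minimal Python sketch, assuming the response of the master tasks endpoint has already been fetched and parsed into a dict with a top-level "tasks" list whose entries carry "name", "state" and "slave_id". The function name and the sample data are hypothetical:

```python
def running_tasks_on_agent(tasks_doc: dict, agent_id: str) -> list:
    """Return the names of tasks in state TASK_RUNNING on the given agent."""
    return [
        task["name"]
        for task in tasks_doc.get("tasks", [])
        if task.get("slave_id") == agent_id
        and task.get("state") == "TASK_RUNNING"
    ]


# Hypothetical sample shaped like a parsed tasks response.
sample = {
    "tasks": [
        {"name": "web", "state": "TASK_RUNNING", "slave_id": "agent-S0"},
        {"name": "db", "state": "TASK_RUNNING", "slave_id": "agent-S1"},
        {"name": "job", "state": "TASK_FINISHED", "slave_id": "agent-S0"},
    ]
}

print(running_tasks_on_agent(sample, "agent-S0"))
```

The agent id used for filtering has to be looked up separately, since tasks reference the agent by id rather than by hostname.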

Maybe new feature/option for the health check

2019-09-30 Thread Marc Roos
I have a few tasks that take a while before they get started. Sendmail, for example, is not too happy that you cannot set the hostname (in marathon) and then gives a timeout of 1 minute. I think there is something similar when starting openldap. If I enable a regular health check there, it will fail the

Problems with tasks and cni networking after upgrading from 1.8 to 1.9

2019-09-28 Thread Marc Roos
Looks like my tasks that have dual networking, a gateway and cni_args assigned ip address are not able to start anymore on mesos 1.9. During deployment I am able to ping these assigned ip addresses. Why can't this executor reach the task then? I guess something has changed in how the

How to clean up "Failed to find 'libprocess.pid' or 'http.marker'"

2019-09-28 Thread Marc Roos
W0929 00:45:10.676910 468993 process.cpp:1055] Failed SSL connections will be downgraded to a non-SSL socket W0929 00:45:10.901372 469057 state.cpp:657] Failed to find 'libprocess.pid' or 'http.marker' for container 8bf306d5-a10c-4787-9258-4198ea80bbec of executor W0929 00:45:10.902492

Task list node

2019-09-26 Thread Marc Roos
What would be the easiest way to list running tasks on a node/agent/slave?

BUG /tmp/mesos losing files add /usr/lib/tmpfiles.d/mesos.conf

2019-09-15 Thread Marc Roos
x86_64 -Original Message- From: Marc Roos Sent: Monday 19 August 2019 21:47 To: user Subject: "Failed to launch container" "No such file or directory" Some temp folders gone? How to resolve this? Failed to launch container: Failed to read man

RE: Please some help regression testing a task

2019-09-02 Thread Marc Roos
Marc Roos wrote: mesos-1.8.1-2.0.1.el7.x86_64 CentOS Linux release 7.6.1810 (Core) -Original Message- To: user Subject: Please some help regression testing a task I have a task

W0831 containerizer.cpp:2375] Ignoring update for unknown container

2019-08-31 Thread Marc Roos
Why do I get this message? How to resolve this? W0831 18:01:45.403295 2943686 containerizer.cpp:2375] Ignoring update for unknown container 48d9b77c-7348-4404-9845-211be74bad1d mesos-1.8.1-2.0.1.el7.x86_64

RE: Please some help regression testing a task

2019-08-31 Thread Marc Roos
mesos-1.8.1-2.0.1.el7.x86_64 CentOS Linux release 7.6.1810 (Core) -Original Message- To: user Subject: Please some help regression testing a task I have a task that underperforms. I am unable to discover what is causing it. Could this be something mesos specific? Performance

RE: Large container image failing to start 'first' time

2019-08-30 Thread Marc Roos
] Ignoring update for unknown container 680d3849-2b2a-4549-8842-8ef358599478 -Original Message- From: Qian Zhang [mailto:zhq527...@gmail.com] Sent: Wednesday 28 August 2019 15:07 To: Marc Roos Cc: user Subject: Re: Large container image failing to start 'first' time Can you please send

Converting vm to task (performance degraded)

2019-08-29 Thread Marc Roos
I am testing converting a nameserver vm to a task on mesos. If I query just one domain (so the results come from cache) for 30 seconds I can do around 450,000 queries on the vm, and only 17,000 on the task. When I look at top output on the host where the task is running I see this task only
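To put the two figures from this message side by side, a small Python sketch of the arithmetic. It assumes the counts are totals over the same 30 second run, about 450 thousand queries on the vm and about 17 thousand on the task, as stated in the message:

```python
DURATION_S = 30          # length of the test run in seconds
vm_queries = 450_000     # total queries answered by the vm
task_queries = 17_000    # total queries answered by the mesos task

vm_qps = vm_queries / DURATION_S        # queries per second on the vm
task_qps = task_queries / DURATION_S    # queries per second on the task
drop = 1 - task_queries / vm_queries    # relative loss of throughput

print(f"vm: {vm_qps:.0f} q/s, task: {task_qps:.0f} q/s, drop: {drop:.1%}")
# prints "vm: 15000 q/s, task: 567 q/s, drop: 96.2%"
```

For this particular cached-query test the task reaches under 4% of the vm throughput.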

RE: Large container image failing to start 'first' time

2019-08-28 Thread Marc Roos
ngs in agent logs when the container failed to start? Regards, Qian Zhang On Mon, Aug 19, 2019 at 10:46 PM Marc Roos wrote: I have a container image of around 800MB. I am not sure if that is a lot. But I have noticed it is probably too big for a default setup to

Exporting socket from container to host

2019-08-25 Thread Marc Roos
I was wondering if it is possible to export a socket of a container to the host, so I can then share it again with another container. (Without using pods, since I would like to scale these applications independently of each other.)

W0823 14:20:30.101281 2663193 containerizer.cpp:2375] Ignoring update for unknown container

2019-08-23 Thread Marc Roos
When scaling a task from 0 to 1 with two cni networks one of them having a gateway, I have quite a lot of failures Step 1: deploying Step 2: DHCPREQUEST and DHCPACK (fast) Step 3: Right after DHCPACK this error from the agent W0823 14:58:18.440388 2663180 containerizer.cpp:2375]

W0823 12:00:46. process.cpp:1453] Failed to link to '192.168.142.50:40746', connect: Failed connect: connection closed

2019-08-23 Thread Marc Roos
When scaling the task from 0 to 1, it takes sometimes quite a while for it to become active. Waiting maybe 10-20 seconds on the first waiting reported by marathon. Step 1: deploying (fast) Step 2: sometimes fast / sometimes 10-20 seconds Step 3: DHCPREQUEST and DHCPACK (fast) Step 4: Right

RE: Large container image failing to start 'first' time

2019-08-23 Thread Marc Roos
ogs when the container failed to start? Regards, Qian Zhang On Mon, Aug 19, 2019 at 10:46 PM Marc Roos wrote: I have a container image of around 800MB. I am not sure if that is a lot. But I have noticed it is probably too big for a default setup to get it to lau

RE: "Failed to launch container" "No such file or directory" /tmp files are being cleaned

2019-08-19 Thread Marc Roos
place to store docker layers? -Original Message- From: Marc Roos Sent: Monday 19 August 2019 21:47 To: user Subject: "Failed to launch container" "No such file or directory" Some temp folders gone? How to resolve this? Failed to launch container: Failed t

"Failed to launch container" "No such file or directory"

2019-08-19 Thread Marc Roos
Some temp folders gone? How to resolve this? Failed to launch container: Failed to read manifest from '/tmp/mesos/store/docker/layers/8c49e24d4aba93c77354143366e2427e0e2e7191cb85dbc1aa187e4e480021c1/json': No such file or directory

Large container image failing to start 'first' time

2019-08-19 Thread Marc Roos
I have a container image of around 800MB. I am not sure if that is a lot. But I have noticed it is probably too big for a default setup to get it to launch. I think the only reason it launches eventually is because data is cached and no timeout expires. The container will launch eventually

RE: Provisioning containers with configuration file via sandbox mount or copy via entrypoint.sh

2019-08-14 Thread Marc Roos
s the sandbox is considered read only, yet the stdout and stderr are located there??? I think the document <http://mesos.apache.org/documentation/latest/sandbox/#using-the-sandbox> means that sandbox is not expected to be touched by any 3rd party software or people other than Mesos, executor and tas

RE: Is chained cni networks supported in mesos 1.7

2019-08-14 Thread Marc Roos
-Original Message- To: user Subject: Re: Is chained cni networks supported in mesos 1.7 Are you interested in implementing the CNI chain support? -Gilbert On Wed, Jul 24, 2019 at 12:52 PM Marc Roos wrote: Hmm, I guess I should not get my hopes up this will be there soon

Should mesos 1.8 (and marathon 1.8) drain/migrate tasks or not?

2019-08-08 Thread Marc Roos
you have unanswered questions. On Wed, Aug 7, 2019 at 4:59 PM Marc Roos wrote: I seem to be able to add a maintenance schedule, and get also a report on '{"down_machines":[{"hostname":"m02.local"}]}' but I do not see tasks

RE: Draining: Failed to validate master::Call: Expecting 'type' to be present

2019-08-08 Thread Marc Roos
r::Call: Expecting 'type' to be present Please read the "maintenance primitives" section in this doc http://mesos.apache.org/documentation/latest/maintenance/ and let us know if you have unanswered questions. On Wed, Aug 7, 2019 at 4:59 PM Marc Roos wrote:

RE: Draining: Failed to validate master::Call: Expecting 'type' to be present

2019-08-08 Thread Marc Roos
: Expecting 'type' to be present Please read the "maintenance primitives" section in this doc http://mesos.apache.org/documentation/latest/maintenance/ and let us know if you have unanswered questions. On Wed, Aug 7, 2019 at 4:59 PM Marc Roos wrote: I seem to be able to add a m
