FW: How to configure a pre-existing slrp volume/disk

2020-08-25 Thread Marc Roos
 

On this dcos manual[1] there is only listed how to use a profile from an 
slrp. Any one know how to change this to a pre-existing (lvm) volume? 
(mesos example is also welcome ;)


cat > app2.json <> data/foo && cat data/foo && sleep 5000",
  "container": {
"docker": {
  "image": "alpine"
},
"type": "MESOS",
"volumes": [
  {
"containerPath": "data",
"mode": "RW",
"persistent": {
  "size": 100,
  "profileName": "fast",
  "type": "mount"
}
  }
]
  },
  "cpus": 0.1,
  "id": "/app-persistent-stable-good-profile-2",
  "instances": 1,
  "mem": 128,
  "residency": {
"taskLostBehavior": "WAIT_FOREVER",
"relaunchEscalationTimeoutSeconds": 3600
  },
  "unreachableStrategy": "disabled",
  "upgradeStrategy": {
"maximumOverCapacity": 0,
"minimumHealthCapacity": 0
  }
}
EOF


[1]
https://docs.d2iq.com/mesosphere/dcos/services/storage/1.0.0/tutorials/manage-local-disks/



RE: Suddenly all tasks gone, framework at completed, cannot start framework

2020-08-25 Thread Marc Roos


Thanks Tomek, Charles, I increased my MARATHON_FAILOVER_TIMEOUT from a 
day to a week. I almost cannot believe something happened yesterday that 
made everything go down today. However I have recently been testing with 
JAVA_OPTS to prevent oom's from the marathon tasks.




-Original Message-
From: Tomek Janiszewski [mailto:jani...@gmail.com] 
Sent: dinsdag 25 augustus 2020 16:55
To: user
Subject: Re: Suddenly all tasks gone, framework at completed, cannot 
start framework

See: https://stackoverflow.com/a/42544023/1387612

wt., 25 sie 2020 o 15:07 Marc Roos  
napisał(a):




Today all my tasks are down and framework marathon is at completed. 
Any 
idea how this can happen?



ed.cpp:520] Successfully authenticated with master 
master@192.168.10.151:5050
I0825 13:03:27.961248   108 sched.cpp:1188] Got error 'Framework 
has 
been removed'






Re: Suddenly all tasks gone, framework at completed, cannot start framework

2020-08-25 Thread Tomek Janiszewski
See: https://stackoverflow.com/a/42544023/1387612

wt., 25 sie 2020 o 15:07 Marc Roos  napisał(a):

>
>
> Today all my tasks are down and framework marathon is at completed. Any
> idea how this can happen?
>
>
>
> ed.cpp:520] Successfully authenticated with master
> master@192.168.10.151:5050
> I0825 13:03:27.961248   108 sched.cpp:1188] Got error 'Framework has
> been removed'
>
>


Re: Suddenly all tasks gone, framework at completed, cannot start framework -

2020-08-25 Thread Charles-François Natali
Marc,

Have you read
https://mesos.readthedocs.io/en/1.1.0/high-availability-framework-guide/ in
particular the section about the FrameworkInfo failover_timeout?

Cheers,

Charles



On Tue, 25 Aug 2020, 16:01 Marc Roos,  wrote:

>
>
>
> I assume this was because something happened with zookeeper, and it
> restarted loading the wrong configuration file without the quorum=1.
> Because I was testing with different zookeeper rpms (mesos rpm conf is
> not standard location)
>
> Question: Is this by design that all tasks are terminated when zookeeper
> is gone? Is there some timeout setting that allows tasks to run for a
> day without zookeeper
>
>
>
>
>
> -Original Message-
> To: user
> Subject: Suddenly all tasks gone, framework at completed, cannot start
> framework
>
>
>
> Today all my tasks are down and framework marathon is at completed. Any
> idea how this can happen?
>
>
>
> ed.cpp:520] Successfully authenticated with master
> master@192.168.10.151:5050
> I0825 13:03:27.961248   108 sched.cpp:1188] Got error 'Framework has
> been removed'
>
>
>
>


RE: Suddenly all tasks gone, framework at completed, cannot start framework -

2020-08-25 Thread Marc Roos




I assume this was because something happened with zookeeper, and it 
restarted loading the wrong configuration file without the quorum=1. 
Because I was testing with different zookeeper rpms (mesos rpm conf is 
not standard location)

Question: Is this by design that all tasks are terminated when zookeeper 
is gone? Is there some timeout setting that allows tasks to run for a 
day without zookeeper





-Original Message-
To: user
Subject: Suddenly all tasks gone, framework at completed, cannot start 
framework



Today all my tasks are down and framework marathon is at completed. Any 
idea how this can happen?



ed.cpp:520] Successfully authenticated with master 
master@192.168.10.151:5050
I0825 13:03:27.961248   108 sched.cpp:1188] Got error 'Framework has 
been removed'





Suddenly all tasks gone, framework at completed, cannot start framework

2020-08-25 Thread Marc Roos



Today all my tasks are down and framework marathon is at completed. Any 
idea how this can happen?



ed.cpp:520] Successfully authenticated with master 
master@192.168.10.151:5050
I0825 13:03:27.961248   108 sched.cpp:1188] Got error 'Framework has 
been removed'