Ciao,

How many containers are you running, and what is your system like?

I've something like a dozen of slaves a 2 / 3 containers per slave. I'm running on a Centos6 derived distribution (Scientific Linux CERN). On the specific slave I do not have any running container:

```
[root@cmsbuild11 ~]# docker ps -q  | wc
      0       0       0
```

but I do have a bunch of dead one:

```
[root@cmsbuild11 ~]# docker ps -qa  | wc
    999     999   12987
```

due to some runaway process.

By attaching via gdb to the docker daemon I get:

```
#0  0x00000000005b0ad4 in syscall.Syscall ()
#1 0x000000000084f91b in github.com/docker/docker/pkg/devicemapper.ioctlBlkDiscard ()
#2  0x0000000000000010 in ?? ()
#3  0x000000000000000b in ?? ()
#4  0x0000000000001277 in ?? ()
#5  0x00007f06d004e128 in ?? ()
#6  0x000000c209341e68 in ?? ()
#7  0x00007f06d004e140 in ?? ()
#8  0x0000000000000018 in ?? ()
#9  0x000000c209341e40 in ?? ()
#10 0x0000000000000000 in ?? ()
```

for a few of the running threads (the other ones are blocked in some futex). Notice I'm running on a CEPH volume.

--
Ciao,
Giulio


Also are you able to capture through perf or strace what docker rm is
blocked on?

Tim


On Mon, Mar 23, 2015 at 10:12 AM, Giulio Eulisse <[email protected]>
wrote:

I suspect my problem is that "docker rm" takes forever in my case. I'm not
running docker in docker though.


On 23 Mar 2015, at 18:01, haosdent wrote:

Are your issue relevant to this?
https://issues.apache.org/jira/browse/MESOS-2115

On Tue, Mar 24, 2015 at 12:52 AM, Giulio Eulisse <[email protected]>
wrote:

Hi,

I'm running using 0.20.1 and I seem to have troubles due to the fact a mesos slave is not able to recover the docker containers after a restart,
resulting in a very long wait.

Is this some known issue?

--
Ciao,
Giulio




--
Best Regards,
Haosdent Huang


Reply via email to