[ 
https://issues.apache.org/jira/browse/MESOS-6118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15570934#comment-15570934
 ] 

Jie Yu commented on MESOS-6118:
-------------------------------

commit ccc746a7d12cc524120a76aa49a0d69e7303608a
Author: Kevin Klues <klue...@gmail.com>
Date:   Wed Oct 12 22:33:56 2016 -0700

    Added special case when sorting hierarchically in MountInfoTable::read.
    
    It is legal to have entries in a `MountInfoTable` whose `entry.id` is
    the same as `entry.parent`. This can happen (for example), if a system
    boots from the network and then keeps the original `/` in RAM.
    However, to avoid cycles when walking the mount hierarchy, we should
    not treat these entries as children of their parent so we skip them.
    
    This commit adds functionality to handle this case.
    
    Review: https://reviews.apache.org/r/52596/

commit 70b227f7d5662c051d0e978e9e4bfec328854c57
Author: Kevin Klues <klue...@gmail.com>
Date:   Wed Oct 12 22:33:51 2016 -0700

    Added more detailed error message when failing in MountInfoTable::read.
    
    Review: https://reviews.apache.org/r/52597/

> Agent would crash with docker container tasks due to host mount table read.
> ---------------------------------------------------------------------------
>
>                 Key: MESOS-6118
>                 URL: https://issues.apache.org/jira/browse/MESOS-6118
>             Project: Mesos
>          Issue Type: Bug
>          Components: slave
>    Affects Versions: 1.0.1
>         Environment: Build: 2016-08-26 23:06:27 by centos
> Version: 1.0.1
> Git tag: 1.0.1
> Git SHA: 3611eb0b7eea8d144e9b2e840e0ba16f2f659ee3
> systemd version `219` detected
> Inializing systemd state
> Created systemd slice: `/run/systemd/system/mesos_executors.slice`
> Started systemd slice `mesos_executors.slice`
> Using isolation: posix/cpu,posix/mem,filesystem/posix,network/cni
>  Using /sys/fs/cgroup/freezer as the freezer hierarchy for the Linux launcher
> Linux ip-10-254-192-40 3.10.0-327.28.3.el7.x86_64 #1 SMP Thu Aug 18 19:05:49 
> UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
>            Reporter: Jamie Briant
>            Assignee: Kevin Klues
>            Priority: Blocker
>              Labels: linux, slave
>         Attachments: crashlogfull.log, cycle2.log, cycle3.log, cycle5.log, 
> cycle6.log, slave-crash.log
>
>
> I have a framework which schedules thousands of short running (a few seconds 
> to a few minutes) of tasks, over a period of several minutes. In 1.0.1, the 
> slave process will crash every few minutes (with systemd restarting it).
> Crash is:
> Sep 01 20:52:23 ip-10-254-192-99 mesos-slave: F0901 20:52:23.905678  1232 
> fs.cpp:140] Check failed: !visitedParents.contains(parentId)
> Sep 01 20:52:23 ip-10-254-192-99 mesos-slave: *** Check failure stack trace: 
> ***
> Version 1.0.0 works without this issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to