[ 
https://issues.apache.org/jira/browse/MESOS-8286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16278923#comment-16278923
 ] 

James Peach commented on MESOS-8286:
------------------------------------

This happens because once the mount namespace is a child of a user namespace, 
it is considered unprivileged and the {{CL_UNPRIVILEGED}} flag is set on the 
mount. Once this flag is set, then a remount that changes the flags *must* 
preserve all the existing flags on the mount (see 
[do_remount|https://github.com/torvalds/linux/blob/master/fs/namespace.c#L2283].
 

When we bind mount a file from the host, the mount flags from the host 
filesystem are inherited to the new bind mount. For example:

{noformat}
/NetworkManager/resolv.conf 
/tmp/ExecutorType_UserNamespaceIsolatorTest_ROOT_USER_DockerTask_DefaultExecutor_Ea3QSF/provisioner/containers/2378c60f-d0ab-4144-8df5-46a2f2b5e9fe/containers/a1700239-518d-4908-a7b4-21deda36df8a/backends/overlay/rootfses/12c7b91f-5042-473d-bf38-30f4e9127e08/etc/resolv.conf
 rw,nosuid,nodev rw,mode=755 tmpfs tmpfs 
...
Failed to remount bind mount as readonly from '/etc/resolv.conf' to 
'/tmp/ExecutorType_UserNamespaceIsolatorTest_ROOT_USER_DockerTask_DefaultExecutor_Ea3QSF/provisioner/containers/2378c60f-d0ab-4144-8df5-46a2f2b5e9fe/containers/a1700239-518d-4908-a7b4-21deda36df8a/backends/overlay/rootfses/12c7b91f-5042-473d-bf38-30f4e9127e08/etc/resolv.conf':
 Operation not permitted
{noformat}

Updating the {{MS_RDONLY}} flag fails because {{MS_REMOUNT}} actually updates 
all the flags and we omitted the inherited {{MS_NOSUID|MS_NODEV}} flags.
 

> Making bind mounts readonly fails with user namespaces.
> -------------------------------------------------------
>
>                 Key: MESOS-8286
>                 URL: https://issues.apache.org/jira/browse/MESOS-8286
>             Project: Mesos
>          Issue Type: Improvement
>            Reporter: James Peach
>            Assignee: James Peach
>
> When user namespaces are in effect, the additional mounts performed by the 
> CNI isolator to bind host network files read-only  fail. The initial bind 
> mount succeeds, but the subsequent remount is failing. The reason for the 
> failure isn't clear to me - there are a number of kernel checks and I don't 
> know which one is failing yet.
> {noformat}
> ...
> [pid 15609] execve("/home/jpeach/src/mesos/build/src/mesos-containerizer", 
> ["/home/jpeach/src/mesos/build/src"..., "launch"], 0x7f74a001c450 /* 30 vars 
> */I1130 17:04:34.281958 15537 containerizer.cpp:2921] Transitioning the state 
> of container 
> 0a0fdd6b-9532-4010-913b-5e36cad6f666.c4b9a777-eb6c-4c4a-9c4c-5d39e23373eb 
> from PREPARING to ISOLATING
> ) = 0
> strace: Process 15610 attached
> [pid 15610] execve("/home/jpeach/src/mesos/build/src/mesos-containerizer", 
> ["mesos-containerizer", "network-cni-setup", "--bind_host_files=false", 
> "--bind_readonly=true", "--etc_hostname_path=/etc/hostnam"..., 
> "--etc_hosts_path=/etc/hosts", "--etc_resolv_conf=/etc/resolv.co"..., 
> "--help=false", "--pid=15609", "--rootfs=/tmp/ExecutorType_UserN"...], 
> 0x58f07f0 /* 24 vars */) = 0
> [pid 15610] mount(NULL, "/", NULL, MS_REC|MS_SLAVE, NULL) = 0
> [pid 15610] mount("/etc/resolv.conf", 
> "/tmp/ExecutorType_UserNamespaceIsolatorTest_ROOT_USER_DockerTask_DefaultExecutor_IMJpTh/provisioner/containers/0a0fdd6b-9532-4010-913b-5e36cad6f666/containers/c4b9a777-eb6c-4c4a-9c4c-5d39e23373eb/backends/overlay/rootfses/0aaba267-75e7-444a-9f3a-adb22adcf195/etc/resolv.conf",
>  NULL, MS_BIND, NULL) = 0
> [pid 15610] mount(NULL, 
> "/tmp/ExecutorType_UserNamespaceIsolatorTest_ROOT_USER_DockerTask_DefaultExecutor_IMJpTh/provisioner/containers/0a0fdd6b-9532-4010-913b-5e36cad6f666/containers/c4b9a777-eb6c-4c4a-9c4c-5d39e23373eb/backends/overlay/rootfses/0aaba267-75e7-444a-9f3a-adb22adcf195/etc/resolv.conf",
>  NULL, MS_RDONLY|MS_REMOUNT, NULL) = -1 EPERM (Operation not permitted)
> [pid 15610] +++ exited with 1 +++
> ...
> {noformat}
> Note that in this log I've experimentally modified the mount flags, but that 
> doesn't make any difference.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to