Acked-by: Eric Paris <epa...@redhat.com>

On Mon, 2015-09-21 at 11:45 -0700, Dave Hansen wrote:
> From: Dave Hansen <dave.han...@linux.intel.com>
> 
> There was a report that my patch:
> 
>       inotify: actually check for invalid bits in
> sys_inotify_add_watch()
> 
> broke CRIU.
> 
> The reason is that CRIU looks up raw flags in /proc/$pid/fdinfo/*
> to figure out how to rebuild inotify watches and then passes those
> flags directly back in to the inotify API.  One of those flags
> (FS_EVENT_ON_CHILD) is set in mark->mask, but is not part of the
> inotify API.  It is used inside the kernel to _implement_ inotify
> but it is not and has never been part of the API.
> 
> My patch above ensured that we only allow bits which are part of
> the API (IN_ALL_EVENTS).  This broke CRIU.
> 
> FS_EVENT_ON_CHILD is really internal to the kernel.  It is set
> _anyway_ on all inotify marks.  So, CRIU was really just trying
> to set a bit that was already set.
> 
> This patch hides that bit from fdinfo.  CRIU will not see the
> bit, not try to set it, and should work as before.  We should not
> have been exposing this bit in the first place, so this is a good
> patch independent of the CRIU problem.
> 
> Signed-off-by: Dave Hansen <dave.han...@linux.intel.com>
> Reported-by: Andrey Wagin <ava...@gmail.com>
> Cc: Andrew Morton <a...@linux-foundation.org>
> Cc: Cyrill Gorcunov <gorcu...@openvz.org>
> Cc: xe...@parallels.com
> Cc: Eric Paris <epa...@redhat.com>
> Cc: j...@johnmccutchan.com
> Cc: rl...@rlove.org
> Cc: linux-kernel@vger.kernel.org
> ---
> 
>  b/fs/notify/fdinfo.c |    9 ++++++++-
>  1 file changed, 8 insertions(+), 1 deletion(-)
> 
> diff -puN fs/notify/fdinfo.c~fdinfo-mask fs/notify/fdinfo.c
> --- a/fs/notify/fdinfo.c~fdinfo-mask  2015-09-21
> 10:24:01.031864268 -0700
> +++ b/fs/notify/fdinfo.c      2015-09-21 10:25:04.335723826 -0700
> @@ -82,9 +82,16 @@ static void inotify_fdinfo(struct seq_fi
>       inode_mark = container_of(mark, struct inotify_inode_mark,
> fsn_mark);
>       inode = igrab(mark->inode);
>       if (inode) {
> +             /*
> +              * IN_ALL_EVENTS represents all of the mask bits
> +              * that we expose to userspace.  There is at
> +              * least one bit (FS_EVENT_ON_CHILD) which is
> +              * used only internally to the kernel.
> +              */
> +             u32 mask = mark->mask & IN_ALL_EVENTS;
>               seq_printf(m, "inotify wd:%x ino:%lx sdev:%x mask:%x
> ignored_mask:%x ",
>                          inode_mark->wd, inode->i_ino, inode->i_sb
> ->s_dev,
> -                        mark->mask, mark->ignored_mask);
> +                        mask, mark->ignored_mask);
>               show_mark_fhandle(m, inode);
>               seq_putc(m, '\n');
>               iput(inode);
> _
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to