On Mon, Apr 02, 2018 at 09:50:48PM +0200, Martin Wilck wrote:
> For "find_multipaths smart", check if a path is already in use
> before setting DM_MULTIPATH_DEVICE_PATH to 1 or 2 (and thus,
> SYSTEMD_READY=0). If we don't do this, a device which has already been
> mounted (e.g. during initrd processing) may be unmounted by systemd, causing
> havoc to the boot process.

I'm reviewing  v3 of this patch because I don't see patch 17/20 in your
emails from v4. Am I missing an email, or did it not get sent?


> 
> Signed-off-by: Martin Wilck <mwi...@suse.com>
> ---
>  multipath/main.c | 31 ++++++++++++++++++++++++++++++-
>  1 file changed, 30 insertions(+), 1 deletion(-)
> 
> diff --git a/multipath/main.c b/multipath/main.c
> index d09f117..392d5f0 100644
> --- a/multipath/main.c
> +++ b/multipath/main.c
> @@ -629,16 +629,45 @@ configure (struct config *conf, enum mpath_cmds cmd,
>  
>  
>       if (cmd == CMD_VALID_PATH) {
> +             struct path *pp;
> +             int fd;
> +
>               /* This only happens if find_multipaths and
>                * ignore_wwids is set.
>                * If there is currently a multipath device matching
>                * the refwwid, or there is more than one path matching
>                * the refwwid, then the path is valid */
> -             if (VECTOR_SIZE(curmp) != 0 || VECTOR_SIZE(pathvec) > 1)
> +             if (VECTOR_SIZE(curmp) != 0) {
> +                     r = 0;
> +                     goto print_valid;
> +             } else if (VECTOR_SIZE(pathvec) > 1)
>                       r = 0;
>               else
>                       /* Use r=2 as an indication for "maybe" */
>                       r = 2;
> +
> +             /*
> +              * If opening the path with O_EXCL fails, the path
> +              * is in use (e.g. mounted during initramfs processing).
> +              * We know that it's not used by dm-multipath.
> +              * We may not set SYSTEMD_READY=0 on such devices, it
> +              * might cause systemd to umount the device.
> +              * Use O_RDONLY, because udevd would trigger another
> +              * uevent for close-after-write.
> +              *
> +              * get_refwwid() above stores the path we examine in slot 0.
> +              */
> +             pp = VECTOR_SLOT(pathvec, 0);
> +             fd = open(udev_device_get_devnode(pp->udev),
> +                       O_RDONLY|O_EXCL);

I'm worried about this.  Since we can't be sure that is_failed_wwid()
will really tell us that multipathd has tried to multipath the device
and failed, it is totally possible to get a maybe after multipath has
turned the path device over to the rest of the system. If this is true,
then the exclusive open might race with something else that is trying to
use the device, and cause that to fail.  Or worse, it might win but have
the other process mount the file system on it, only to have multipath go
and claim the device, unmounting it. I still think that the only safe
course is to only do this grab when we know that it is safe, such as on
add events, or if we have already labelled this device as a maybe
device, and we are still waiting on it.

Of course, this means I would exlcude the whole second "if (cmd ==
CMD_VALID_PATH)" section in configure() unless we know that it is safe
to grab the device.  Otherwise, there is nothing to stop us from
claiming a device that is in use. Clearly that exclusive grab check is
racy at any time except on add events or when the device already is set
to SYSTEMD_READY=0.  I'm pretty sure that the coldplug add event after
the switchroot is safe, since nothing will be racing to grab the device
then. 

You've already agreed that it should be fine to allow multipathd to try
to create a multipath device on top of a non-claimed path, since we can
just claim it later by issuing a uevent.  I feel like this is just
another instance of that.  If this isn't a new path, where we have
excluded everyone else from using it, we can't suddenly claim it just
because a second path appears. However, if multipathd manages to create
a multipath device on top of it, then it will add the wwid to the wwids
file, and be able to claim it.  But otherwise, I don't think that the
exclusive grab is safe or reliable enough to allow us to simply do this
on any uevent.

I would add a new option to multipath, that works with -u, to tell it
that maybes are allowed. If find_multipaths == FIND_MULTIPATHS_SMART,
then it should not claim the device if it doesn't get positively claimed
in the first "if (cmd == CMD_VALID_PATH)" section of configure(). That
will save us from claiming devices that are already in use, and speed
the multipath -u calls up.
 

> +             if (fd >= 0)
> +                     close(fd);
> +             else {
> +                     condlog(3, "%s: path %s is in use: %s",
> +                             __func__, pp->dev,
> +                             strerror(errno));
> +                     r = 1;
> +             }
>               goto print_valid;
>       }
>  
> -- 
> 2.16.1

--
dm-devel mailing list
dm-devel@redhat.com
https://www.redhat.com/mailman/listinfo/dm-devel

Reply via email to