On 02/03/2026 12:31, Nilay Shroff wrote:

+#define MPATH_HEAD_DISK_LIVE             0
+
  struct mpath_head {
      struct srcu_struct    srcu;
      struct list_head    dev_list;    /* list of all mpath_devs */
@@ -17,12 +34,36 @@ struct mpath_head {
      struct kref        ref;
+    unsigned long        flags;
      struct mpath_device __rcu         *current_path[MAX_NUMNODES];
+    const struct mpath_head_template    *mpdt;
      void            *drvdata;
  };
Not sure why we don't have back reference to struct mpath_disk
from struct mpath_head here. Does it make sense to have this?

We can get away without it.

Some more background info .. so the concept of separate mpath_head and mpath_disk is driven by SCSI, which has scsi_device and scsi_disk classes. The scsi_disk driver (sd.c) controls the per-path gendisk and the mpath_disk, and these internals are hidden from the scsi_core (which controls the scsi_device). SCSI having this layered approach makes things more complicated. This is unlike NVMe, where the core driver controls the NS gendisk also.



+static inline struct mpath_disk *mpath_bd_device_to_disk(struct device *dev)
+{
+    return dev_get_drvdata(dev);
+}
+
+static inline struct mpath_disk *mpath_gendisk_to_disk(struct gendisk *disk)
+{
+    return mpath_bd_device_to_disk(disk_to_dev(disk));
+}
+
  int mpath_get_head(struct mpath_head *mpath_head);
  void mpath_put_head(struct mpath_head *mpath_head);
  struct mpath_head *mpath_alloc_head(void);
+void mpath_put_disk(struct mpath_disk *mpath_disk);
+void mpath_remove_disk(struct mpath_disk *mpath_disk);
+void mpath_unregister_disk(struct mpath_disk *mpath_disk);
+struct mpath_disk *mpath_alloc_head_disk(struct queue_limits *lim,
+            int numa_node);
+void mpath_device_set_live(struct mpath_disk *mpath_disk,
+            struct mpath_device *mpath_device);
+void mpath_unregister_disk(struct mpath_disk *mpath_disk);
+static inline bool is_mpath_head(struct gendisk *disk)
+{
+    return disk->fops == &mpath_ops;
+}
  #endif // _LIBMULTIPATH_H
diff --git a/lib/multipath.c b/lib/multipath.c
index 15c495675d729..88efb0ae16acb 100644
--- a/lib/multipath.c
+++ b/lib/multipath.c
@@ -32,6 +32,135 @@ void mpath_put_head(struct mpath_head *mpath_head)
  }
  EXPORT_SYMBOL_GPL(mpath_put_head);
+static void mpath_free_disk(struct kref *ref)
+{
+    struct mpath_disk *mpath_disk =
+        container_of(ref, struct mpath_disk, ref);
+    struct mpath_head *mpath_head = mpath_disk->mpath_head;
+
+    put_disk(mpath_disk->disk);
+    mpath_put_head(mpath_head);
+    kfree(mpath_disk);
+}
+

The mpath_alloc_head_disk() doesn't get a reference to the
mpath_head object but here while freeing mpath_disk we put
the reference to mpath_head. Would that create a reference
imbalance?

I think that what I done can be improved. If you check nvme_mpath_alloc_disk(), when we alloc the head the ref is 1, and then we rely on the disk release to release that head reference.

Yes we got a reference to mpath_head while
allocating it but then these are two (alloc mpath_disk and
alloc mpath_head) disjoint operations. In that case, can't
we have both mpath_disk and mpath_head allocated under one
libmultipath API?

I would like to have something simpler (like mainline NVMe code), but I have it this way because of SCSI, as above.

Thanks


Reply via email to