Hi Stephen,

Thank you for your feedback on the patch-set. I have submitted v14 
incorporating the changes you suggested. The responses to your comments are 
inline.


Internal Use - Confidential
> -----Original Message-----
> From: Stephen Hemminger <[email protected]>
> Sent: Monday, May 18, 2026 10:14 AM
> To: Bathija, Pravin <[email protected]>
> Cc: [email protected]; [email protected];
> [email protected]; [email protected]
> Subject: Re: [PATCH v13 0/5] Support add/remove memory region and get-max-
> slots
>
>
> [EXTERNAL EMAIL]
>
> On Thu, 14 May 2026 02:01:52 +0000
> <[email protected]> wrote:
>
> > From: Pravin M Bathija <[email protected]>
> >
> > This is version v13 of the patchset and it incorporates the
> > recommendations made by Fengcheng Wen.
> >
> > Changes made to patch 3/5 and 4/5
> > * Relocated function remove_guest_pages from patch 3/5 to 4/5.
> > * Renamed VhostUserSingleMemReg to VhostUserMemRegMsg and
> memory_single
> >   to memreg.
> >
> > This implementation has been extensively tested by doing Read/Write
> > I/O from multiple instances of fio + libblkio (front-end) talking to
> > spdk/dpdk (back-end) based drives. Tested with qemu front-end talking
> > to dpdk testpmd (back-end) performing add/removal of memory regions.
> > Also tested post-copy live migration after doing add_memory_region.
> >
> > Version Log:
> > Version v13 (Current version): Incorporate code review suggestions
> > from Fengcheng Wen as described above.
> > Version v12: Incorporate code review suggestions from Maxime Coquelin
> > and ai-code-review.
> > Changes made to patch 3/5
> > Refactored async_dma_map() to delegate to async_dma_map_region(),
> > eliminating code duplication between the two functions.
> > Restored original comments in async_dma_map_region() explaining why
> > ENODEV and EINVAL errors are ignored (these were stripped in v10)
> > Reverted unnecessary changes to vhost_user_postcopy_register() --
> > removed the host_user_addr == 0 checks and reg_msg_index indirection
> > that were added in  v10, since this function is only called from
> > vhost_user_set_mem_table() where regions are always contiguous.
> >
> > Version v11: Incorporate code review suggestions from Stephen Hemminger.
> > Change made to patch 4/5
> > Fix incomplete cleanup in vhost_user_add_mem_reg() when
> > vhost_user_mmap_region() fails after the mmap succeeds (e.g.
> > add_guest_pages() realloc failure) realloc failure). The error path
> > now calls remove_guest_pages() and free_mem_region() to undo the
> > mapping and stale guest-page entries, preventing a leaked mmap and
> > slot reuse corruption. The plain close(fd) path is kept for pre-mmap 
> > failures.
> >
> > Version v10: Incorporate code review suggestions from Stephen Hemminger.
> > Change made to patch 4/5
> > Moved dev_invalidate_vrings after free_mem_region, array compaction,
> > and nregions decrement. This ensures translate_ring_addresses only
> > sees surviving memory regions, preventing vring pointers from
> > resolving into a region that is about to be unmapped.
> >
> > Version v9: Incorporate code review suggestions from Stephen Hemminger.
> > Changes made to patch 3/5
> > Restored max_guest_pages initial value to hardcoded 8 instead of
> > VHOST_MEMORY_MAX_NREGIONS, matching upstream semantics.
> > Changes made to patch 4/5
> > Added close(reg->fd) and reg->fd = -1 before goto close_msg_fds in the
> > mmap failure path to fix fd leak after fd was moved from ctx->fds[0].
> > Converted dev_invalidate_vrings from a plain function to a macro +
> > implementation function pair, accepting message ID as a parameter so
> > the static_assert reports the correct handler at each call site.
> > Updated dev_invalidate_vrings call in add_mem_reg to pass
> > VHOST_USER_ADD_MEM_REG as message ID.
> > Updated dev_invalidate_vrings call in rem_mem_reg to pass
> > VHOST_USER_REM_MEM_REG as message ID.
> >
> > Version v8:  Incorporate code review suggestions from Stephen Hemminger.
> > rewrite async_dma_map_region function to iterate guest pages by host
> > address range matching change function dev_invalidate_vrings to accept
> > a double pointer to propagate pointer updates new function
> > remove_guest_pages was added add_mem_reg error path was narrowed to
> > only clean up the single failed region instead of destroting all
> > existing regions
> >
> > Version v7: Incorporate code review suggestions from Maxime Coquelin.
> > Add debug messages to vhost_postcopy_register function.
> >
> > Version v6: Added the enablement of this feature as a final patch in
> > this patch-set and other code optimizations as suggested by Maxime
> > Coquelin.
> >
> > Version v5: removed the patch that increased the number of memory
> > regions from 8 to 128. This will be submitted as a separate feature at
> > a later point after incorporating additional optimizations. Also
> > includes code optimizations as suggested by Feng Cheng Wen.
> >
> > Version v4: code optimizations as suggested by Feng Cheng Wen.
> >
> > Version v3: code optimizations as suggested by Maxime Coquelin and
> > Thomas Monjalon.
> >
> > Version v2: code optimizations as suggested by Maxime Coquelin.
> >
> > Version v1: Initial patch set.
> >
> > Pravin M Bathija (5):
> >   vhost: add user to mailmap and define to vhost hdr
> >   vhost_user: header defines for add/rem mem region
> >   vhost_user: support function defines for back-end
> >   vhost_user: Function defs for add/rem mem regions
> >   vhost_user: enable configure memory slots
> >
> >  .mailmap               |   1 +
> >  lib/vhost/rte_vhost.h  |   4 +
> >  lib/vhost/vhost_user.c | 418
> > +++++++++++++++++++++++++++++++++++------
> >  lib/vhost/vhost_user.h |  10 +
> >  4 files changed, 371 insertions(+), 62 deletions(-)
> >
>
> Some useful AI feedback
>
> Review of [PATCH v13 0-5/5] vhost: configure memory slots support
> Author: Pravin M Bathija <[email protected]>
>
> This revision addresses essentially every correctness issue raised in the 
> v7-v12
> reviews:
>
>   - ctx->fds[0] = -1 ownership transfer is now done before mmap, and
>     the mmap-failure path closes reg->fd explicitly when mmap never
>     set reg->mmap_addr.
>   - _dev_invalidate_vrings now takes struct virtio_net **pdev and
>     writes back *pdev = dev at the end, so a numa_realloc inside
>     translate_ring_addresses propagates correctly. Both call sites
>     refresh "dev = *pdev;" afterwards.
>   - The dev_invalidate_vrings() macro now takes the message id and
>     uses static_assert(id ## _LOCK_ALL_QPS, ...), matching the
>     existing VHOST_USER_ASSERT_LOCK pattern. Works for both
>     VHOST_USER_ADD_MEM_REG and VHOST_USER_REM_MEM_REG call sites.
>   - Overlap check in vhost_user_add_mem_reg uses guest address
>     space (guest_user_addr, size / userspace_addr, memory_size),
>     no longer mmap_size.
>   - free_new_region undoes only the failed region: async DMA unmap,
>     remove_guest_pages, free_mem_region(reg), nregions--.
>   - async_dma_map_region iterates dev->nr_guest_pages and filters
>     by [reg_start, reg_end), eliminating the prior reg_size
>     underflow loop.
>   - The regions array is kept contiguous via memmove on REM_MEM_REG,
>     so existing iterators that walk mem->nregions remain correct.
>   - max_guest_pages is back to 8 in vhost_user_initialize_memory.
>
> One protocol-level issue remains worth raising.
>
>
> Patch 4/5 -- vhost_user: Function defs for add/rem mem regions
> --------------------------------------------------------------------
>
> Warning: ADD_MEM_REG does not send the host_user_addr reply
>
>   Per the vhost-user spec for VHOST_USER_ADD_MEM_REG, the back-end
>   is expected to reply with the same message format and the
>   userspace_addr field replaced by the host userspace address that
>   the region was mapped into. The handler returns
>   RTE_VHOST_MSG_RESULT_OK with no reply constructed, so the
>   dispatcher does not call send_vhost_reply().
>
>   For postcopy migration this matters in particular: the original
>   vhost_user_postcopy_register() does two things -- exchange the
>   host_user_addr with the front-end and wait for an ack, then
>   register the regions with userfaultfd. The patch only does the
>   userfaultfd registration via vhost_user_postcopy_region_register().
>   The in-code comment notes the payload-layout mismatch with
>   vhost_user_postcopy_register() but stops there.
>
>   Without the address reply, QEMU will not know the back-end's
>   mapping for regions added via ADD_MEM_REG, so the userfaultfd
>   handling on the QEMU side cannot resolve faults in those
>   regions. Postcopy migration combined with the
>   CONFIGURE_MEM_SLOTS feature will not work.
>
>   Suggested fix: construct a memreg-payload reply with
>   region->userspace_addr replaced by reg->host_user_addr and
>   return RTE_VHOST_MSG_RESULT_REPLY. At minimum, refuse
>   ADD_MEM_REG when dev->postcopy_listening is set, so that the
>   combination fails cleanly rather than silently mis-mapping.

Fixed, now constructs a memreg reply with host_user_addr and returns 
RTE_VHOST_MSG_RESULT_REPLY.

>
>
> Info: vhost_user_rem_mem_reg does not validate ctx->fd_num
>
>   The handler is registered with accepts_fd = true and does not
>   call validate_msg_fds(). The trailing close_msg_fds(ctx) cleans
>   up whatever fds were passed, so this is not a leak, but a
>   malformed message with an unexpected fd count is silently
>   accepted. The other accepts_fd handlers in this file validate
>   fd_num explicitly.

Fixed, added validate_msg_fds(dev, ctx, 0).

>
>
> Info: vhost_user_get_max_mem_slots cast is unnecessary
>
>   ctx->msg.payload.u64 = (uint64_t)max_mem_slots;
>
>   max_mem_slots is uint32_t and the assignment widens
>   automatically; the cast can be dropped. Minor.

Dropped the cast.

>
>
> Reviewed-by would be appropriate once the postcopy reply is addressed (or the
> combination is rejected). The rest of the series looks correct.

Added the Reviewed-by line in v14

Reply via email to