gemini-code-assist[bot] commented on PR #96:
URL: https://github.com/apache/tvm-ffi/pull/96#issuecomment-3393641393
## Summary of Changes
This pull request significantly refactors the DLPack integration by
introducing a unified `DLPackExchangeAPI` struct. This change streamlines the
process of exchanging tensors and handling device streams between different
frameworks, particularly PyTorch, by consolidating related C functions into a
single, more manageable interface. It also adds support for non-owning DLTensor
conversions to improve efficiency.
### Highlights
* **Unified DLPack Exchange API**: A new `DLPackExchangeAPI` C struct has
been introduced to consolidate all DLPack-related function pointers (for
converting to/from PyObject, allocating tensors, and querying current work
streams) into a single, streamlined interface. This replaces the previous
method of passing individual function pointers.
* **Enhanced Stream Handling**: The new `DLPackExchangeAPI` now includes a
`current_work_stream` function pointer, which allows for more robust and
integrated querying of the current device stream (e.g., CUDA stream) during
DLPack tensor exchanges, particularly for non-CPU devices.
* **Non-Owning DLTensor Conversion**: A new `toDLPackNonOwningImpl` function
has been added to facilitate the creation of non-owning `DLTensor` instances
directly from PyTorch tensors. This can lead to performance improvements by
avoiding unnecessary reference counting overhead.
* **Refactored PyTorch Integration**: The Python FFI for PyTorch has been
updated to expose the new `DLPackExchangeAPI` struct via
`__c_dlpack_exchange_api__` on `torch.Tensor`, simplifying the integration and
interaction with DLPack functionalities.
* **DLPack Submodule Update**: The `3rdparty/dlpack` submodule has been
updated to a newer commit, which likely incorporates the necessary definitions
and changes for the new `DLPackExchangeAPI` structure.
<details>
<summary><b>Changelog</b></summary>
* **3rdparty/dlpack**
* Updated the DLPack submodule to a newer commit.
* **python/tvm_ffi/_optional_torch_c_dlpack.py**
* Introduced `toDLPackNonOwningImpl` for non-owning DLTensor conversion.
* Refactored individual DLPack C functions into a unified
`TorchDLPackExchangeAPI` struct.
* Added `TorchDLPackCurrentWorkStream` for querying device streams.
* Updated `torch.Tensor` to expose the new `__c_dlpack_exchange_api__`
instead of individual function pointers.
* **python/tvm_ffi/core.pyi**
* Updated type hints to reflect the change to
`__c_dlpack_exchange_api__`.
* **python/tvm_ffi/cython/base.pxi**
* Added `DLPACK_MAJOR_VERSION` and `DLPACK_MINOR_VERSION` macros.
* Defined the `DLPackExchangeAPI` struct and its associated function
pointer types.
* Updated `TVMFFIPyCallContext` and `TVMFFIPyArgSetter` to use the new
`const DLPackExchangeAPI* c_dlpack_exchange_api` member.
* **python/tvm_ffi/cython/function.pxi**
* Modified `make_ret` to accept the `DLPackExchangeAPI` pointer.
* Refactored `TVMFFIPyArgSetterDLPackCExporter_` to
`TVMFFIPyArgSetterDLPackExchangeAPI_` to utilize the new API struct.
* Introduced `GetTorchFallbackExchangeAPI` for handling fallback
scenarios.
* **python/tvm_ffi/cython/tensor.pxi**
* Refactored `DLTensorTestWrapper`'s DLPack functions into separate
`_dltensor_test_wrapper_from_pyobject`, `_dltensor_test_wrapper_to_pyobject`,
and `_dltensor_test_wrapper_current_work_stream` functions.
* Implemented `_dltensor_test_wrapper_get_exchange_api` to provide a
static `DLPackExchangeAPI` instance for testing purposes.
* Updated `DLTensorTestWrapper` to use `__c_dlpack_exchange_api__`.
* **python/tvm_ffi/cython/tvm_ffi_python_helpers.h**
* Updated `TVMFFIPyCallContext` and `TVMFFIPyArgSetter` to use a single
`const DLPackExchangeAPI*` member instead of individual function pointers.
* Modified `TVMFFIPyFuncCall` to correctly pass and receive the
`DLPackExchangeAPI` pointer.
* **tests/python/test_load_inline.py**
* Updated `test_load_inline_with_env_tensor_allocator` to check for the
presence of `__c_dlpack_exchange_api__`.
</details>
<details>
<summary><b>Activity</b></summary>
* Kathryn-cat addressed comments from tqchen and indicated that the PR is in
good shape, suggesting that unit tests be moved to a separate PR.
* tqchen provided several review comments, including suggestions for
refactoring context fields, clarifying static member initialization, moving
implementations to the `aten::` namespace, ensuring consistent naming, and
updating the DLPack API passing mechanism.
* Kathryn-cat updated the code multiple times in response to tqchen's
feedback.
* A bot (`gemini-code-assist[bot]`) flagged an overly broad device-type
condition in the CUDA stream handling and suggested restricting it to
`kDLCUDA` or `kDLROCM`; the suggestion was subsequently incorporated into the
code.
</details>
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
For additional commands, e-mail: [email protected]