morristai commented on issue #5090:
URL: https://github.com/apache/opendal/issues/5090#issuecomment-3050998325

   Followed up on this issue — after some testing, I found that GDS over TCP/IP 
isn’t supported yet. I tested with both a remote S3 bucket and a local MinIO 
cluster. Using the latest `cudf-cu12 25.6.0` to read a remote S3 Parquet file 
(~1 GiB), it first reads the data into host RAM, and only after streaming the 
entire Parquet file does it copy the data into GPU VRAM. This means that even 
if we create a Kvikio Rust FFI binding, GDS for S3 services won’t work if the 
underlying library doesn’t support GDS over TCP/IP. However, `fs` should work 
though.
   
   ```python
   file_url = 
"s3://morris-testing-s3/00000-0-c5b97e11-c15c-4dcc-bab8-4f227a34a669-0-00001.parquet"
   df = cudf.read_parquet(file_url)
   ```
   
   
https://github.com/user-attachments/assets/4d910eaf-0da7-4c95-9d79-ec38c73c7b60


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@opendal.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to