Hi Arrow Devs,

I'm a maintainer of a project called cuDF 
(https://github.com/rapidsai/cudf.git) that is based on the Arrow columnar 
format and depends on the Arrow C++ and Python libraries. Currently, we are 
pinned to version `1.0.1`, but we've gotten feedback from the community that 
they'd really like us to upgrade to the latest version. We tried to upgrade to 
version `3.0.0` but ran into a major issue:


- GPU buffer validation fails in array and table creation code flows. Related 
JIRA is https://issues.apache.org/jira/browse/ARROW-11872 and subsequent PR 
fixing the issue is https://github.com/apache/arrow/pull/9640 and was merged a 
few days ago. Unfortunately, this effectively blocks most usage of Arrow 
libraries for CUDA projects.

- After this, we tried version `2.0.0` but hit a different issue of 
round-tripping of list of structs columns doesn't work properly in the parquet 
reader and writer. This is fixed in version `3.0.0`.

It seems like version `4.0.0` is tentatively scheduled for June 2021, which is 
unfortunately quite a bit away. Would it be possible to do a patch release in 
`3.x` that included the fix for GPU buffer validation? This would greatly help 
us upgrade to the latest arrow version and directly help users using both Arrow 
and cuDF.


Regards,
Prem

Reply via email to