paleolimbot commented on issue #7063:
URL: https://github.com/apache/arrow-rs/issues/7063#issuecomment-2794478219

   > not based on what is expedient for one particular downstream use-case
   
   Agreed! There are also a number of arrow-rs' own crates that make heavy use 
of the `DataType` and `ArrayRef` (e.g., the Parquet internals, casting). 
Another approach might be an `arrow-extensions` crate that includes an 
extension-aware `cast()`, CSV reader/writer (i.e., extension authors can choose 
how an array is represented in a CSV file), Parquet reader/writer (the issue 
here...how to propagate an extension type to Parquet and vice versa), 
extension-aware pretty printer, etc. All of these utilities are made use of by 
DataFusion but there is nothing strictly related to DataFusion about them and 
the work we do to (e.g.) support Variant, Geometry/Geography, or UUIDs doesn't 
have to only benefit DataFusion users.
   
   > what do you think about filing a ticket in DataFusion like "Better 
extension type support" where we can start listing the APIs that would need to 
be updated to Field instead of DataType?
   
   Yes, I'll continue my efforts there, too! In addition to requiring a `Field` 
a number of them also require a `Context` of some kind to look up 
extension-specific behaviour (e.g., whether a cast should or should not happen).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to