tobixdev opened a new issue, #21144:
URL: https://github.com/apache/datafusion/issues/21144

   ### Is your feature request related to a problem or challenge?
   
   Now that it seems like we can merge 
https://github.com/apache/datafusion/pull/20312 soon, we should implement the 
full-range of Arrow's canonical extension types. Currently, only UUID is 
supported in the PR.
   
   This issue tracks adding the remaining canonincal extension types:
   - [ ] Fixed shape tensor
   - [ ] Variable shape tensor
   - [ ] JSON
   - [x] UUID
   - [ ] Opaque
   - [ ] 8-bit Boolean
   - [ ] Parquet Variant
   - [ ] Timestamp With Offset
   
   ### Describe the solution you'd like
   
   Implement the `DFExtensionType` similar to UUID.
   
   The question that remains is how we implement pretty-printing for these 
types.
   - Do we try to pretty-print tensors?
   - Do we pretty-print JSON using newlines?
   - I guess Parquet Variant would benefit from a nice representation in 
tests/CLIs. @friendlymatthew maybe you have some 2 cents here?
   
   ### Describe alternatives you've considered
   
   We could implement the formatters within arrow-rs and just use them in 
DataFusion. But I am unsure where they best fit.
   
   Maybe starting in DataFusion and migrating them to arrow-rs sometimes in the 
future (depending on a use case) is a good choice.
   
   ### Additional context
   
   Some (maybe) related issues I've found:
   - https://github.com/apache/datafusion/issues/7845
   - https://github.com/apache/datafusion/issues/16116
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to