mbrobbel opened a new pull request, #5822:
URL: https://github.com/apache/arrow-rs/pull/5822

   # Rationale for this change
    
   It would be nice to better support reading and writing the canonical `uuid` 
extension type with the arrow and parquet crate i.e. mapping between the arrow 
extension type and the parquet logical `uuid` type.
   
   # What changes are included in this PR?
   
   This adds an `ExtensionType` enum for canonical extension types.
   
   # Are there any user-facing changes?
   
   Users can now annotate their logical types with extension types, and for 
`uuid` they are propagated via the arrow writer to map to the parquet `uuid` 
logical type.
   
   This needs tests and better docs, but I'd like to get some feedback on the 
approach first, because there are many different ways to implement this.
   
   I quickly tested this change with 
[narrow](https://github.com/mbrobbel/narrow/pull/183/files#diff-8868964a71a65db2aaef344fc98c97484058a8a9d46dc45d91a9d47313ffe100R21-R26)
 and those `uuid` fields (in the parquet file) are now picked up as `uuid` 
instead of `blob` by DuckDB.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to