alamb commented on issue #7424:
URL: https://github.com/apache/arrow-rs/issues/7424#issuecomment-2835994322

   > Thanks [@alamb](https://github.com/alamb) for the great writeup! I’ve 
sketched out a rough implementation and had two related questions:
   > 
   > * Reusing metadata: Do we want to support metadata reuse during the value 
creation phase within the same builder, or is the goal to allow sharing 
metadata across multiple Variant values after they are already created and in 
Arrow memory? (My current version only supports reuse during value creation 
within a single builder.)
   
   I think we should start with reuse within a single builder. 
   
   > * Key ordering: Since the spec allows for sorted_strings in the metadata, 
should the VariantBuilder take an option to control key sorting?(This refers to 
sorting keys in the metadata, and is separate from object-level sorted 
dictionaries.)
   
   Yes, I think having an option on the builder makes the most sense. I haven't 
fully thought through the interplay between creating sorted metadata and trying 
to reuse the metadata -- it seems like once the metadata fields have been 
added, we can't then add new field names without distrurbing existing values 
(the metadata field indexes would have changed)
   
   
   I think the design principles of the arrow-rs crate are to provide high 
performance primitives and reasonable defaults, and allow it to the user to 
specify / control things at a lower level of performance
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to