gszadovszky commented on issue #42069:
URL: https://github.com/apache/arrow/issues/42069#issuecomment-2841345548

   > (...) could arrow just avoid dealing with shredding altogether and leave 
that up to the query engine?
   
   What does that mean exactly? Does Arrow want to avoid representing the shredded 
values altogether, or represent them as is (according to the variant specification)?
   For the first approach, a variant vector would be similar to a struct vector 
with varbinary fields for `metadata` and `value`. For the latter approach we 
would need an additional, arbitrarily typed `typed_value` field.
   I think handling `typed_value` makes handling the values quite complicated. 
For example, two different Parquet files might use different shredding schemas, 
so the two variant vectors would have different schemas as well. 
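
   To make the schema concern concrete, here is a rough sketch in plain Python. 
It only models field layouts as tuples; the type names are illustrative strings, 
not Arrow API calls, and the two shredding choices are hypothetical.

```python
# Plain-Python model of the two layouts discussed above.
# Type names are illustrative strings, not Arrow API calls.

def unshredded_variant():
    """First approach: a struct of two binary fields."""
    return (("metadata", "varbinary"), ("value", "varbinary"))


def shredded_variant(typed_value_type):
    """Second approach: the struct also exposes typed_value.

    typed_value_type depends on how the writer chose to shred the column.
    """
    return unshredded_variant() + (("typed_value", typed_value_type),)


# Hypothetical shredding choices made by two different Parquet writers.
file_a = shredded_variant("int64")
file_b = shredded_variant("string")

# Without typed_value, every file yields the same vector schema.
same_when_unshredded = unshredded_variant() == unshredded_variant()

# With typed_value, the vector schema differs from file to file.
same_when_shredded = file_a == file_b
```

   In this model, a reader that exposes `typed_value` would have to reconcile 
`file_a` and `file_b` before treating them as one column, while the unshredded 
layout is identical for both.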
   
   

