gabotechs commented on issue #19973:
URL: https://github.com/apache/datafusion/issues/19973#issuecomment-3872145576

   > I considered just using byte_size / num_rows because it's "the same thing" but this is not true. While this can be calculated on a file scan, it is a fragile value that can easily be corrupt
   
   Aren't all the challenges you are describing here also applicable to `avg_byte_size`?
   
   In other words: what would it mean if `avg_byte_size` is different from `byte_size / num_rows`?

