gene-bordegaray opened a new pull request, #19963:
URL: https://github.com/apache/datafusion/pull/19963

   ## Which issue does this PR close?
   
   add avg byte size in column stats.  This is just a first PR so kept scope 
small. Will follow up with more propagation and adding more support for 
different types.
   
   this should help planning in CBO and for distributed. Mentioned in this 
point: "Output byte size estimation : rows * avg_row_width for network cost 
awareness (we discussed this offline, avg_row_width can be computed based on 
the row type and avg_len/avg_size for collections or length varying types)"


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to