sivabalan narayanan created HUDI-1570:
-----------------------------------------

             Summary: Add Avg record size in commit metadata
                 Key: HUDI-1570
                 URL: https://issues.apache.org/jira/browse/HUDI-1570
             Project: Apache Hudi
          Issue Type: Improvement
          Components: Utilities
            Reporter: sivabalan narayanan


Many users want to understand what would be their avg record size. As of now, 
there is no easy way to fetch record size for the end user. Even w/ hudi-cli, 
we could decipher from commit metadata, but we need to make some rough 
calculation. So, it would be better if we store the avg record size w/ 
WriteStats (total bytes written/ total records written) , as well as in commit 
metadata. So, in hudi_cli, we could expose this info along w/ "commit 
showpartitions" or expose another command "commit showmetadata" or something. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to