garyli1019 commented on pull request #1602: URL: https://github.com/apache/incubator-hudi/pull/1602#issuecomment-626385758
Hello @bvaradar, I'd like your opinion on how to fix this issue, because changing how the record size is calculated would affect many places in the codebase. The issue is basically this: the `totalBytesWritten` value in the metadata includes the bloom filter, so `totalBytesWritten/totalRecordsWritten` overestimates the average record size when the number of records is small but the bloom filter is large.
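
To make the skew concrete, here is a minimal sketch with made-up numbers. The class name, method names, and byte counts are all hypothetical and are not taken from the Hudi codebase. It also assumes the bloom filter size is known separately, which may not be the case in the actual write path.

```java
// Illustrative only: names and numbers are hypothetical, not Hudi's API.
public class AvgRecordSizeSketch {

    // Naive estimate: every byte written is attributed to records,
    // including the bytes taken up by the bloom filter.
    static long naiveAvgRecordSize(long totalBytesWritten, long totalRecordsWritten) {
        return totalBytesWritten / totalRecordsWritten;
    }

    // Adjusted estimate: subtract the bloom filter bytes (assumed known here)
    // before dividing by the record count.
    static long adjustedAvgRecordSize(long totalBytesWritten, long bloomFilterBytes,
                                      long totalRecordsWritten) {
        return (totalBytesWritten - bloomFilterBytes) / totalRecordsWritten;
    }

    public static void main(String[] args) {
        long totalRecordsWritten = 100;       // small number of records
        long recordBytes = 100 * 200;         // assume 200 bytes per record
        long bloomFilterBytes = 400_000;      // assume a large bloom filter
        long totalBytesWritten = recordBytes + bloomFilterBytes;

        // Prints 4200: the bloom filter dominates the estimate.
        System.out.println(naiveAvgRecordSize(totalBytesWritten, totalRecordsWritten));
        // Prints 200: the per-record size assumed above.
        System.out.println(adjustedAvgRecordSize(totalBytesWritten, bloomFilterBytes,
                totalRecordsWritten));
    }
}
```

With these assumed values the naive ratio is 21 times the real per-record size. The gap shrinks as the record count grows, since the bloom filter bytes are then spread over more records.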
