[ https://issues.apache.org/jira/browse/HUDI-8635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17937029#comment-17937029 ]
Lin Liu edited comment on HUDI-8635 at 3/20/25 7:28 AM:
--------------------------------------------------------
Stats calls made in FileGroupReaderBasedMergeHandle:
# setTotalLogSizeCompacted
# setPrevCommit
# setPrevBaseFile
# setPartitionPath
# setFileId
# setTotalLogReadTimeMs
# setTotalUpdatedRecordsCompacted
# setTotalLogFilesCompacted
# setTotalLogRecords
# setTotalLogBlocks
# setTotalCorruptLogBlock
# setTotalRollbackBlocks
# getRuntimeStats
Stats calls made in the general MergeHandle:
# setPrevCommit
# setPrevBaseFile
# setPartitionPath
# setFileId
# setTotalWriteBytes
# setFileSizeInBytes
# setNumWrites
# setNumDeletes
# setNumUpdateWrites
# setNumInserts
# setTotalWriteErrors
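
The two lists above can be compared mechanically to see which stats appear on only one side. The following is a minimal sketch, not Hudi code: the class name MergeHandleStatsDiff and its helper methods are hypothetical, and the method names are copied verbatim from the lists in this comment rather than read from the Hudi source, so the result only reflects what the comment records.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper for illustration only; it is not part of Apache Hudi.
class MergeHandleStatsDiff {

  // Copied from the FileGroupReaderBasedMergeHandle list in the comment.
  static final List<String> FILE_GROUP_READER_STATS = List.of(
      "setTotalLogSizeCompacted", "setPrevCommit", "setPrevBaseFile",
      "setPartitionPath", "setFileId", "setTotalLogReadTimeMs",
      "setTotalUpdatedRecordsCompacted", "setTotalLogFilesCompacted",
      "setTotalLogRecords", "setTotalLogBlocks", "setTotalCorruptLogBlock",
      "setTotalRollbackBlocks", "getRuntimeStats");

  // Copied from the general MergeHandle list in the comment.
  static final List<String> GENERAL_STATS = List.of(
      "setPrevCommit", "setPrevBaseFile", "setPartitionPath", "setFileId",
      "setTotalWriteBytes", "setFileSizeInBytes", "setNumWrites",
      "setNumDeletes", "setNumUpdateWrites", "setNumInserts",
      "setTotalWriteErrors");

  // Entries of 'left' that do not appear in 'right', in 'left' order.
  static Set<String> difference(List<String> left, List<String> right) {
    Set<String> result = new LinkedHashSet<>(left);
    result.removeAll(right);
    return result;
  }

  // Entries that appear in both lists, in 'left' order.
  static Set<String> intersection(List<String> left, List<String> right) {
    Set<String> result = new LinkedHashSet<>(left);
    result.retainAll(right);
    return result;
  }

  public static void main(String[] args) {
    System.out.println("Shared: "
        + intersection(GENERAL_STATS, FILE_GROUP_READER_STATS));
    System.out.println("Only in general MergeHandle: "
        + difference(GENERAL_STATS, FILE_GROUP_READER_STATS));
    System.out.println("Only in FileGroupReaderBasedMergeHandle: "
        + difference(FILE_GROUP_READER_STATS, GENERAL_STATS));
  }
}
```

Going by the lists as written, four entries are shared (setPrevCommit, setPrevBaseFile, setPartitionPath, setFileId), and the seven write-side entries from setTotalWriteBytes through setTotalWriteErrors appear only in the general MergeHandle list. Those seven are the likely candidates to check first when verifying that the write stats are intact.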
> Revisit stats generated in HoodieSparkFileGroupReaderBasedMergeHandle
> ---------------------------------------------------------------------
>
> Key: HUDI-8635
> URL: https://issues.apache.org/jira/browse/HUDI-8635
> Project: Apache Hudi
> Issue Type: Sub-task
> Reporter: Y Ethan Guo
> Assignee: Y Ethan Guo
> Priority: Blocker
> Fix For: 1.0.2
>
> Original Estimate: 2h
> Remaining Estimate: 2h
>
> We need to make sure the write stats generated by the new file group
> reader-based merge handle for compaction
> (HoodieSparkFileGroupReaderBasedMergeHandle) are intact in all cases.