[ 
https://issues.apache.org/jira/browse/HUDI-8661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Y Ethan Guo reassigned HUDI-8661:
---------------------------------

    Assignee: Y Ethan Guo

> Support log compaction in HoodieSparkFileGroupReaderBasedMergeHandle
> --------------------------------------------------------------------
>
>                 Key: HUDI-8661
>                 URL: https://issues.apache.org/jira/browse/HUDI-8661
>             Project: Apache Hudi
>          Issue Type: Sub-task
>            Reporter: Y Ethan Guo
>            Assignee: Y Ethan Guo
>            Priority: Blocker
>             Fix For: 1.0.1
>
>   Original Estimate: 8h
>          Time Spent: 1h
>  Remaining Estimate: 7h
>
> The new file group reader-based compaction in 
> HoodieSparkFileGroupReaderBasedMergeHandle has not supported log compaction 
> yet.  Right now it's disabled through the code below in HoodieCompactor.  
> When enabled, there are a few tests that fail: 
> TestHoodieMergeOnReadTable.testLogBlocksCountsAfterLogCompaction, 
> TestHoodieClientOnMergeOnReadStorage.testArchivalOnLogCompaction.
> {code:java}
> boolean useFileGroupReaderBasedCompaction = context.supportsFileGroupReader() 
>   // the engine needs to support fg reader first
>     && !metaClient.isMetadataTable()
>     && 
> config.getBooleanOrDefault(HoodieReaderConfig.FILE_GROUP_READER_ENABLED)
>     && operationType == WriteOperationType.COMPACT
>     && !hasBootstrapFile(operations)                                          
>   // bootstrap file read for fg reader is not ready
>     && StringUtils.isNullOrEmpty(config.getInternalSchema())                  
>   // schema evolution support for fg reader is not ready
>     && config.populateMetaFields();                                           
>   // Virtual key support by fg reader is not ready
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to