[ https://issues.apache.org/jira/browse/HUDI-9767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Voon Hou updated HUDI-9767: --------------------------- Description: Upstreaming a bunch of Trino improvements to Hudi-Trino and they are: # Increase Default MaxOutstandingSplits and SplitLoaderParallelism # HUDI-9525 Extend file system cache support to Hudi connector # Fix split generation parallelism for a non-partitioned table # [MINOR] Disable PR labeler # -Add parquet page skipping in Iceberg Connector- (Skipped as this is for testing purposes) # HUDI-9577 Make target result size configurable from endpoint and server # -Create pipeline to build arm image- (Skipped as this modifies github actions) # -[MINOR] Revert changes made to non-hudi modules- (Skipped as this is already upstreamed) # [MINOR] Added optimizations for HudiColumnStatsIndexSupport # [MINOR] Cleanup HudiSplitFactory to extend cache support # Fix flakiness when testing cache correctness # [Trino] Enable Metadata Table by default # [Trino] Fix flaky tests due to table stats computation lagging behind query execution # Implement Metadata table based Partition listing # Fix Case Sensitivity Issues Between Table and Catalog Schemas # [Trino] Workers should use latest commit time from table handle # Incorrect query results for Merge-On-Read (RT) tables when column stats are enabled Upstream starts at commit hash (inclusive): ea7f22d0371173a31be0c693a24fa00b7374fe0f Upstream ends at commit hash (inclusive): 5e3ebd5b4edd0623e041109cc769f0123bcbd4a7 was: Upstreaming a bunch of Trino improvements to Hudi-Trino and they are: # Increase Default MaxOutstandingSplits and SplitLoaderParallelism # [HUDI-9525] Extend file system cache support to Hudi connector # Fix split generation parallelism for a non-partitioned table # [MINOR] Disable PR labeler # -Add parquet page skipping in Iceberg Connector- (Skipped as this is for testing purposes) # [HUDI-9577] Make target result size configurable from endpoint and server # -Create pipeline to build arm image- (Skipped as this modifies github actions) # -[MINOR] Revert changes made to non-hudi modules- (Skipped as this is already upstreamed) # [MINOR] Added optimizations for HudiColumnStatsIndexSupport # [MINOR] Cleanup HudiSplitFactory to extend cache support # Fix flakiness when testing cache correctness # [Trino] Enable Metadata Table by default # [Trino] Fix flaky tests due to table stats computation lagging behind query execution # Implement Metadata table based Partition listing # Fix Case Sensitivity Issues Between Table and Catalog Schemas # [Trino] Workers should use latest commit time from table handle # Incorrect query results for Merge-On-Read (RT) tables when column stats are enabled Upstream starts at commit hash (inclusive): ea7f22d0371173a31be0c693a24fa00b7374fe0f Upstream ends at commit hash (inclusive): 5e3ebd5b4edd0623e041109cc769f0123bcbd4a7 > Upstream Trino Improvements to Hudi-Trino > ----------------------------------------- > > Key: HUDI-9767 > URL: https://issues.apache.org/jira/browse/HUDI-9767 > Project: Apache Hudi > Issue Type: Task > Reporter: Voon Hou > Assignee: Voon Hou > Priority: Major > > Upstreaming a bunch of Trino improvements to Hudi-Trino and they are: > > # Increase Default MaxOutstandingSplits and SplitLoaderParallelism > # HUDI-9525 Extend file system cache support to Hudi connector > # Fix split generation parallelism for a non-partitioned table > # [MINOR] Disable PR labeler > # -Add parquet page skipping in Iceberg Connector- (Skipped as this is for > testing purposes) > # HUDI-9577 Make target result size configurable from endpoint and server > # -Create pipeline to build arm image- (Skipped as this modifies github > actions) > # -[MINOR] Revert changes made to non-hudi modules- (Skipped as this is > already upstreamed) > # [MINOR] Added optimizations for HudiColumnStatsIndexSupport > # [MINOR] Cleanup HudiSplitFactory to extend cache support > # Fix flakiness when testing cache correctness > # [Trino] Enable Metadata Table by default > # [Trino] Fix flaky tests due to table stats computation lagging behind > query execution > # Implement Metadata table based Partition listing > # Fix Case Sensitivity Issues Between Table and Catalog Schemas > # [Trino] Workers should use latest commit time from table handle > # Incorrect query results for Merge-On-Read (RT) tables when column stats > are enabled > > Upstream starts at commit hash (inclusive): > ea7f22d0371173a31be0c693a24fa00b7374fe0f > Upstream ends at commit hash (inclusive): > 5e3ebd5b4edd0623e041109cc769f0123bcbd4a7 -- This message was sent by Atlassian Jira (v8.20.10#820010)