Re: [I] [SUPPORT] Why HUDI ConsistentBucketClusteringExecutionStrategy not supported by flink engine? [hudi]

via GitHub Tue, 16 Jul 2024 18:20:41 -0700


pursuit-wangpz commented on issue #11636:
URL: https://github.com/apache/hudi/issues/11636#issuecomment-2232120967


   > Because it's hard for Flink to support both compaction and clustering 
execution in the same pipeline, current Flink only supports the clustering plan 
generation for consistnet hashing, a separate clustering job is needed for 
execution.
   
   However, it seems that 
org.apache.hudi.sink.clustering.HoodieFlinkClusteringJob does not support 
ConsistentBucketClusteringExecutionStrategy, which can only be specified with 
the Spark engine using 
org.apache.hudi.client.clustering.run.strategy.SparkConsistentBucketClusteringExecutionStrategy.
 This operation implies that HUDI requires two engines to complete the 
Consistent Bucket operation: the Flink engine to generate the plan, and the 
Spark engine to execute the plan.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [I] [SUPPORT] Why HUDI ConsistentBucketClusteringExecutionStrategy not supported by flink engine? [hudi]

Reply via email to