[ 
https://issues.apache.org/jira/browse/HUDI-6864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-6864:
----------------------------
    Description: 
See HUDI-6863 and 
https://github.com/apache/hudi/pull/6802#issuecomment-1455802492

I think we need to make sure that the dedup parallelism is only applied to the 
dedup stage, not affecting subsequent stages, which may require better 
parallelism control by repartitioning with right parallelism before workload 
profiling.

> Auto-tune dedup parallelism without affecting write parallelism
> ---------------------------------------------------------------
>
>                 Key: HUDI-6864
>                 URL: https://issues.apache.org/jira/browse/HUDI-6864
>             Project: Apache Hudi
>          Issue Type: Improvement
>            Reporter: Ethan Guo
>            Priority: Major
>             Fix For: 0.14.1
>
>
> See HUDI-6863 and 
> https://github.com/apache/hudi/pull/6802#issuecomment-1455802492
> I think we need to make sure that the dedup parallelism is only applied to 
> the dedup stage, not affecting subsequent stages, which may require better 
> parallelism control by repartitioning with right parallelism before workload 
> profiling.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to