sivabalan narayanan created HUDI-9067:
-----------------------------------------

             Summary: Clean action sometimes falls back to 
spark.default.parallelism set in the env
                 Key: HUDI-9067
                 URL: https://issues.apache.org/jira/browse/HUDI-9067
             Project: Apache Hudi
          Issue Type: Improvement
          Components: cleaning
            Reporter: sivabalan narayanan


Rarely, we notice there are 1024 (spark.default.parallelism) tasks spinning up 
for clean actions. 

 

for eg, if we try to ingest very small no of records say just 1, clean action 
executor spins up 1024 tasks even though the clean parallelism is set to small 
value. 

 

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to