[
https://issues.apache.org/jira/browse/HUDI-7489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-7489:
---------------------------------
Labels: pull-request-available (was: )
> Row writer clustering collects write statuses on the driver
> -----------------------------------------------------------
>
> Key: HUDI-7489
> URL: https://issues.apache.org/jira/browse/HUDI-7489
> Project: Apache Hudi
> Issue Type: Bug
> Components: clustering, spark
> Reporter: Jonathan Vexler
> Assignee: Jonathan Vexler
> Priority: Critical
> Labels: pull-request-available
>
> Row writer clustering collects write statuses on the driver which can be a
> very large amount of data. We Immediately parallelize the data again, so we
> can avoid this collect.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)