[
https://issues.apache.org/jira/browse/HUDI-9043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17936425#comment-17936425
]
Geser Dugarov commented on HUDI-9043:
-------------------------------------
{color:#000000}`RowDataStreamWriteFunction{color}::deduplicateRecordsIfNeeded`
should be completed first now, and then we could check costs.
> Analyze possibility to optimize `FlinkWriteHelper::deduplicateRecords`
> ----------------------------------------------------------------------
>
> Key: HUDI-9043
> URL: https://issues.apache.org/jira/browse/HUDI-9043
> Project: Apache Hudi
> Issue Type: Task
> Reporter: Geser Dugarov
> Assignee: Geser Dugarov
> Priority: Major
>
> `FlinkWriteHelper::deduplicateRecords` looks like too costly.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)