[
https://issues.apache.org/jira/browse/HUDI-3995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Alexey Kudinkin updated HUDI-3995:
----------------------------------
Description:
While investigating, perf hits in the Bulk Insert a few issues were found:
# NonPartitionedKeyGenerator
# HUDI-3993:
was:
there are some optimizations we could do on the row writer path for bulk
inserts by avoiding udf calls.
> Avoid converting to Avro in standard Key Generators
> ---------------------------------------------------
>
> Key: HUDI-3995
> URL: https://issues.apache.org/jira/browse/HUDI-3995
> Project: Apache Hudi
> Issue Type: Improvement
> Components: spark, writer-core
> Reporter: sivabalan narayanan
> Assignee: sivabalan narayanan
> Priority: Blocker
> Labels: pull-request-available
> Fix For: 0.12.0
>
> Original Estimate: 12h
> Remaining Estimate: 12h
>
> While investigating, perf hits in the Bulk Insert a few issues were found:
> # NonPartitionedKeyGenerator
> # HUDI-3993:
>
--
This message was sent by Atlassian Jira
(v8.20.7#820007)