[jira] [Updated] (HUDI-5347) Fix Merge Into performance traps

ASF GitHub Bot (Jira) Wed, 07 Dec 2022 15:27:05 -0800


     [ 
https://issues.apache.org/jira/browse/HUDI-5347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


ASF GitHub Bot updated HUDI-5347:
---------------------------------
    Labels: performance pull-request-available  (was: performance)

> Fix Merge Into performance traps
> --------------------------------
>
>                 Key: HUDI-5347
>                 URL: https://issues.apache.org/jira/browse/HUDI-5347
>             Project: Apache Hudi
>          Issue Type: Bug
>          Components: spark-sql
>    Affects Versions: 0.12.1
>            Reporter: Alexey Kudinkin
>            Assignee: Alexey Kudinkin
>            Priority: Blocker
>              Labels: performance, pull-request-available
>             Fix For: 0.12.2
>
>
> Merge Into currently relies on the SparkSqlTypedRecord abstraction, which, for 
> *every* field of *every* record, performs a lookup in a cache keyed by Avro's 
> Schema. Each lookup incurs Schema.equals (which bears non-trivial overhead), 
> so 95% of the compute is wasted on it.
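
A minimal sketch of the kind of trap described above, not the actual Hudi code or the fix in the pull request. FakeSchema is a hypothetical stand-in for Avro's Schema with structural equality. The sketch assumes the record's schema is an equal but distinct instance from the cache key, so each HashMap lookup falls through the identity check and calls equals. It counts equals calls when the lookup happens per field, and when it is resolved once and reused.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class SchemaLookupSketch {

    // Hypothetical stand-in for an Avro Schema: equality is structural,
    // so every equals() call walks the field list.
    static final class FakeSchema {
        static long equalsCalls = 0;
        final List<String> fieldNames;

        FakeSchema(List<String> fieldNames) {
            this.fieldNames = new ArrayList<>(fieldNames);
        }

        @Override
        public boolean equals(Object o) {
            equalsCalls++;
            return o instanceof FakeSchema
                && fieldNames.equals(((FakeSchema) o).fieldNames);
        }

        @Override
        public int hashCode() {
            return fieldNames.hashCode();
        }
    }

    // Trap: one schema-keyed cache lookup for every field of every record.
    static long perFieldLookups(int records, List<String> fields) {
        Map<FakeSchema, String> cache = new HashMap<>();
        cache.put(new FakeSchema(fields), "converter");
        // Equal to the cached key, but a different instance (assumption).
        FakeSchema recordSchema = new FakeSchema(fields);

        FakeSchema.equalsCalls = 0;
        for (int r = 0; r < records; r++) {
            for (int f = 0; f < fields.size(); f++) {
                cache.get(recordSchema); // falls back to equals() each time
            }
        }
        return FakeSchema.equalsCalls;
    }

    // Alternative: resolve the cached value once, then reuse the reference.
    static long hoistedLookup(int records, List<String> fields) {
        Map<FakeSchema, String> cache = new HashMap<>();
        cache.put(new FakeSchema(fields), "converter");
        FakeSchema recordSchema = new FakeSchema(fields);

        FakeSchema.equalsCalls = 0;
        String converter = cache.get(recordSchema); // single lookup
        for (int r = 0; r < records; r++) {
            for (int f = 0; f < fields.size(); f++) {
                converter.length(); // stand-in for per-field work
            }
        }
        return FakeSchema.equalsCalls;
    }

    public static void main(String[] args) {
        List<String> fields = Arrays.asList("id", "name", "ts");
        System.out.println("per-field equals calls: " + perFieldLookups(100, fields));
        System.out.println("hoisted equals calls:   " + hoistedLookup(100, fields));
    }
}
```

In the per-field variant the number of equals calls grows with records times fields. In the hoisted variant it stays constant.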



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
