Alexey Kudinkin created HUDI-5347:
-------------------------------------

             Summary: Fix Merge Into performance traps
                 Key: HUDI-5347
                 URL: https://issues.apache.org/jira/browse/HUDI-5347
             Project: Apache Hudi
          Issue Type: Bug
          Components: spark-sql
    Affects Versions: 0.12.1
            Reporter: Alexey Kudinkin
            Assignee: Alexey Kudinkin
             Fix For: 0.12.2


Merge Into currently relies on SparkSqlTypedRecord abstraction doing for 
*every* field of *every* record lookup in the cache keyed by Avro's Schema 
incurring Schema.equals (bearing non-trivial overhead), leading to 95% of the 
compute being wasted on it.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to