Timothy Brown created HUDI-7237:
-----------------------------------
Summary: Minor Improvements to Schema Handling in Delta Sync
Key: HUDI-7237
URL: https://issues.apache.org/jira/browse/HUDI-7237
Project: Apache Hudi
Issue Type: Improvement
Reporter: Timothy Brown
There are two minor items that we have run into while running DeltaStreamer in
production.
1. The schema is fetched more often than it needs to be, which can put
unnecessary load on schema providers or increase file system reads.
2. SchemaProviders that return a null target schema on empty batches cause null
schema values to be written to commits, leading to unexpected issues later.
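
A minimal sketch of the two ideas above, in Python for brevity rather than Hudi's Java. Everything here is hypothetical and is not Hudi's API: `CachingSchemaProvider`, `schema_for_commit`, and the choice to fall back to the last committed schema are assumptions made for illustration. The actual change may handle both cases differently.

```python
from typing import Callable, Dict, Optional


class CachingSchemaProvider:
    """Hypothetical wrapper that fetches each schema at most once per sync round."""

    def __init__(self,
                 fetch_source: Callable[[], Optional[str]],
                 fetch_target: Callable[[], Optional[str]]) -> None:
        self._fetch_source = fetch_source
        self._fetch_target = fetch_target
        self._cache: Dict[str, Optional[str]] = {}

    def _get(self, key: str, fetch: Callable[[], Optional[str]]) -> Optional[str]:
        # Membership test (not a None check) so a null result is cached too
        # and does not trigger repeated fetches.
        if key not in self._cache:
            self._cache[key] = fetch()
        return self._cache[key]

    def source_schema(self) -> Optional[str]:
        return self._get("source", self._fetch_source)

    def target_schema(self) -> Optional[str]:
        return self._get("target", self._fetch_target)

    def refresh(self) -> None:
        # Call at the start of each sync round so schema changes are picked up.
        self._cache.clear()


def schema_for_commit(provider: CachingSchemaProvider,
                      last_committed_schema: Optional[str]) -> Optional[str]:
    """Assumed policy: never write a null schema if an earlier one is known."""
    target = provider.target_schema()
    return target if target is not None else last_committed_schema
```

With this shape, repeated calls within one sync round reach the underlying provider or file system only once, and an empty batch whose provider returns no target schema reuses the previously committed schema instead of recording null.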
--
This message was sent by Atlassian Jira
(v8.20.10#820010)