Brandon Scheller created HUDI-1146:
--------------------------------------
Summary: DeltaStreamer fails to start when No updated records +
schemaProvider not supplied
Key: HUDI-1146
URL: https://issues.apache.org/jira/browse/HUDI-1146
Project: Apache Hudi
Issue Type: Bug
Components: Hive Integration
Reporter: Brandon Scheller
DeltaStreamer issue (happens with both COW and MOR tables): restarting the
DeltaStreamer process crashes; that is, the second run does nothing.
Steps to reproduce:
1. Run a Hudi DeltaStreamer job in --continuous mode.
2. Run the same job again without deleting the output parquet files generated
   by step 1.
3. The second run crashes with the error below (it does not crash if the
   output parquet files are deleted first).
{{Caused by: org.apache.hudi.exception.HoodieException: Please provide a valid schema provider class!
    at org.apache.hudi.utilities.sources.InputBatch.getSchemaProvider(InputBatch.java:53)
    at org.apache.hudi.utilities.deltastreamer.DeltaSync.readFromSource(DeltaSync.java:312)
    at org.apache.hudi.utilities.deltastreamer.DeltaSync.syncOnce(DeltaSync.java:226)
    at org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer$DeltaSyncService.lambda$startService$0(HoodieDeltaStreamer.java:392)}}
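To illustrate the failure mode suggested by the summary and the trace, here is a minimal, self-contained model. It is not Hudi's source code: the class InputBatchSketch, its nested types, and the describe helper are hypothetical names chosen for this sketch, and IllegalStateException stands in for HoodieException. The assumption is that a second run finds no new records, so the batch carries no schema provider, and an unconditional getter then throws.

```java
import java.util.Optional;

public class InputBatchSketch {

    /** Hypothetical stand-in for a schema provider; not Hudi's actual class. */
    interface SchemaProvider {
        String getSourceSchema();
    }

    /** Simplified model of a batch read from a source. */
    static final class InputBatch<T> {
        private final Optional<T> batch;
        private final String checkpoint;
        private final SchemaProvider schemaProvider; // null when none was supplied

        InputBatch(Optional<T> batch, String checkpoint, SchemaProvider schemaProvider) {
            this.batch = batch;
            this.checkpoint = checkpoint;
            this.schemaProvider = schemaProvider;
        }

        Optional<T> getBatch() {
            return batch;
        }

        String getCheckpoint() {
            return checkpoint;
        }

        /** Throws whenever no provider is set, even if the batch itself is empty. */
        SchemaProvider getSchemaProvider() {
            if (schemaProvider == null) {
                // Message text copied from the reported stack trace.
                throw new IllegalStateException("Please provide a valid schema provider class!");
            }
            return schemaProvider;
        }
    }

    /** Calls the getter unconditionally, as the trace suggests the caller does. */
    static String describe(InputBatch<?> inputBatch) {
        try {
            inputBatch.getSchemaProvider();
            return "ok";
        } catch (IllegalStateException e) {
            return "failed: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Second run: no new records and no schema provider supplied.
        InputBatch<String> empty = new InputBatch<>(Optional.empty(), "ckpt-1", null);
        System.out.println(describe(empty));

        // A batch that carries a provider passes the same check.
        InputBatch<String> withProvider =
            new InputBatch<>(Optional.of("record"), "ckpt-2", () -> "schema");
        System.out.println(describe(withProvider));
    }
}
```

In this model, one way to avoid the crash would be to skip the schema lookup when getBatch() is empty; whether that is the right fix for DeltaSync is not established by this report.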