pratyakshsharma commented on a change in pull request #1584:
URL: https://github.com/apache/incubator-hudi/pull/1584#discussion_r419151032
##########
File path:
hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/DeltaSync.java
##########
@@ -298,15 +300,15 @@ private void refreshTimeline() throws IOException {
// default to RowBasedSchemaProvider
schemaProvider = this.schemaProvider == null ||
this.schemaProvider.getTargetSchema() == null
? transformed.map(r -> (SchemaProvider) new
RowBasedSchemaProvider(r.schema())).orElse(
- dataAndCheckpoint.getSchemaProvider())
+ schemaProviderFromFetched)
: this.schemaProvider;
} else {
// Pull the data from the source & prepare the write
InputBatch<JavaRDD<GenericRecord>> dataAndCheckpoint =
formatAdapter.fetchNewDataInAvroFormat(resumeCheckpointStr,
cfg.sourceLimit);
avroRDDOptional = dataAndCheckpoint.getBatch();
checkpointStr = dataAndCheckpoint.getCheckpointForNextBatch();
- schemaProvider = dataAndCheckpoint.getSchemaProvider();
+ schemaProvider = avroRDDOptional.isPresent() ?
dataAndCheckpoint.getSchemaProvider() : null;
Review comment:
please refer to my other comment on the changes in SourceFormatAdapter.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]