bhasudha commented on issue #1813: URL: https://github.com/apache/hudi/issues/1813#issuecomment-657990530
@jcunhafonte you don't need to clean up the Hudi table, because the Deltastreamer checkpoints the source offsets along with the Hudi metadata. When the job runs again, it picks up from where it left off last time. Regarding your second question about the schema provider in http://hudi.apache.org/blog/change-capture-using-aws/, there was no breaking change. @bvaradar could you share more context on what happens when the schema provider is not present, and on the implications of running the Deltastreamer in sync-once mode iteratively (NOT in continuous mode) without providing a schema provider?
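To illustrate the checkpointing idea, here is a rough conceptual sketch in Python. It is not Hudi code: the `Table` class, the `sync_once` function, and the `checkpoint` metadata key are all hypothetical names made up for this example. It only shows why storing the source offset in the same commit as the data lets a rerun resume without any cleanup.

```python
# Conceptual sketch only, NOT Hudi code. All names here are hypothetical.
from dataclasses import dataclass, field

CHECKPOINT_KEY = "checkpoint"  # hypothetical metadata key, not Hudi's actual key


@dataclass
class Table:
    # Each commit holds the written records plus metadata for that commit.
    commits: list = field(default_factory=list)

    def last_checkpoint(self):
        """Return the offset stored in the most recent commit, or None."""
        for commit in reversed(self.commits):
            if CHECKPOINT_KEY in commit["metadata"]:
                return commit["metadata"][CHECKPOINT_KEY]
        return None


def sync_once(source, table, batch_size=2):
    """Run one sync round: read from the last checkpoint, write one commit."""
    checkpoint = table.last_checkpoint()
    start = 0 if checkpoint is None else checkpoint
    batch = source[start:start + batch_size]
    if not batch:
        return 0  # nothing new in the source, so no commit is written
    # Data and the new offset go into the same commit, so they stay in step.
    table.commits.append(
        {"records": batch, "metadata": {CHECKPOINT_KEY: start + len(batch)}}
    )
    return len(batch)


source = ["a", "b", "c", "d", "e"]
table = Table()
counts = [sync_once(source, table) for _ in range(4)]
written = [r for c in table.commits for r in c["records"]]
```

In this toy model, each call to `sync_once` behaves like a separate job run: it reads only the records after the stored offset, so repeated runs neither reprocess old records nor require the table to be reset.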
