bvaradar commented on issue #1985: URL: https://github.com/apache/hudi/issues/1985#issuecomment-678044206
@piyushrl : The strategy would be to orchestrate this bootstrap and handoff in 3 steps:

1. Copy the earliest checkpoint from Kafka and save it, after ensuring that your upstream source is up to date through that checkpoint.
2. Perform spark.write() to Hudi.
3. Run DeltaStreamer in single-run mode and pass the checkpoint saved in step 1 via the --checkpoint command-line argument.
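As a rough illustration of the handoff in step 3, here is a minimal Python sketch that renders saved Kafka offsets as a checkpoint string and builds the matching DeltaStreamer arguments. It assumes the Kafka source checkpoint has the form `topic,partition:offset,...`; please verify this against your Hudi version. The helper names, topic, table name, and path are hypothetical, and the other options DeltaStreamer requires (source class, props file, table type, and so on) are omitted.

```python
from typing import Dict, List


def format_kafka_checkpoint(topic: str, offsets: Dict[int, int]) -> str:
    """Render saved offsets as 'topic,partition:offset,...'.

    Assumption: this is the checkpoint string format the Kafka source
    expects; confirm it for the Hudi version you run.
    """
    if not offsets:
        raise ValueError("at least one partition offset is required")
    # Sort by partition number so the string is deterministic.
    parts = [f"{partition}:{offset}" for partition, offset in sorted(offsets.items())]
    return ",".join([topic] + parts)


def deltastreamer_args(base_path: str, table: str, checkpoint: str) -> List[str]:
    """Build only the handoff-related arguments (hypothetical helper).

    Leaving out --continuous keeps DeltaStreamer in single-run mode.
    """
    return [
        "--target-base-path", base_path,
        "--target-table", table,
        "--checkpoint", checkpoint,
    ]


# Hypothetical offsets saved in step 1, before the spark.write() bootstrap.
saved_offsets = {0: 100, 1: 250}
checkpoint = format_kafka_checkpoint("events", saved_offsets)
args = deltastreamer_args("/data/hudi/events", "events", checkpoint)
print(" ".join(args))
```

The resulting arguments would be appended to the usual spark-submit invocation of DeltaStreamer, so that its first run resumes from the offsets saved in step 1 rather than from the beginning of the topic.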
