vinothchandar commented on issue #1453: HUDI-644 kafka connect checkpoint provider
URL: https://github.com/apache/incubator-hudi/pull/1453#issuecomment-606735369

@garyli1019 I am fine either way. If this is going to be a utility for now, that's okay, but let's still clarify the naming to be `InitialCheckpointProvider`.

As for the big picture, with the bootstrap work that is going on from @bvaradar & @umehrot2, here is a future I think of.

- User writing data to S3 using another mechanism (sqoop, connect, ...) at `/old/dataset/path`
- DeltaStreamer can support a `--bootstrap-from /old/dataset/path` and `--initial-checkpoint-provider SqoopCheckpointProvider.class`, then it will seamlessly perform the initial bootstrap, extract a checkpoint and keep incrementally ingesting from that.

I was trying to see if we can jump ahead with the checkpoint flag now itself (it's a matter of UX? Providing the checkpoint one time via `--checkpoint` vs. the provider being invoked by DeltaStreamer itself).
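
To make the naming suggestion concrete, here is a rough sketch of what an `InitialCheckpointProvider` could look like. Everything in it is an assumption for illustration, not the API in this PR: the single `getCheckpoint()` method, the `OffsetMapCheckpointProvider` class, and the `topic,partition:offset,...` checkpoint string layout are all hypothetical.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical interface: the name comes from the comment above,
// but the method signature is an assumption, not the actual Hudi API.
interface InitialCheckpointProvider {
    /** Returns the checkpoint string from which incremental ingestion should resume. */
    String getCheckpoint();
}

// Hypothetical provider that derives a checkpoint from per-partition offsets
// that some other writer (e.g. a connect job) has already committed.
final class OffsetMapCheckpointProvider implements InitialCheckpointProvider {
    private final String topic;
    private final Map<Integer, Long> offsets;

    OffsetMapCheckpointProvider(String topic, Map<Integer, Long> offsets) {
        this.topic = topic;
        // Copy into a TreeMap so partitions are emitted in ascending order.
        this.offsets = new TreeMap<>(offsets);
    }

    @Override
    public String getCheckpoint() {
        // Assumed layout: "<topic>,<partition>:<offset>,<partition>:<offset>,..."
        StringBuilder sb = new StringBuilder(topic);
        for (Map.Entry<Integer, Long> e : offsets.entrySet()) {
            sb.append(',').append(e.getKey()).append(':').append(e.getValue());
        }
        return sb.toString();
    }
}
```

Under this sketch, the UX question reduces to who calls `getCheckpoint()`: the user, who runs it once as a utility and passes the result via `--checkpoint`, or DeltaStreamer itself after the bootstrap step.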
