vinothchandar commented on issue #1453: HUDI-644 kafka connect checkpoint 
provider
URL: https://github.com/apache/incubator-hudi/pull/1453#issuecomment-606735369
 
 
   @garyli1019 I am fine either way.. if this is going to be a utility for now, 
that's okay. but lets still clarify the naming to be 
`InitialCheckpointProvider`. 
   
   As for the big picture, 
   
   With the bootstrap work that is going on from @bvaradar & @umehrot2 , here 
is a future I think of. 
   
   - User writing data to S3 using another mechanism (sqoop, connect, ...) at 
`/old/dataset/path`
   - DeltaStreamer can support a `--bootstrap-from /old/dataset/path` and 
`--initial-checkpoint-provider SqoopCheckpointProvider.class`, then it will 
seamlessly perform the initial bootstrap, extract a checkpoint and keep 
incrementally ingesting from that.. 
   
   I was trying to see if we can jump ahead with the checkpoint flag now 
itself.. (its a matter of UX ? providing one time via --checkpoint vs the 
provider invoked by delta streamer itself).. 
   
   
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to