[
https://issues.apache.org/jira/browse/HUDI-1214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17182960#comment-17182960
]
Trevorzhang edited comment on HUDI-1214 at 8/24/20, 5:52 AM:
-------------------------------------------------------------
Hi [~vbalaji], I would like to claim this Jira if no one else is working on it.
was (Author: trevorzhang):
Hi Balaji Varadarajan, I would like to claim this Jira if no one else is working on it.
> Need ability to set deltastreamer checkpoints when doing Spark datasource
> writes
> --------------------------------------------------------------------------------
>
> Key: HUDI-1214
> URL: https://issues.apache.org/jira/browse/HUDI-1214
> Project: Apache Hudi
> Issue Type: Improvement
> Components: Spark Integration
> Reporter: Balaji Varadarajan
> Priority: Major
> Fix For: 0.6.1
>
>
> This support is needed for bootstrapping cases in which users perform the
> initial bootstrap with a Spark datasource write and subsequently switch to
> DeltaStreamer.
> DeltaStreamer manages checkpoints inside Hoodie commit files and expects to
> find a checkpoint in previously committed metadata. Users are expected to pass
> a checkpoint or an initial checkpoint provider when bootstrapping through
> DeltaStreamer. No such support exists when bootstrapping through the Spark
> datasource.
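
To illustrate the gap described above, here is a minimal sketch in plain Python. It is a toy model and does not use the Hudi API: each commit is represented only by its extra-metadata map, and the key name `deltastreamer.checkpoint.key` is an assumption about how DeltaStreamer labels its checkpoint. The helper `checkpoint_to_resume_from` is hypothetical.

```python
from typing import Dict, List, Optional

# Assumed name of the commit-metadata key that holds the DeltaStreamer checkpoint.
CHECKPOINT_KEY = "deltastreamer.checkpoint.key"


def checkpoint_to_resume_from(commits: List[Dict[str, str]]) -> Optional[str]:
    """Return the checkpoint stored in the latest commit, or None if absent.

    `commits` is ordered oldest to newest; each element stands in for the
    extra metadata of one commit (a simplification of real commit files).
    """
    if not commits:
        return None
    return commits[-1].get(CHECKPOINT_KEY)


# A table bootstrapped with a Spark datasource write: no checkpoint recorded.
spark_bootstrapped = [{"schema": "example-schema"}]

# A table whose latest commit carries a checkpoint in its metadata.
with_checkpoint = [{"schema": "example-schema"}, {CHECKPOINT_KEY: "topic,0:42"}]

print(checkpoint_to_resume_from(spark_bootstrapped))
print(checkpoint_to_resume_from(with_checkpoint))
```

In this model the first table gives DeltaStreamer nothing to resume from, which is the situation the issue asks to address by letting the Spark datasource write record a checkpoint in the commit metadata.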