FeiZou edited a comment on issue #3418:
URL: https://github.com/apache/hudi/issues/3418#issuecomment-894495149


   @nsivabalan Sorry, I didn't make it clear. The source table we are migrating 
contains duplicates (one PK with several different timestamps), which is why we 
want to use `upsert` to remove the duplicates during the migration. I also have 
another question: should I use a partitioned or a non-partitioned table in this 
case? From reading the docs, if I want to update records within a partitioned 
table, I need to use `globalBloomIndex` instead of `bloomIndex`. I'm not sure 
how a partitioned table with `globalBloomIndex` will perform compared with a 
non-partitioned table with `bloomIndex`, given the total amount of data I have.
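
For reference, the two setups being compared would look roughly like the write options below. This is only a sketch: the column names `pk`, `ts`, and `dt` and the table name are placeholders rather than the real schema, and the index names are written as the `hoodie.index.type` values `GLOBAL_BLOOM` and `BLOOM`.

```python
# Sketch of the two candidate Hudi write configurations.
# Placeholder names (assumptions): pk = record key, ts = timestamp used to
# pick the latest duplicate, dt = partition column, migrated_table = table name.
common = {
    "hoodie.table.name": "migrated_table",
    "hoodie.datasource.write.operation": "upsert",
    "hoodie.datasource.write.recordkey.field": "pk",
    # On upsert, the record with the largest precombine value wins per key.
    "hoodie.datasource.write.precombine.field": "ts",
}

# Option A: partitioned table with a global bloom index
# (key lookup spans all partitions).
partitioned_global = {
    **common,
    "hoodie.datasource.write.partitionpath.field": "dt",
    "hoodie.index.type": "GLOBAL_BLOOM",
}

# Option B: non-partitioned table with the regular bloom index.
non_partitioned = {
    **common,
    "hoodie.datasource.write.partitionpath.field": "",
    "hoodie.datasource.write.keygenerator.class":
        "org.apache.hudi.keygen.NonpartitionedKeyGenerator",
    "hoodie.index.type": "BLOOM",
}

# Either dict would then be passed to the Spark writer, for example:
# df.write.format("hudi").options(**partitioned_global).mode("append").save(base_path)
```

Only the partition path, key generator, and index type differ between the two; the `upsert` operation and precombine field that handle the duplicates are the same in both.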


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

