FeiZou commented on issue #3418:
URL: https://github.com/apache/hudi/issues/3418#issuecomment-894495149


   @nsivabalan Sorry, I didn't make it clear. The source table we are migrating 
contains duplicates, which is why we want to use `upsert` to remove them 
during the migration. I also have another question: should I use a partitioned 
or a non-partitioned table in this case? From reading the docs, it seems that 
if I want to update records within a partitioned table, I need to use 
`globalBloomIndex` instead of `bloomIndex`. I'm not sure how a partitioned table 
with `globalBloomIndex` will perform compared with a non-partitioned table with 
`bloomIndex`, given the total amount of data I have.
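
For concreteness, the two setups being compared could be written roughly as below. This is only a sketch: the table name `my_table` and the columns `id`, `ts` and `dt` are placeholders rather than the real schema, the helper `hudi_upsert_options` is hypothetical, and the index choice per layout simply mirrors the question above rather than a recommendation.

```python
# Sketch of the two Hudi write configurations being compared.
# "my_table", "id", "ts" and "dt" are placeholder names, not the real table.

def hudi_upsert_options(table_name, record_key, precombine_field, partition_field=None):
    """Build Hudi datasource options for an upsert.

    Uses GLOBAL_BLOOM when a partition field is given and BLOOM otherwise,
    matching the two alternatives in the question (an assumption, not a rule).
    """
    options = {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.recordkey.field": record_key,
        # When several rows share a record key, the row with the largest
        # precombine value is kept, which is what removes the duplicates.
        "hoodie.datasource.write.precombine.field": precombine_field,
    }
    if partition_field is not None:
        # Partitioned table: the index looks up keys across all partitions.
        options["hoodie.datasource.write.partitionpath.field"] = partition_field
        options["hoodie.index.type"] = "GLOBAL_BLOOM"
    else:
        # Non-partitioned table: everything lives under a single path.
        options["hoodie.datasource.write.partitionpath.field"] = ""
        options["hoodie.datasource.write.keygenerator.class"] = (
            "org.apache.hudi.keygen.NonpartitionedKeyGenerator"
        )
        options["hoodie.index.type"] = "BLOOM"
    return options


partitioned = hudi_upsert_options("my_table", "id", "ts", partition_field="dt")
non_partitioned = hudi_upsert_options("my_table", "id", "ts")
```

Either dictionary would then be passed to the Spark writer, e.g. `df.write.format("hudi").options(**partitioned).mode("append").save(base_path)`, where `df` and `base_path` are assumed to exist.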


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

