FeiZou commented on issue #3418: URL: https://github.com/apache/hudi/issues/3418#issuecomment-894495149
@nsivabalan Sorry I didn't made it clear. The source table we are migrating contains duplicate that's why we want to use `upsert` to remove the duplicate during the migration. And I have another question that should I use partitioned or non-partitioned table in this case? From reading the docs, it's saying that if I want to update records within partitioned table, I need to use `globalBloomIndex` instead `bloomIndex`. Not sure how partitioned table with `globalBloomIndex` will perform comparing with non-partitioned table with `bloomIndex` base the total data I have? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
