leesf commented on issue #2051: URL: https://github.com/apache/hudi/issues/2051#issuecomment-683280244
@bvaradar hi, I think the point @jiegzhan pointed out is reasonable, for insert operation, we should not update the existing records. Right now the behavior/result is different when setting different small file limit, when it is set to 0, the new inserts will not update the old records and write into a new file, but when it is set to other value such as 128M, the new inserts may update the old records lies in small file picked up the UpsertPartitioner. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
