pftn commented on issue #8892:
URL: https://github.com/apache/hudi/issues/8892#issuecomment-1584058475

   > @pftn can you please help to verify if the data in these 2 parquets are 
the same?
   > 
   > 1. 
20220604/00000007-3477-401f-982e-e5ae38ca0e23_3-20-6_20230510170043301.parquet
   > 2. 
20220604/00000007-4bc1-4340-a9d8-330666a58244_5-20-6_20230511183601566.parquet
   > 
   > Do you still have the compaction plans that generated these 2 parquet 
files, it'll be extremely helpful if we can know the write token of the log 
files before compaction. Thanks!
   
   Each parquet has 1 row. Just the ts column's value is different.
   According to the PartialUpdateAvroPayload, the row in 
00000007-4bc1-4340-a9d8-330666a58244_5-20-6_20230511183601566.parquet should 
overwrite the row in 
00000007-3477-401f-982e-e5ae38ca0e23_3-20-6_20230510170043301.parquet.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to