ad1happy2go commented on issue #8365: URL: https://github.com/apache/hudi/issues/8365#issuecomment-1494557334
@pravin1406 I did some experimentation on the same and here is my findings. It handles only updates in the log file and new insert goes to parquet file. Only updates - can see only log file Only inserts - can see only parquet file Updates or Inserts - can see both log and parquet file. So When we read the read optimised view it will read the parquet files. So we can see new inserts without existing rows being updated as they will just be coming from old parquet. So This can be reason for data inconsistency issues. Will confirm and if its a valid bug I will raise the JIRA for the same. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
