stream2000 opened a new issue, #8500:
URL: https://github.com/apache/hudi/issues/8500

   **_Tips before filing an issue_**
   
   In hive sync, we get new partition events by getting written partitions 
since `lastCommitTimeSynced` field and will sync the newest completed instant 
time to hive after syncing partitions.  But in multi writer scenario, some 
instant before lastCommitTimeSynced may be completed after lastCommitTimeSynced 
have been synced to hive, and the written partitions in those instants will 
never be included in the future sync procedure. 
   
   My opinion is that we can only sync partitions before the oldest inflight 
instant, just like what we do in archive and incremental clean. 
   
   
   **Describe the problem you faced**
   
   A clear and concise description of the problem.
   
   **To Reproduce**
   
   Steps to reproduce the behavior:
   
   1.
   2.
   3.
   4.
   
   **Expected behavior**
   
   A clear and concise description of what you expected to happen.
   
   **Environment Description**
   
   * Hudi version :
   
   * Spark version :
   
   * Hive version :
   
   * Hadoop version :
   
   * Storage (HDFS/S3/GCS..) :
   
   * Running on Docker? (yes/no) :
   
   
   **Additional context**
   
   Add any other context about the problem here.
   
   **Stacktrace**
   
   ```Add the stacktrace of the error.```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to