xuzifu666 opened a new issue, #10542:
URL: https://github.com/apache/hudi/issues/10542

   **_Tips before filing an issue_**
   
   - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)?
   
   - Join the mailing list to engage in conversations and get faster support at 
[email protected].
   
   - If you have triaged this as a bug, then file an 
[issue](https://issues.apache.org/jira/projects/HUDI/issues) directly.
   
   **Describe the problem you faced**
   
   When we use FlinkCDC into HUDI 0.14.0/0.14.1,would occur dataloss such as 
deleted data in mysql but can find the data in Hudi,or insert data to mysql but 
not find data in Hudi,more important  not produce any exception for us to dig 
cause root.
   sences:
   1. Flink write to Hudi without compaction
   2. Flink write to Hudi with compaction online
   3. Flink write to Hudi with compaction offline
   above 3 conditions all can occur the problem
   
   **To Reproduce**
   
   Steps to reproduce the behavior:
   
   1. Flink write to hudi with Billions data
   2. then check data with mysql
   
   **Expected behavior**
   
   all data should consistent with mysql
   
   **Environment Description**
   
   * Flink version : 1.13.2
   
   * Hudi version : 0.14.0/0.14.1
   
   * Spark version : 3.2.0
   
   * Hive version : 1.1.0
   
   * Hadoop version : 3.2.1
   
   * Storage (HDFS/S3/GCS..) : HDFS
   
   * Running on Docker? (yes/no) :no
   
   
   **Additional context**
   
   Add any other context about the problem here.
   
   **Stacktrace**
   
   ```Add the stacktrace of the error.```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to