[GitHub] [hudi] JoshuaZhuCN opened a new issue, #7452: [SUPPORT]SparkSQL can not read the latest data(snapshot mode) after write by flink

GitBox Tue, 13 Dec 2022 19:33:02 -0800


JoshuaZhuCN opened a new issue, #7452:
URL: https://github.com/apache/hudi/issues/7452


   Spark SQL cannot read the latest data (snapshot mode) after it has been written by Flink.
   
   **To Reproduce**
   
   Steps to reproduce the behavior:
   
   1. Start a Spark session first (e.g. spark-sql, spark-shell, or the Thrift server).
   2. Insert data into Hudi through the Flink stream API with Hive sync enabled.
   3. Upsert data into Hudi through the Flink stream API with Hive sync enabled.
   4. Query the table with Spark SQL; the query does not return the latest data from step 3.
   5. Execute `refresh table xxx` in Spark SQL.
   6. Query again with Spark SQL; the query now returns the latest data from step 3.
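
The symptom in steps 4 to 6 looks like a long-running session reusing a cached view of the table until it is refreshed. The sketch below is a toy model of that pattern only; it is not Spark or Hudi code. `ToySession`, `storage`, and the sample rows are hypothetical names. It also assumes the session read the table once before the upsert, which the report does not state.

```python
# Toy model only: NOT Spark or Hudi code. It mimics a session that caches
# a table snapshot on first read and reuses it until explicitly refreshed.

class ToySession:
    def __init__(self, storage):
        self.storage = storage   # shared dict: table name -> committed rows
        self._cache = {}         # per-session cached snapshot per table

    def query(self, table):
        # The first read caches the current snapshot; later reads reuse it.
        if table not in self._cache:
            self._cache[table] = list(self.storage[table])
        return self._cache[table]

    def refresh_table(self, table):
        # Analogous in spirit to `refresh table xxx`: drop the cached snapshot.
        self._cache.pop(table, None)


storage = {"xxx": []}
session = ToySession(storage)           # step 1: long-running session
storage["xxx"].append(("id1", "v1"))    # step 2: external writer inserts
first = session.query("xxx")            # assumed read that populates the cache
storage["xxx"] = [("id1", "v2")]        # step 3: external writer upserts
stale = session.query("xxx")            # step 4: cached snapshot is returned
session.refresh_table("xxx")            # step 5: invalidate the cache
fresh = session.query("xxx")            # step 6: latest data is visible
```

In this model the query in step 4 returns the pre-upsert row and only the query after the refresh sees the upsert, matching the reported behavior. Whether this is the actual cause in Spark 3.1.3 with Hudi 0.12.1 is not confirmed by the report.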
   
   **Expected behavior**
   
   Spark SQL snapshot queries should return the latest data written in step 3 without first having to execute `refresh table xxx`.
   
   **Environment Description**
   
   * Hudi version : 0.12.1
   
   * Spark version : 3.1.3
   
   * Hive version : 3.1.0
   
   * Hadoop version : 3.1.1
   
   * Storage (HDFS/S3/GCS..) : HDFS
   
   * Running on Docker? (yes/no) : no
   


