[ 
https://issues.apache.org/jira/browse/HUDI-8788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Geser Dugarov updated HUDI-8788:
--------------------------------
    Description: 
Currently, there is a lack of end-to-end tests combining Flink and Spark (Hive, 
etc.).

The most interesting scenario is streaming writes by Flink followed by reads
from Spark. We could also schedule compaction for a MOR table using Flink and
run the compaction itself with Spark.

To implement this, I suppose we need an additional module that holds all the
required Hudi, Flink, and Spark dependencies, where we can implement complex
test scenarios spanning multiple engines.

  was:
Currently, there is a lack of end-to-end tests combining Flink and Spark (Hive, 
etc.).

The most interesting scenario is streaming writes by Flink followed by reads
from Spark.

To implement this, I suppose we need an additional module that holds all the
required Hudi, Flink, and Spark dependencies, where we can implement complex
test scenarios spanning multiple engines.


> e2e test module
> ---------------
>
>                 Key: HUDI-8788
>                 URL: https://issues.apache.org/jira/browse/HUDI-8788
>             Project: Apache Hudi
>          Issue Type: Improvement
>            Reporter: Geser Dugarov
>            Assignee: Geser Dugarov
>            Priority: Minor
>             Fix For: 1.1.0
>
>
> Currently, there is a lack of end-to-end tests combining Flink and Spark 
> (Hive, etc.).
> The most interesting scenario is streaming writes by Flink followed by reads
> from Spark. We could also schedule compaction for a MOR table using Flink and
> run the compaction itself with Spark.
> To implement this, I suppose we need an additional module that holds all the
> required Hudi, Flink, and Spark dependencies, where we can implement complex
> test scenarios spanning multiple engines.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
