[ 
https://issues.apache.org/jira/browse/FLINK-19654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17214510#comment-17214510
 ] 

Robert Metzger commented on FLINK-19654:
----------------------------------------

Looking at the PyFlink Table test: it seems that the test has 2 parts.
1st: start a cluster and run a number of jobs.
2nd: start a Kerberos-secured(?) Flink-on-YARN-on-Docker cluster to run 2 jobs.

In one example, the 1st part took 10 minutes, the 2nd part 14 minutes.

I propose to remove the 2nd part from the PyFlink Table test: running Flink on 
a Kerberos-secured cluster is already tested separately.


Looking at the PyFlink DataStream test: it seems to repeat much of the same Python 
setup (which takes roughly 2 minutes).
Can't we run the DataStream test as part of the PyFlink Table test, 
since Python is already set up and a cluster is already running there?
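
As a back-of-the-envelope check of the two suggestions above, here is a small Python sketch. It uses only the figures from the single example run quoted in this comment (10 and 14 minutes) and the rough 2-minute setup. The function name is hypothetical, and the idea that merging the tests saves the entire setup time is an assumption made for illustration; real savings would need to be measured in CI.

```python
# Rough estimate of the time saved by the two proposals above.
# Figures come from one example run quoted in the comment, not from averages.
TABLE_PART1_MIN = 10   # part 1: start a cluster and run a number of jobs
TABLE_PART2_MIN = 14   # part 2: Kerberos-secured Flink on YARN on Docker, 2 jobs
PYTHON_SETUP_MIN = 2   # approximate Python setup repeated by the DataStream test


def estimate_savings(part1, part2, setup):
    """Return (table_before, table_after, total_saved), all in minutes."""
    table_before = part1 + part2
    table_after = part1           # proposal 1: drop the Kerberos part
    # Assumption: proposal 2 (merging the tests) saves the whole repeated setup.
    total_saved = part2 + setup
    return table_before, table_after, total_saved


before, after, saved = estimate_savings(
    TABLE_PART1_MIN, TABLE_PART2_MIN, PYTHON_SETUP_MIN
)
print(f"Table test: {before} min -> {after} min; total saved: ~{saved} min")
```

Under these assumptions the Table test would drop from 24 to 10 minutes in that example run, with roughly 16 minutes saved overall.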


> Improve the execution time of PyFlink end-to-end tests
> ------------------------------------------------------
>
>                 Key: FLINK-19654
>                 URL: https://issues.apache.org/jira/browse/FLINK-19654
>             Project: Flink
>          Issue Type: Bug
>          Components: API / Python, Tests
>    Affects Versions: 1.12.0
>            Reporter: Dian Fu
>            Priority: Major
>         Attachments: image (7).png
>
>
> Thanks to [~rmetzger] for sharing. Currently, the test durations of the 
> PyFlink end-to-end tests are as follows:
> ||test case||average execution-time||maximum execution-time||
> |PyFlink Table end-to-end test|1340s|1877s|
> |PyFlink DataStream end-to-end test|387s|575s|
> |Kubernetes PyFlink application test|606s|694s|
> We need to investigate how to improve them to reduce the execution time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
