[ 
https://issues.apache.org/jira/browse/FLINK-19520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated FLINK-19520:
-----------------------------------
    Labels: pull-request-available  (was: )

> Add reliable test randomization for checkpointing
> -------------------------------------------------
>
>                 Key: FLINK-19520
>                 URL: https://issues.apache.org/jira/browse/FLINK-19520
>             Project: Flink
>          Issue Type: Test
>          Components: Runtime / Configuration
>    Affects Versions: 1.12.0
>            Reporter: Arvid Heise
>            Assignee: Arvid Heise
>            Priority: Major
>              Labels: pull-request-available
>
> With the larger refactoring of checkpoint alignment and the additional of 
> more unaligned checkpoint settings, it becomes increasingly important to 
> provide a large test coverage.
> Unfortunately, adding sufficient test cases in a test matrix appears to be 
> unrealistic: many of the encountered issues were subtle, sometimes caused by 
> race conditions or unusual test configurations and often only visible in e2e 
> tests.
> Hence, we like to rely on all existing Flink tests to provide a sufficient 
> coverage for checkpointing. However, as more and more options in unaligned 
> checkpoint are going to be implemented in this and the upcoming release, 
> running all Flink tests - especially e2e - in a test matrix is prohibitively 
> expensive, even for nightly builds.
> Thus, we want to introduce test randomization for all tests that do not use a 
> specific checkpointing mode. In a similar way, we switched from aligned 
> checkpoints by default in tests to unaligned checkpoint during the last 
> release cycle.
> To not burden the developers of other components too much, we set the 
> following requirements:
>  * Randomization should be seeded in a way that both builds on Azure 
> pipelines and local builds will result in the same settings to ease debugging 
> and ensure reproducibility.
>  * Randomized options should be shown in the test log.
>  * Execution order of test cases will not influence the randomization.
>  * Randomization is hidden, no change on any test is needed.
>  * Randomization only happens during local/remote test execution. User 
> deployments are not affected.
>  * Test developers are able to avoid randomization by explicitly providing 
> checkpoint configs.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to