Till Rohrmann created FLINK-9004:
------------------------------------

             Summary: Cluster test: Run general purpose job with failures with 
Yarn session
                 Key: FLINK-9004
                 URL: https://issues.apache.org/jira/browse/FLINK-9004
             Project: Flink
          Issue Type: Sub-task
          Components: Tests
    Affects Versions: 1.5.0
            Reporter: Till Rohrmann
             Fix For: 1.5.0


Similar to FLINK-8973, we should run the general purpose job (FLINK-8971) on a 
Yarn session cluster and simulate failures.

The job jar should be ill-packaged, meaning that we include too many 
dependencies in the user jar. We should include the Scala library, Hadoop and 
Flink itself to verify that there are no class loading issues.

The general purpose job should run with misbehavior activated. Additionally, we 
should simulate at least the following failure scenarios:
* Kill Flink processes
* Kill connection to storage system for checkpoints and jobs
* Simulate network partition

We should run the test at least with the following state backend: RocksDB 
incremental async and checkpointing to S3.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to