Hi,
I would suggest preparing a Maven profile to perform the Nexmark runs.
Then I can set up a job (seed/manual) in Jenkins to run this.
Regards
JB
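Such a profile could look roughly like the sketch below. This is only a hedged illustration: the profile id, the exec-maven-plugin wiring, the main class, and the arguments are assumptions that would need to be checked against the actual Nexmark module.

```xml
<!-- Hypothetical sketch of a Nexmark Maven profile; ids, class name,
     and arguments are assumptions, not the real configuration. -->
<profile>
  <id>nexmark-runs</id>
  <build>
    <plugins>
      <plugin>
        <groupId>org.codehaus.mojo</groupId>
        <artifactId>exec-maven-plugin</artifactId>
        <executions>
          <execution>
            <phase>verify</phase>
            <goals><goal>java</goal></goals>
            <configuration>
              <mainClass>org.apache.beam.sdk.nexmark.Main</mainClass>
              <arguments>
                <argument>--suite=SMOKE</argument>
                <argument>--runner=DirectRunner</argument>
              </arguments>
            </configuration>
          </execution>
        </executions>
      </plugin>
    </plugins>
  </build>
</profile>
```

A profile like this would let Jenkins trigger the runs with a single `mvn verify -Pnexmark-runs` invocation.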
On 15/03/2018 22:13, Etienne Chauchot wrote:
So what's next? Shall we schedule Nexmark runs and add a BigQuery sink for the
Nexmark output?
On Monday, March 12, 2018 at 10:30 +0100, Etienne Chauchot wrote:
Thanks everyone for your comments and support.
On Friday, March 9, 2018 at 21:28 +0000, Alan Myrvold wrote:
Great ideas. I want to see a daily signal for anything that could
prevent a release from happening, and precommits that are fast and
reliable for areas that are commonly broken by code changes.
We are now running the Java quickstarts daily on a cron schedule,
using the direct runner, Dataflow, and local Spark and Flink in
the beam_PostRelease_NightlySnapshot job; see
https://github.com/apache/beam/blob/master/release/build.gradle. This
should provide a good signal for the examples integration tests
against these runners.
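For reference, jobs like this are defined with the Jenkins Job DSL under .test-infra/jenkins in the Beam repo; a cron-triggered daily job boils down to a sketch like the following (the schedule and build step here are placeholders, not the real definition):

```groovy
// Sketch of a daily cron-triggered Jenkins job via Job DSL.
// The schedule and the build command are placeholders.
job('beam_PostRelease_NightlySnapshot') {
  triggers {
    cron('0 11 * * *') // once a day; the actual time is arbitrary
  }
  steps {
    shell('./gradlew :release:build') // placeholder for the real release tasks
  }
}
```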
As Kenn noted, the java_maveninstall job also runs lots of tests. It
would be good to be clearer and more intentional about which tests run
when, and to consider implementing additional "always up"
environments for use by the tests.
Having the Nexmark smoke tests run regularly, with their results stored
in a database, would really enhance our efforts, perhaps starting with
the direct runner for the performance tests.
Yes
What area would have the most immediate impact? Nexmark smoke tests?
Yes, IMHO the Nexmark smoke tests would have a great return on
investment. By just scheduling some of them (at first), we gain
deep confidence in the runners on real user pipelines. In the past,
Nexmark has allowed us to discover performance regressions before a
release, and also to discover some bugs in some runners. But please
note that, for this last ability, Nexmark is currently limited: it
only detects failures if an exception is thrown; there is no check of
the correctness of the output PCollection, because the aim was
performance testing and there is no point in adding a slow test for
correctness. Nevertheless, if we store the output size (as I suggested
in this thread), we can get a hint of a failure when the output size
differs from the previously stored output sizes.
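The output-size hint could be as simple as comparing each run against the mean of the stored history. A minimal sketch, assuming we have the stored sizes at hand (the helper below is hypothetical, not part of Nexmark):

```java
import java.util.List;

public class OutputSizeCheck {

  /**
   * Hypothetical helper: flags a run as suspicious when the current output
   * size deviates from the mean of previously stored sizes by more than
   * the given fractional tolerance.
   */
  public static boolean looksSuspicious(
      List<Long> storedSizes, long currentSize, double tolerance) {
    if (storedSizes.isEmpty()) {
      // No history yet: nothing to compare against.
      return false;
    }
    double mean =
        storedSizes.stream().mapToLong(Long::longValue).average().getAsDouble();
    // Deviation beyond the tolerance band around the historical mean
    // is a hint of a correctness problem, not a proof.
    return Math.abs(currentSize - mean) > tolerance * mean;
  }
}
```

This would only give a hint, of course: a query whose output legitimately changes size would need its history reset.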
Etienne
On Fri, Mar 9, 2018 at 12:57 PM Kenneth Knowles <k...@google.com> wrote:
On Fri, Mar 9, 2018 at 3:08 AM Etienne Chauchot <echauc...@apache.org> wrote:
Hi guys,
I was looking at the various Jenkins jobs and I wanted to submit a
proposal:
- ValidatesRunner tests: currently run at PostCommit for all the
runners. I think it is the quickest way to see
regressions, so keep it that way.
We've also toyed with precommit for runners where it is fast.
- Integration tests: AFAIK we only run the ones in the examples module,
and only on demand. What about running all the ITs (in
particular the IO ITs) as a daily cron job with the direct
runner? Please note that this will require some always-up
backend infrastructure.
I like this idea. We actually run more, but in postcommit. You can
see the goal here:
https://github.com/apache/beam/blob/master/.test-infra/jenkins/job_beam_PostCommit_Java_MavenInstall.groovy#L47
There's no infrastructure set up that I see. It is only the DirectRunner
and the DataflowRunner currently, as they are "always up". But local
Flink and Spark could be too. Do the ITs spin up local versions of what
they are connecting to?
If we have adequate resources, I also think ValidatesRunner on a
real cluster would add value, once we have cluster set-up / tear-down
automated or an "always up" cluster.
- Performance tests: what about running the Nexmark SMOKE test suite in
batch and streaming modes with all the runners on a
daily basis and storing the running times in an RRD database (to see
performance regressions)?
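For an RRD store, the daily runs could map onto one round-robin file per (runner, mode, query) combination. A purely illustrative rrdtool sketch, with file names, step, and retention as assumptions:

```shell
# Purely illustrative rrdtool sketch; file names, step, and retention
# are assumptions. One RRD per (runner, mode, query), sampled daily.
rrdtool create nexmark_direct_batch_query5.rrd \
  --step 86400 \
  DS:runtime_sec:GAUGE:172800:0:U \
  DS:output_size:GAUGE:172800:0:U \
  RRA:AVERAGE:0.5:1:365

# After each run, push the measured values (runtime, then output size):
rrdtool update nexmark_direct_batch_query5.rrd N:123.4:100000
```

Storing the output size alongside the runtime would also feed the correctness hint discussed earlier in the thread.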
I like this idea, too. I think we could do DirectRunner (and
probably local Flink) as postcommit without being too expensive.
Kenn
Please note that not all the
queries run in all the runners in all the modes right now. Also, we
have some streaming pipeline termination issues
(see https://issues.apache.org/jira/browse/BEAM-2847).
I know that Stephen Sisk used to work on these topics. I also talked
to the guys from Polidea, but as I understood it, they
mainly launch integration tests on the Dataflow runner.
WDYT?
Etienne