Re: Review Request 52453: Add support for receiving min_consecutive_successes in health checker

Kai Huang Tue, 04 Oct 2016 00:10:06 -0700


> On Oct. 4, 2016, 6:36 a.m., Kai Huang wrote:
> > I noticed that AuroraBot says the test is green but some coverage are 
> > missing...
> > 
> > When I test the code locally, the code passes pants tests and style check 
> > in:
> >     ./build-support/jenkins/build.sh
> > 
> > it also passes the e2e tests in:
> >     ./src/test/sh/org/apache/aurora/e2e/test_end_to_end.sh
> > 
> > However, I noticed some abnormalies in individual tests:
> > 
> > The following test command is always green:
> >     ./pants test.pytest --options="-s -k test_thermos_executor" 
> > src/test/python/apache/aurora/executor::
> > 
> > However, the following test command with coverage flag turned-on is flaky 
> > for one test: test_health_check_ok.
> >     ./pants test.pytest --options="-s -k test_thermos_executor" 
> > --coverage=1 src/test/python/apache/aurora/executor::
> > 
> > This test is testing health check in a thermos executor with very small 
> > initial_interval(0.1 seconds) plus a fast StatusManager(Polling status 
> > every 10 milli_seconds). See 
> > https://github.com/apache/aurora/blob/master/src/test/python/apache/aurora/executor/test_thermos_executor.py#L216
> >  and 
> > https://github.com/apache/aurora/blob/master/src/test/python/apache/aurora/executor/test_thermos_executor.py#L458
> > 
> > This may explain why AuroraBot says green tests but coverage are missing.
> > 
> > I'm not sure if this is purely an artifact of the coverage flag or caused 
> > by my code change in the Aurora Executor. But I'm now concerned that the 
> > flakiness is not exposed by our pants test and e2e test.
> > 
> > My guess is that it might be caused by my code 
> > change(https://github.com/apache/aurora/blob/master/src/main/python/apache/aurora/executor/aurora_executor.py#L120),
> >  and exacerbated by the --coverage flag. 
> > 
> > I was wondering if we should just proceed and ignore the warning or be more 
> > precautious and revisit executor change?


Currently the test case is doing a health check with initial_interval_secs=0.1 
secs, interval_secs=0.1 secs. We can eliminate the flakiness by slightly 
increasing the initial_interval_secs in the test case (e.g. from 0.1 to 0.12 
sec).

But I believe that should be addressed in an separate review.


- Kai


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/52453/#review151303
-----------------------------------------------------------


On Oct. 4, 2016, 12:42 a.m., Kai Huang wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/52453/
> -----------------------------------------------------------
> 
> (Updated Oct. 4, 2016, 12:42 a.m.)
> 
> 
> Review request for Aurora, Joshua Cohen and Zameer Manji.
> 
> 
> Bugs: AURORA-894
>     https://issues.apache.org/jira/browse/AURORA-894
> 
> 
> Repository: aurora
> 
> 
> Description
> -------
> 
> - Add support for receiving a new HealthCheckConfig attribute 
> "min_consecutive_successes" in health checker.
> - Add an entry in release note that describes the health check driven update 
> feature.
> 
> This patch is related to https://reviews.apache.org/r/52094/, in which I 
> added a new configuration value "min_consecutive_successes" in 
> HealthCheckConfig.
> 
> 
> Diffs
> -----
> 
>   RELEASE-NOTES.md 49c03e85ae4c2e3ebc8af89e9ce41df9fd52d6cd 
>   src/main/python/apache/aurora/client/api/updater_util.py 
> c649316edb876565c92cc90c9f030e153c008924 
>   src/main/python/apache/aurora/executor/common/health_checker.py 
> 03fbffdc3862a94c2ba42c9b9e8f2be4094129b8 
> 
> Diff: https://reviews.apache.org/r/52453/diff/
> 
> 
> Testing
> -------
> 
> ./build-support/jenkins/build.sh
> 
> ./pants test.pytest src/test/python/apache/aurora/executor::
> 
> ./src/test/sh/org/apache/aurora/e2e/test_end_to_end.sh
> 
> 
> Thanks,
> 
> Kai Huang
> 
>

Re: Review Request 52453: Add support for receiving min_consecutive_successes in health checker

Reply via email to