Re: Master test stability poor

Ufuk Celebi Wed, 27 Apr 2016 05:56:29 -0700

Filed an issue with INFRA: https://issues.apache.org/jira/browse/INFRA-11773


@Robert: I agree, but still we see failing builds over and over again.
At best it is annoying, at worst it "hides" new bugs being introduced.

On Wed, Apr 27, 2016 at 2:41 PM, Till Rohrmann <[email protected]> wrote:
> That is good to hear that we can so easily solve most of the failing
> builds. We should then iterate over the open test-stability issues to see
> whether they are still valid after we've merged PR 1915.
>
> On Wed, Apr 27, 2016 at 2:25 PM, Robert Metzger <[email protected]> wrote:
>
>> I'm not sure if the issues is as big as it seems on a first sight.
>> The reason why all the builds of master are red on travis is that the cache
>> of the 5th build is invalid. We have to ask infra to delete the caches and
>> then they'll be green again.
>>
>> On Wed, Apr 27, 2016 at 2:13 PM, Ufuk Celebi <[email protected]> wrote:
>>
>> > Along the lines of what Greg already mentioned, I would like to
>> > re-iterate that Travis is often a problem too:
>> > - long build times and we are reaching the time limit
>> > - unreliable I/O
>> > - unreliable resolving of build dependencies
>> >
>> > @Max: I think you wanted to look into whether we can use Apache's
>> > Jenkins server for our builds instead of Travis. Did you ever get
>> > around at looking into it? If yes: What's your opinion on replacing
>> > Travis with Jenkins? Is it a viable option? Would it improve the
>> > Travis-specific problems?
>> >
>> > On the other hand, the very slow Travis machines also helped
>> > discovering some hard-to-catch race conditions.
>> >
>> > – Ufuk
>> >
>> >
>> > On Wed, Apr 27, 2016 at 2:01 PM, Greg Hogan <[email protected]> wrote:
>> > > We have also started running over Travis' 2 hour limit for the longest
>> > build.
>> > >
>> > > Greg
>> > >
>> > >
>> > >> On Apr 27, 2016, at 7:53 AM, Ufuk Celebi <[email protected]> wrote:
>> > >>
>> > >> Hi Till,
>> > >>
>> > >> thank you for bringing this up. We really need to fix this.
>> > >>
>> > >> Filing JIRAs with critical priority was how we tried to solve it in
>> > >> the past, but obviously it did not work. There seems to be a mismatch
>> > >> between assigned and actual priorities.
>> > >>
>> > >> As a first step, I would volunteer to gather a list of tests, which
>> > >> have failed in the last weeks and make sure that we have JIRAs for
>> > >> them.
>> > >>
>> > >> As a next step, we should coordinate how to resolve those issues
>> > >> (maybe prioritized by failure frequency) to get master stable again.
>> > >>
>> > >> – Ufuk
>> > >>
>> > >>
>> > >>> On Wed, Apr 27, 2016 at 12:12 PM, Till Rohrmann <
>> [email protected]>
>> > wrote:
>> > >>> Hi Flink community,
>> > >>>
>> > >>> I just wanted to raise awareness that in the last 16 days there was
>> > just a
>> > >>> single Travis build of master which passed all tests. This indicates
>> > that
>> > >>> we have some serious problems with our test stability or even worse a
>> > >>> problem with the master itself. Having an unstable master makes it
>> > really
>> > >>> hard to assess whether new changes actually broke something or
>> whether
>> > the
>> > >>> failing test was unrelated.
>> > >>>
>> > >>> We have currently 37 open issues labeled with test-stability and most
>> > of
>> > >>> them have a critical priority. Therefore, I would propose that we try
>> > to
>> > >>> tackle them as soon as possible in order to improve our testing
>> > stability.
>> > >>>
>> > >>> Cheers,
>> > >>> Till
>> >
>>

Re: Master test stability poor

Reply via email to