RE: [PROPOSAL] Workflow for ignoring long failing tests

Jozef Marko Wed, 24 Jan 2024 20:51:46 -0800

Thank you for this discussion. I think disabling flaky tests in given way is 
reasonable.


I have one related question, once similar flaky test is disabled, and a ticket 
is reported. Who will be assigned to this ticket? Will it be the test author?

My point is, that usually only test author and few other engineers working on 
the same component understand completely the given test, but probably they do 
not check the codebase daily if the test is still enabled.

So, will be these people notified? Otherwise, most of these tickets will be 
probably closed after the proposed period without any test update/fix.

Jozef Marko
Software Developer
jozef.ma...@ibm.com

-----Original Message-----
From: Toni Rikkola <trikk...@redhat.com> 
Sent: Wednesday, January 24, 2024 3:58 PM
To: dev@kie.apache.org
Subject: [EXTERNAL] Re: [PROPOSAL] Workflow for ignoring long failing tests

The reason I mentioned this was that we had this same decision made 8 years ago 
and shared it to the entire team.
It stated that anyone can disable any flaky test and make a ticket for it.
It was even more strict, you just had to have one flaky failure.

So to me it looks like we are just stating the same thing again hoping for a 
different outcome.

But yes, to make it clear for everyone we should write it down and we can see 
how it goes and act on it later if needed.

Toni

On Wed, Jan 24, 2024 at 2:09 PM Tibor Zimányi <tzima...@apache.org> wrote:

> I think we don't need to enforce it or gatekeep this any way. The 
> thing is, if we have the agreement that we want to have this official 
> workflow, it should fix the problem you are mentioning Toni, that 
> people are afraid to do so. If it will be officially stated, that 
> failing tests can be disabled in the code, people should just do it. 
> They can always point to this potential agreement. The best thing 
> would be to even document it on Confluence. I will do that if we have an 
> agreement on this.
>
> Tibor
>
> Dňa st 24. 1. 2024, 12:40 Toni Rikkola <trikk...@redhat.com> napísal(a):
>
> > Not informing the author was my mistake on that PR. I should have 
> > asked
> the
> > original author to review it.
> >
> > But the problem is not that we lack this procedure. The KIE team 
> > agreed
> on
> > this in 2015. It is that maybe we lack the strength to do it, 
> > especially when it is someone else's area of code. This needs some 
> > sort of
> gatekeeper
> > or supervisor and time frames when the build quality and if these
> problems
> > were acted on are validated and measured.
> >
> > Meanwhile I offer myself to work on these and disable them, if 
> > anyone
> wants
> > to point me to some flaky test I can act on them. I could even make 
> > some system for measurement. We can then, or now, decide what we 
> > then do with that data.
> >
> > Toni
> >
> > On Tue, Jan 23, 2024 at 8:04 PM Tibor Zimányi <tzima...@apache.org>
> wrote:
> >
> > > I think the notification of the involved commiter could be done as 
> > > part
> > of
> > > creating of the issue to fix the tests. The person involved can be
> > assigned
> > > as an asignee.
> > >
> > > T.
> > >
> > > Dňa ut 23. 1. 2024, 18:50 Francisco Javier Tirado Sarti < 
> > > ftira...@redhat.com>
> > > napísal(a):
> > >
> > > > Ok, but I think we first need to establish a way (my apologies 
> > > > is
> that
> > > way
> > > > is already there) to notify  committerthat a test that they were
> > involved
> > > > with is failing
> > > > That way, we will avoid disable tests like this one 
> > > > INVALID URI REMOVED
> > > > apache_incubator-2Dkie-2Dkogito-2Dexamples_issues_1831&d=DwIFaQ&
> > > > c=jf_iaSHvJObTbx-siA1ZOg&r=Tvtr6trQuIEn20iSHtFwcpynBt8ViCybz4kYp
> > > > jmDDUQ&m=C6gAvciwgXz7uJX__01AVZ3YHxSotIhuktwbnBnNuTvnLRU-NHMRqOd
> > > > NENKhsqKe&s=8p3WYlpcXfcLl1n4BnrSpBWNIdPIUhslTPsb41eE4IM&e= ,
> > > which
> > > > are quite important to ensure Springboot messaging is working.
> > > >
> > > > On Tue, Jan 23, 2024 at 6:38 PM ricardo zanini fernandes < 
> > > > ricardozan...@gmail.com> wrote:
> > > >
> > > > > +1
> > > > >
> > > > > On Tue, Jan 23, 2024 at 1:03 PM Martin Cimbálek 
> > > > > <cim...@gmail.com>
> > > > wrote:
> > > > >
> > > > > > +1
> > > > > >
> > > > > > Regards,
> > > > > > Martin Cimbalek
> > > > > >
> > > > > > On Tue, Jan 23, 2024 at 3:36 PM Tibor Zimányi <
> tzima...@apache.org
> > >
> > > > > wrote:
> > > > > >
> > > > > > > Hi everyone,
> > > > > > >
> > > > > > > I want to propose a workflow for situations, when some 
> > > > > > > tests
> are
> > > > > failing
> > > > > > > for a longer time. In such cases, my proposed workflow is:
> > > > > > > - If a test or a set of tests is failing for two days
> (nightlies
> > or
> > > > PR
> > > > > > > checks),  ignore those tests in the codebase, so they are 
> > > > > > > not
> > > > executed.
> > > > > > > - File an issue in kie-issues repository, reporting the 
> > > > > > > test
> > > > failures.
> > > > > > > - If the issue is not resolved for half a year, delete 
> > > > > > > those
> > tests
> > > > from
> > > > > > the
> > > > > > > codebase.
> > > > > > >
> > > > > > > What do you think please? This should make sure all 
> > > > > > > failures
> that
> > > > don't
> > > > > > get
> > > > > > > fixed immediately after they occur get logged in the issue
> > tracker,
> > > > so
> > > > > > they
> > > > > > > can be appropriately handled and don't block unrelated PR
> checks
> > > and
> > > > > > > builds. It will also make sure that tests that are not 
> > > > > > > fixed
> for
> > a
> > > > long
> > > > > > > time (therefore they could be perceived as not important) 
> > > > > > > are
> > not a
> > > > > > > maintenance burden for the future.
> > > > > > >
> > > > > > > Best regards,
> > > > > > > Tibor
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>

RE: [PROPOSAL] Workflow for ignoring long failing tests

Reply via email to