On 11/30/20 9:10 AM, Cornelia Huck wrote:
> On Fri, 27 Nov 2020 19:46:29 +0100
> Philippe Mathieu-Daudé <phi...@redhat.com> wrote:
>> On 11/27/20 7:29 PM, Thomas Huth wrote:
>>> On 27/11/2020 18.57, Philippe Mathieu-Daudé wrote:  
>>>> On 11/27/20 6:47 PM, Thomas Huth wrote:  
>>>>> On 27/11/2020 18.41, Philippe Mathieu-Daudé wrote:  
>>>>>> We lately realized that the Avocado framework was not designed
>>>>>> to be regularly run on CI environments. Therefore, as of 5.2
>>>>>> we deprecate the gitlab-ci jobs using Avocado. To not disrupt
>>>>>> current users, it is possible to keep the current behavior by
>>>>>> setting the QEMU_CI_INTEGRATION_JOBS_PRE_5_2_RELEASE variable
>>>>>> (see [*]).
>>>>>> From now on, using these jobs (or adding new tests to them)
>>>>>> is strongly discouraged.
>>>>>>
>>>>>> Tests based on Avocado will be ported to new job schemes during
>>>>>> the next releases, with better documentation and templates.  
>>>>>
>>>>> Why should we disable the jobs by default as long as there is no 
>>>>> replacement
>>>>> available yet?  
>>>>
>>>> Why keep it enabled if it is failing randomly  
>>>
>>> We can still disable single jobs if they are not stable, but that's no
>>> reason to disable all of them by default, is it?
>>>   
>>>> if images hardcoded
>>>> in tests are being removed from public servers, etc...?  
>>>
>>> That's independent from Avocado, you'll always have that problem if you want
>>> to test with external images, unless you mirror them into a repository on
>>> the same server (ie. gitlab), which, however, might not always be 
>>> possible...
>>>   
>>>> They are not disabled, they are still runnable manually or setting
>>>> QEMU_CI_INTEGRATION_JOBS_PRE_5_2_RELEASE...  
>>>
>>> And who do you think is going to set that variable? Hardly anybody, I 
>>> guess.  
>>
>> Does that mean nobody cares about these tests?
> 
> If I first have to set some random config option before tests are run,
> that's an extra hurdle. I want a simple workflow where I just push to
> gitlab and don't have to care about extra configuration.

Good, that means you are not particularly interested by
this specific test, so you don't need to set
QEMU_CI_INTEGRATION_JOBS_PRE_5_2_RELEASE.

> 
>>
>>> So you could also simply remove the stuff from the yml file completely 
>>> instead.  
>>
>> Yes, I'd prefer that too, but Alex objected.
>>
>>>> We realized by default Avocado runs all tests on the CI jobs,
>>>> triggering failures and complains. Developer stopped to contribute/
>>>> review integration tests because of that.  
>>>
>>> Did anybody really stop contributing "acceptance" test since they were
>>> afraid of the gitlab-CI running them? That's new to me, do you have a 
>>> pointer?  
>>
>> No, but alternatively, how many tests were contributed / reviewed
>> last year?
> 
> I added one, just last week... plan to do more, but there's always less
> time than things I want/need to do. Maybe this just needs more
> advertising?

I noticed your test. Do you plan it to be run regularly on CI too
(a.k.a. a gating test)?

It is small, simple, run fast, it got quickly reviewed.

Nobody cared to review the COLO test 3 months, because it is
more complex:
https://www.mail-archive.com/qemu-devel@nongnu.org/msg727381.html

> 
>>
>>>> We want developers be
>>>> able to contribute tests to the repository fearlessly.  
>>>
>>> You can always mark your test with @skipIf(os.getenv('GITLAB_CI')) if you
>>> don't want to see it running in the gitlab-CI, so that's not a reason to be
>>> afraid.  
>>
>> This was the idea here (opposite, tag jobs with 'gating-ci' to run
>> them on GitLab):
>> https://www.mail-archive.com/qemu-devel@nongnu.org/msg756464.html
> 
> I guess you want all of that:
> - tag tests that you know to not work, so they're not run
> - tag tests that you know to be stable, so they can be gating
> - all non-tagged tests are expected to work usually, but there might be
>   an occasional failure
> ?

Yes, but IIUC Alex didn't agree to use a gating tag for stable
tests, he wants everything to be run on GitLab.

> 
>>
>>>   
>>>> If we don't change anything, we'll keep having CI failures due
>>>> to Avocado design issues (artifacts removed from remote servers,
>>>> etc...).  
>>>
>>> I fail to see the relation between Avocado and vanishing artifacts from 3rd
>>> party servers... what do you plan to do instead if something gets (re-)moved
>>> from a server that is not under our control?  
>>
>> Avocado tests and CI are orthogonal, but it will be painful to
>> fix Avocado tests with the current Avocado CI jobs.
> 
> What's the problem? Cryptic error messages when artifacts cannot be
> fetched?

More importantly to display a reproducible command line on failure.

But there are some design issue (Avocado was not design to run on
CI environment at all) that will be hard to solve if we have to keep
what we current have.

> 
>>
>>>   
>>>> I haven't seen any attempt on this list to improve the current
>>>> fragile situation, but better suggestions are certainly welcome.  
>>>
>>> At least I am hoping that the "check-acceptance" tests will break a little
>>> bit less often once Peter uses the gitlab-CI for his gating tests, too. That
>>> will at least prevent that one of the tests gets completely broken by a new
>>> merged pull request. Of course there's still the risk that tests only fail
>>> occasionally due to new bugs, but that can also happen for all other test
>>> suites (unit, qtest, iotests, ...), too.  
>>
>> Or Peter (as other users) will get grumpy at these tests because they
>> are unreliable, hard to understand what fail and debug.
>>
>> Thus the removal suggestion, so we can "fix" the missing Avocado parts
>> before it is used heavily.
> 
> I think disabling _all_ of them is way too harsh.

I'm looking for ways to:

- allow developers to contribute integration tests easily,
  not being blocked by CI.
- allow maintainers to expand their default tests set (by
  marking non-gating tests as gating for their workflow).
- allow developers to disable tests they are not interested
  in to reduce how many CI minutes they burn on each pipeline
- figure out who to contact in case a job failed (runner
  owner? test owner? breaking patch author?)

I thought marking our current jobs as optional would be simpler
that introducing an Avocado2 framework and duplicating the tests,
so I wanted to try the fast path first.

Regards,

Phil.


Reply via email to