Bug#1059904: mariadb autopkgtest "upstream" is flaky, time sink

2024-01-05 Thread Otto Kekäläinen
New 1:10.116-2 uploaded, looking good so far at
https://tracker.debian.org/pkg/mariadb:

autopkgtest for mariadb/1:10.11.6-2: amd64: Pass, arm64: Pass, armel:
Pass, armhf: Pass, i386: Pass, ppc64el: Test in progress, s390x: Pass



Bug#1059904: mariadb autopkgtest "upstream" is flaky, time sink

2024-01-04 Thread Otto Kekäläinen
## arm64

All green at https://ci.debian.net/packages/m/mariadb/testing/arm64/
Latest log has "Completed: All 1049 tests were successful."

## i386

Likewise, all green at https://ci.debian.net/packages/m/mariadb/testing/i386/
Latest log has "Completed: All 1047 tests were successful.".

## armel

Flaky at https://ci.debian.net/packages/m/mariadb/testing/armel/
Last passing a couple days ago: Completed: All 1047 tests were successful.
All failing ones on main.subselect* for which I have a bugfix.

## armhf

Flaky at https://ci.debian.net/packages/m/mariadb/testing/armhf/
Latest passed: Completed: All 1046 tests were successful.
Likewise to armel, failures on main.subselect*

## s390x

Flaky at https://ci.debian.net/packages/m/mariadb/testing/s390x/
Last passed a couple days ago: All 1047 tests were successful.
Likewise to armel, failures on main.subselect*

## amd64

Finally, amd64 flaky at https://ci.debian.net/packages/m/mariadb/testing/amd64/
Last passed a couple days ago: Completed: All 1049 tests were successful.

All failures on subselect as well, which is an upstream regression in
10.11.6: https://jira.mariadb.org/browse/MDEV-32843
Fix is pending on
https://salsa.debian.org/otto/mariadb-server/-/commits/dev-otto but
still need to finalize a bit before upload.


Summary: the number of tests is not flaky on passed runs. The number
of attempted test runs is fluctuating only when failures happend and
tests re-run.



Bug#1059904: mariadb autopkgtest "upstream" is flaky, time sink

2024-01-04 Thread Otto Kekäläinen
Summary of current status of https://ci.debian.net/packages/m/mariadb/

## ppc64el

Last passed in 
https://ci.debian.net/packages/m/mariadb/testing/ppc64el/40870773/
where 1048 tests were successful:
MariaDB: 1:10.6.11-1
kernel: Linux 6.5.0-0.deb12.4-powerpc64le

The log above includes 4 variants of main.innodb_ext_key passing.


Later we see:

Completed: Failed 8/1053 tests, 99.24% were successful.
Failing test(s): main.innodb_ext_key main.stat_tables_innodb
main.xa_prepared_binlog_off

Completed: Failed 11/1055 tests, 98.96% were successful.
Failing test(s): main.innodb_ext_key main.subselect_sj2_jcl6

Completed: Failed 3/1050 tests, 99.71% were successful.
Failing test(s): main.innodb_ext_key

Completed: Failed 2/1049 tests, 99.81% were successful.
Failing test(s): main.innodb_ext_key

Completed: Failed 3/1050 tests, 99.71% were successful.
Failing test(s): main.xa_prepared_binlog_off

Completed: Failed 10/1054 tests, 99.05% were successful.
Failing test(s): main.innodb_ext_key main.subselect

MariaDB: 1:10.6.11-1
kernel: Linux 6.5.0-0.deb12.4-powerpc64le

In https://ci.debian.net/packages/m/mariadb/testing/ppc64el/40894691/ we see:

Completed: All 1048 tests were successful.
Errors/warnings were found in logfiles during server shutdown after running the
following sequence(s) of tests:
main.long_unique_bugs
183 tests were skipped, 76 by the test itself.

The test emitted a MariaDB crash:

main.long_unique_bugs 'innodb' w1 [ pass ] 324
14:51:15 [ERROR] mysqld got signal 11 ;
Attempting backtrace. You can use the following information to find out

Since 2023-12-18 16:13:06 UTC the test has failed to even start.


The ppc64el logs have crashes and most likely due to
https://jira.mariadb.org/browse/MDEV-30728, in particular the
main.innodb_ext_key one.



Bug#1059904: mariadb autopkgtest "upstream" is flaky, time sink

2024-01-03 Thread Otto Kekäläinen
I am working on a 1:10.11.6-2 upload to fix these. If I can make them
reliable, I will temporarily disable the test suite.


Bug#1059904: mariadb autopkgtest "upstream" is flaky, time sink

2024-01-03 Thread Chris Hofstaedtler
Source: mariadb
Version: 1:10.11.6-1
Severity: important

Hi,

the "upstream" autopkgtest appears to be flaky. On various test runs
that I've inspected in the last few weeks, it often ended up with
the following message, and a re-run seemingly passed without any
other changes.

>  356s Only  1242  of 1240 completed.
...
> 356s Completed: Failed 3/1048 tests, 99.71% were successful.

Somehow it would seem great if these numbers would actually add up.
Also, from the log output I can see only *one* test marked as "[
fail ]". Even that one apparently gets retried automatically and
then passes?

But more importantly, if these tests cannot be made to pass or
succeed consistently, please mark them as flaky, as right now this
is just a waste of everybodys time. mariadb gets tested as part of
the migration tests for so many other packages, it's not funny.

Thanks,
Chris