Bug#986256: [Debian-med-packaging] Bug#986256: simka: flaky amd64 autopkgtest: regularly times out after 2:47 h

2021-04-07 Thread Étienne Mollier
Hi Graham,

Graham Inggs, on 2021-04-06 20:15:30 +0200:
> This is still occurring with simka 1.5.3-3, see:
> https://ci.debian.net/packages/s/simka/testing/amd64/

Thanks for noticing.  It looks like I got caught out by the
Python script not taking the -nb-cores argument, unlike the
shell script.  While I couldn't reproduce the hang on an emulator,
it seems I can reproduce the problem simply by forcing the cores
count to 48 within the Python script.  The test hangs precisely
at this location:

[...]
Command for visualizing results with metadata annotations:
python ../scripts/visualization/run-visualization.py -in 
./simka_results/ -out ./simka_results/ -pca -heatmap -tree -metadata-in 
../example/dataset_metadata.csv -metadata-variable VARIABLE_1
TESTING k=31 t=0
simka -in simka_input.txt -out ./__results__/results_k31_t0 -out-tmp 
./temp_output -simple-dist -complex-dist -kmer-size 31 -abundance-min 0 
-verbose 0 -nb-cores 48
OK
TESTING k=21 t=0
simka -in simka_input.txt -out ./__results__/results_k21_t0 -out-tmp 
./temp_output -simple-dist -complex-dist -kmer-size 21 -abundance-min 0 
-verbose 0 -nb-cores 48
OK
TESTING k=31 t=2
simka -in simka_input.txt -out ./__results__/results_k31_t2 -out-tmp 
./temp_output -simple-dist -complex-dist -kmer-size 31 -abundance-min 2 
-verbose 0 -nb-cores 48
_

The output on the CI infrastructure is a bit misleading, because
standard output is not flushed after each line by default.
I forced flushing manually in the Python script to get
a clearer view.  I /believe/ I could reproduce precisely the
deadlock this time.  Hopefully a further upload with a controlled
cores count at this level will do the trick.
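
As a rough illustration of the two changes described above (flushing
each line and pinning the cores count), a minimal sketch in Python could
look like the following.  The helper names build_simka_cmd and log, and
the cap of 2 cores, are assumptions made for this example and not what
the packaged script does; the simka options are copied from the log
above.

```python
import sys

MAX_CORES = 2  # assumed cap for illustration; the real value is a packaging choice


def build_simka_cmd(kmer_size, abundance_min, nb_cores=MAX_CORES):
    """Build the simka command line with an explicit -nb-cores value."""
    return [
        "simka",
        "-in", "simka_input.txt",
        "-out", "./__results__/results_k%d_t%d" % (kmer_size, abundance_min),
        "-out-tmp", "./temp_output",
        "-simple-dist", "-complex-dist",
        "-kmer-size", str(kmer_size),
        "-abundance-min", str(abundance_min),
        "-verbose", "0",
        "-nb-cores", str(nb_cores),
    ]


def log(message, stream=sys.stdout):
    """Print one line and flush right away so CI logs show progress in order."""
    print(message, file=stream, flush=True)


if __name__ == "__main__":
    cmd = build_simka_cmd(31, 2)
    log("TESTING k=31 t=2")
    log(" ".join(cmd))
```

With an explicit -nb-cores value the command no longer depends on how
many cores the CI host exposes, which is the variable that seemed to
trigger the hang here.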

Have a nice day,  :)
-- 
Étienne Mollier 
Fingerprint:  8f91 b227 c7d6 f2b1 948c  8236 793c f67e 8f0d 11da
Sent from /dev/pts/2, please excuse my verbosity.




Bug#986256: simka: flaky amd64 autopkgtest: regularly times out after 2:47 h

2021-04-06 Thread Graham Inggs
Control: reopen -1

This is still occurring with simka 1.5.3-3, see:
https://ci.debian.net/packages/s/simka/testing/amd64/



Bug#986256: simka: flaky amd64 autopkgtest: regularly times out after 2:47 h

2021-04-02 Thread Paul Gevers
Hi Nilesh,

On 02-04-2021 13:40, Nilesh Patra wrote:
> Hi,
> 
> On Thu, 1 Apr 2021 20:53:00 +0200 Paul Gevers wrote:
>  
>> Your package has an autopkgtest, great. However, I looked into
>> the history of your autopkgtest [1] and I noticed version 1.5.3-2 fails
>> regularly on amd64, while sporadically a rerun passes. I copied some of
>> the output at the bottom of this report. It hits the autopkgtest timeout
>> after 2 hours and 47 minutes. Successful runs pass in less than a minute.
>>
>> Because the unstable-to-testing migration software now blocks on
>> regressions in testing, flaky tests, i.e. tests that flip between
>> passing and failing without changes to the list of installed packages,
>> are causing people unrelated to your package to spend time on these
>> tests.
> 
> That makes sense - do you think marking this test as flaky could be
> a solution?

No, because when a test times out, that restriction doesn't work. You'll
need to keep the test below 2:47, and as it normally takes less than a
minute, it may point at something seriously hanging.
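
To make a hang fail quickly instead of running into the autopkgtest
limit, the test could put its own deadline on each command. A minimal
sketch, assuming a Python test driver; run_with_deadline and the
300-second default are hypothetical and chosen only for illustration:

```python
import subprocess
import sys


def run_with_deadline(cmd, seconds=300):
    """Run cmd; return its exit code, or 124 if it exceeds the deadline."""
    try:
        return subprocess.run(cmd, timeout=seconds).returncode
    except subprocess.TimeoutExpired:
        # subprocess.run() kills the child before raising this exception.
        print("timed out after %d s: %s" % (seconds, " ".join(cmd)),
              file=sys.stderr, flush=True)
        return 124  # same convention as coreutils timeout(1)
```

A hanging command would then show up as an ordinary test failure after
a few minutes. Note that only the direct child is killed; processes it
spawned itself may need separate handling.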

Paul





Bug#986256: simka: flaky amd64 autopkgtest: regularly times out after 2:47 h

2021-04-02 Thread Nilesh Patra
Hi,

On Thu, 1 Apr 2021 20:53:00 +0200 Paul Gevers wrote:
 
> Your package has an autopkgtest, great. However, I looked into
> the history of your autopkgtest [1] and I noticed version 1.5.3-2 fails
> regularly on amd64, while sporadically a rerun passes. I copied some of
> the output at the bottom of this report. It hits the autopkgtest timeout
> after 2 hours and 47 minutes. Successful runs pass in less than a minute.
> 
> Because the unstable-to-testing migration software now blocks on
> regressions in testing, flaky tests, i.e. tests that flip between
> passing and failing without changes to the list of installed packages,
> are causing people unrelated to your package to spend time on these
> tests.

That makes sense - do you think marking this test as flaky could be
a solution?

Nilesh




Bug#986256: simka: flaky amd64 autopkgtest: regularly times out after 2:47 h

2021-04-01 Thread Paul Gevers
Source: simka
Version: 1.5.3-2
Severity: serious
Tags: sid bullseye
X-Debbugs-CC: debian...@lists.debian.org
User: debian...@lists.debian.org
Usertags: flaky timeout

Dear maintainer(s),

Your package has an autopkgtest, great. However, I looked into
the history of your autopkgtest [1] and I noticed version 1.5.3-2 fails
regularly on amd64, while sporadically a rerun passes. I copied some of
the output at the bottom of this report. It hits the autopkgtest timeout
after 2 hours and 47 minutes. Successful runs pass in less than a minute.

Because the unstable-to-testing migration software now blocks on
regressions in testing, flaky tests, i.e. tests that flip between
passing and failing without changes to the list of installed packages,
are causing people unrelated to your package to spend time on these
tests.

Paul

[1] https://ci.debian.net/packages/s/simka/testing/amd64/

https://ci.debian.net/data/autopkgtest/testing/amd64/s/simka/11355132/log.gz

Computing stats...

Stats
    Reads
        Total:    847    0M    0G
        Min:    98    0M    0G
        Max:    402    0M    0G
        Average:    169    0M    0G
    Kmers
        Distinct Kmers (before merging):    14670    0M    0G
        Distinct Kmers (after merging):    8123    0M    0G
        Shared distinct Kmers:    4322    0M    0G
        Kmers:    50130    0M    0G
        Mean k-mer coverage: 2.66341


Output dir: ./simka_results/




*** Test: PASSED

Command used:
/usr/bin/simka -in ../example/simka_input.txt -out ./simka_results/
-out-tmp ./simka_temp_output

Command for visualizing results:
python ../scripts/visualization/run-visualization.py -in
./simka_results/ -out ./simka_results/ -pca -heatmap -tree

Command for visualizing results with metadata annotations:
python ../scripts/visualization/run-visualization.py -in
./simka_results/ -out ./simka_results/ -pca -heatmap -tree -metadata-in
../example/dataset_metadata.csv -metadata-variable VARIABLE_1
autopkgtest [07:42:20]: ERROR: timed out on command "su -s /bin/bash
debci -c set -e; export USER=`id -nu`; . /etc/profile >/dev/null 2>&1 ||
true;  . ~/.profile >/dev/null 2>&1 || true;
buildtree="/tmp/autopkgtest-lxc.zk5p_5wg/downtmp/build.NZn/src"; mkdir
-p -m 1777 --
"/tmp/autopkgtest-lxc.zk5p_5wg/downtmp/run-unit-test-artifacts"; export
AUTOPKGTEST_ARTIFACTS="/tmp/autopkgtest-lxc.zk5p_5wg/downtmp/run-unit-test-artifacts";
export ADT_ARTIFACTS="$AUTOPKGTEST_ARTIFACTS"; mkdir -p -m 755
"/tmp/autopkgtest-lxc.zk5p_5wg/downtmp/autopkgtest_tmp"; export
AUTOPKGTEST_TMP="/tmp/autopkgtest-lxc.zk5p_5wg/downtmp/autopkgtest_tmp";
export ADTTMP="$AUTOPKGTEST_TMP"; export DEBIAN_FRONTEND=noninteractive;
export LANG=C.UTF-8; export DEB_BUILD_OPTIONS=parallel=48; unset
LANGUAGE LC_CTYPE LC_NUMERIC LC_TIME LC_COLLATE   LC_MONETARY
LC_MESSAGES LC_PAPER LC_NAME LC_ADDRESS   LC_TELEPHONE LC_MEASUREMENT
LC_IDENTIFICATION LC_ALL;rm -f /tmp/autopkgtest_script_pid; set -C; echo
$$ > /tmp/autopkgtest_script_pid; set +C; trap "rm -f
/tmp/autopkgtest_script_pid" EXIT INT QUIT PIPE; cd "$buildtree"; chmod
+x
/tmp/autopkgtest-lxc.zk5p_5wg/downtmp/build.NZn/src/debian/tests/run-unit-test;
touch /tmp/autopkgtest-lxc.zk5p_5wg/downtmp/run-unit-test-stdout
/tmp/autopkgtest-lxc.zk5p_5wg/downtmp/run-unit-test-stderr;
/tmp/autopkgtest-lxc.zk5p_5wg/downtmp/build.NZn/src/debian/tests/run-unit-test
2> >(tee -a /tmp/autopkgtest-lxc.zk5p_5wg/downtmp/run-unit-test-stderr
>&2) > >(tee -a
/tmp/autopkgtest-lxc.zk5p_5wg/downtmp/run-unit-test-stdout);" (kind: test)
autopkgtest [07:42:20]: test run-unit-test: ---]


