Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-19 Thread Paul Gevers

Hi,

On 19-03-2022 14:48, Nilesh Patra wrote:

For avoidance of doubt, the canonical definition lives in the autopkgtest
package: 
https://salsa.debian.org/ci-team/autopkgtest/-/blob/master/doc/README.package-tests.rst


I usually refer /usr/share/doc/autopkgtest/README.package-tests.rst.gz in the 
autopkgtest package,
I believe that's the same thingy.


Agree, except it doesn't refer so nicely in on-line discussions *and* 
the on-line document may be more up-to-date than your local copy. But, 
you're right of course.



And I would have recommended negative architectures: "-arm64" and try to
test on all architectures not explicitly mentioned for the case where new
architectures are added.


That is what I always do, I add a negation as "!", which is also what I did in
this case:

https://salsa.debian.org/med-team/toil/-/blob/master/debian/tests/control#L8


Oops. I should use the right syntax if I teach something...

Paul


OpenPGP_signature
Description: OpenPGP digital signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-19 Thread Nilesh Patra
On Sat, Mar 19, 2022 at 02:32:12PM +0100, Paul Gevers wrote:
> > [1] https://people.debian.org/~eriberto/README.package-tests.html
> 
> For avoidance of doubt, the canonical definition lives in the autopkgtest
> package: 
> https://salsa.debian.org/ci-team/autopkgtest/-/blob/master/doc/README.package-tests.rst

I usually refer /usr/share/doc/autopkgtest/README.package-tests.rst.gz in the 
autopkgtest package,
I believe that's the same thingy.

> And I would have recommended negative architectures: "-arm64" and try to
> test on all architectures not explicitly mentioned for the case where new
> architectures are added.

That is what I always do, I add a negation as "!", which is also what I did in
this case:

https://salsa.debian.org/med-team/toil/-/blob/master/debian/tests/control#L8

Regards,
Nilesh


signature.asc
Description: PGP signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-19 Thread Paul Gevers

Hi,

On 19-03-2022 09:26, Sascha Steinbiss wrote:

Would it be possible to add a hint to ignore arm64 autopkgtest suite?


BTW I think this is possible already in the autopkgtest definition [1] by 
adding an Architecture: section and leaving out arm64 in the list of archs you 
list in there — if that is what you mean.

Cheers
Sascha

[1] https://people.debian.org/~eriberto/README.package-tests.html


For avoidance of doubt, the canonical definition lives in the 
autopkgtest package: 
https://salsa.debian.org/ci-team/autopkgtest/-/blob/master/doc/README.package-tests.rst


And I would have recommended negative architectures: "-arm64" and try to 
test on all architectures not explicitly mentioned for the case where 
new architectures are added.


Paul


OpenPGP_signature
Description: OpenPGP digital signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-19 Thread Nilesh Patra



On 19 March 2022 1:56:17 pm IST, Sascha Steinbiss  wrote:
>Hi Nilesh,
>
>[…]
>> Would it be possible to add a hint to ignore arm64 autopkgtest suite?
>
>BTW I think this is possible already in the autopkgtest definition [1] by 
>adding an Architecture: section and leaving out arm64 in the list of archs you 
>list in there — if that is what you mean.

That is what I eventually did. The package even migrated to testing now[2], 
along with cwltool. So all good, I guess.

Regards,
Nilesh


>[1] https://people.debian.org/~eriberto/README.package-tests.html
[2]: https://tracker.debian.org/news/1311322/toil-560-5-migrated-to-testing/



Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-19 Thread Sascha Steinbiss
Hi Nilesh,

[…]
> Would it be possible to add a hint to ignore arm64 autopkgtest suite?

BTW I think this is possible already in the autopkgtest definition [1] by 
adding an Architecture: section and leaving out arm64 in the list of archs you 
list in there — if that is what you mean.

Cheers
Sascha

[1] https://people.debian.org/~eriberto/README.package-tests.html


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-10 Thread Nilesh Patra

Hi again, Paul.

On 3/10/22 6:28 PM, Nilesh Patra wrote:

On Thu, Mar 10, 2022 at 01:09:41PM +0100, Paul Gevers wrote:

I was finally able to make time and do this. I first ran

admin@ci-worker08:/tmp/toil$ apt source toil/unstable

admin@ci-worker08:/tmp/toil/toil-5.6.0$ autopkgtest --shell --shell-fail .
-- lxc --sudo autopkgtest-unstable-i386

to confirm it triggers the issue. Than I put the try/except around the block
you proposed (patch didn't apply). The test then ran OK.


Thanks a lot, Paul! I uploaded with the patch.


It passes everywhere now, except arm64[1]. Chokes with

| File "/usr/lib/python3.9/http/client.py", line 302, in _read_status
|   raise BadStatusLine(line)
|   http.client.BadStatusLine: The request is too frequent. Please try 
again later!
|   [2022-03-11T03:59:28+] [MainThread] [E] [toil.worker] Exiting the 
worker because of a failed job on host ci-070-d6e3879e

No idea why this is the case - not sure how to even debug this.
It passes in the porter machine so I do not know what is up. It does not look 
like
$arch specific to me but somewhat $worker-spec specific?

Would it be possible to add a hint to ignore arm64 autopkgtest suite?

Regards,
Nilesh



OpenPGP_signature
Description: OpenPGP digital signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-10 Thread Nilesh Patra
On Thu, Mar 10, 2022 at 01:09:41PM +0100, Paul Gevers wrote:
> I was finally able to make time and do this. I first ran
> 
> admin@ci-worker08:/tmp/toil$ apt source toil/unstable
> 
> admin@ci-worker08:/tmp/toil/toil-5.6.0$ autopkgtest --shell --shell-fail .
> -- lxc --sudo autopkgtest-unstable-i386
> 
> to confirm it triggers the issue. Than I put the try/except around the block
> you proposed (patch didn't apply). The test then ran OK.

Thanks a lot, Paul! I uploaded with the patch.

Regards,
Nilesh


signature.asc
Description: PGP signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-10 Thread Paul Gevers

Hi,

On 03-03-2022 21:35, Paul Gevers wrote:

On 03-03-2022 21:01, Paul Gevers wrote:
b) If you could bypass the unlink (patch pasted below to do so) and 
check once, that would be

great


That's a bit hard to do on the infra. I just started the test locally 
to see if it reproduces on my system. If it does, I'll try with the 
patch.


Unfortunately it passes on my local system. I'll try again on ci.d.n 
infrastructure, but that will take a couple of days.


I was finally able to make time and do this. I first ran

admin@ci-worker08:/tmp/toil$ apt source toil/unstable

admin@ci-worker08:/tmp/toil/toil-5.6.0$ autopkgtest --shell --shell-fail 
. -- lxc --sudo autopkgtest-unstable-i386


to confirm it triggers the issue. Than I put the try/except around the 
block you proposed (patch didn't apply). The test then ran OK.


Paul


OpenPGP_signature
Description: OpenPGP digital signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-03 Thread Paul Gevers

Hi,

On 03-03-2022 21:01, Paul Gevers wrote:
b) If you could bypass the unlink (patch pasted below to do so) and 
check once, that would be

great


That's a bit hard to do on the infra. I just started the test locally to 
see if it reproduces on my system. If it does, I'll try with the patch.


Unfortunately it passes on my local system. I'll try again on ci.d.n 
infrastructure, but that will take a couple of days.


Paul


OpenPGP_signature
Description: OpenPGP digital signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-03 Thread Paul Gevers

Hi,

On 02-03-2022 21:13, Nilesh Patra wrote:

a) As per Sascha's idea[1], if you can simply run the test once, and check
there is nothing(no daemon/service) removing files in /tmp/.. dir


How would I do that? autopkgtest itself does that, but only of tries it 
just finished using. So I'm not sure what you're looking for.


b) If you could bypass the unlink (patch pasted below to do so) and 
check once, that would be

great


That's a bit hard to do on the infra. I just started the test locally to 
see if it reproduces on my system. If it does, I'll try with the patch.


Paul


OpenPGP_signature
Description: OpenPGP digital signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-02 Thread Nilesh Patra

Hi Paul,

On 3/2/22 11:51 PM, Paul Gevers wrote:

I tried re-triggering it to check once again but looks like runners are not up 
unfortunately.


You mean the salsa ones, right?


Yes


And so if you could trigger a test suite for this package at your end once, and 
help debug it,
that'd be really nice.


What do you propose I look for? I'm not very good at debugging random software.


Me neither, but I could propose to check for two things:

a) As per Sascha's idea[1], if you can simply run the test once, and check
there is nothing(no daemon/service) removing files in /tmp/.. dir
b) If you could bypass the unlink (patch pasted below to do so) and check once, 
that would be
great

[1]: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1005479#10

--- a/src/toil/jobStores/fileJobStore.py
+++ b/src/toil/jobStores/fileJobStore.py
@@ -489,13 +489,16 @@
 return
 except OSError as e:
 if e.errno == errno.EEXIST:
-# Overwrite existing file, emulating shutil.copyfile().
-os.unlink(localFilePath)
-# It would be very unlikely to fail again for same reason 
but possible
-# nonetheless in which case we should just give up.
-os.link(jobStoreFilePath, localFilePath)
-# Now we succeeded and don't need to copy
-return
+try:
+# Overwrite existing file, emulating shutil.copyfile().
+os.unlink(localFilePath)
+# It would be very unlikely to fail again for same 
reason but possible
+# nonetheless in which case we should just give up.
+os.link(jobStoreFilePath, localFilePath)
+# Now we succeeded and don't need to copy
+return
+except:
+pass
 elif e.errno == errno.EXDEV:
 # It's a cross-device link even though it didn't appear to 
be.
 # Just keep going and hit the file copy case.



OpenPGP_signature
Description: OpenPGP digital signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-02 Thread Paul Gevers

Hi,

On 02-03-2022 16:53, Nilesh Patra wrote:
Sascha even pointed out that it goes fine on reproducible build 
machines, and this package runs same build time

test as autopkgtest


Yes, but normally not in lxc.

The salsa pipeline was passing as well[1] which IIRC uses similar setup 
as debci.


True, as far as I'm aware.

I tried re-triggering it to check once again but looks like runners are 
not up unfortunately.


You mean the salsa ones, right? ci.d.n runners should all be up.

And so if you could trigger a test suite for this package at your end 
once, and help debug it,

that'd be really nice.


What do you propose I look for? I'm not very good at debugging random 
software.


Paul


OpenPGP_signature
Description: OpenPGP digital signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-02 Thread Nilesh Patra

Hi Paul,

On 3/2/22 9:09 PM, Paul Gevers wrote:

On 02-03-2022 14:58, Nilesh Patra wrote:

@Paul, could you please help reproduce this in the debci environment and
help us with any clues as to what might be wrong?


The autopkgtest command line is at the top of each log.


I am aware, and I use something $similar, but that is not working for me and 
Sascha locally, hence asked for help :)
Sascha even pointed out that it goes fine on reproducible build machines, and 
this package runs same build time
test as autopkgtest
The salsa pipeline was passing as well[1] which IIRC uses similar setup as 
debci.
I tried re-triggering it to check once again but looks like runners are not up 
unfortunately.

And so if you could trigger a test suite for this package at your end once, and 
help debug it,
that'd be really nice.

[1]: https://salsa.debian.org/med-team/toil/-/jobs/2396200

Regards,
Nilesh



OpenPGP_signature
Description: OpenPGP digital signature


Bug#1005479: toil: FTBFS: dh_auto_test: error: pybuild --test --test-pytest -i python{version} -p "3.10 3.9" returned exit code 13

2022-03-02 Thread Paul Gevers

Hi,

On 02-03-2022 14:58, Nilesh Patra wrote:

@Paul, could you please help reproduce this in the debci environment and
help us with any clues as to what might be wrong?


The autopkgtest command line is at the top of each log.


| File "/usr/lib/python3/dist-packages/toil/jobStores/fileJobStore.py", line 
498, in read_file
|os.link(jobStoreFilePath, local_path)
| OSError: [Errno 18] Invalid cross-device link: 
'/tmp/autopkgtest-lxc.rgt3f050/downtmp/build.YHo/src/jobstore-test-de9381ff-d8be-4b36-a691-be2c5fa759d2/files/no-job/file-d7145278c6b84845bd03cf5cff29e690/stream'
 -> '/home/debci/tmp/tmp1lgiu28c'

>

It looks (to me) that home and tmp are mounted on different partitions and 
os.link does not work
with such a setting.


I *think* the /tmp inside the lxc is mapped to /tmp of the host. /home 
is completely native to the container. (Not sure if all this matters). 
On *some* architectures /tmp and /var/lib/lxc of the host are on a 
separate partitions with respect to the rest. I'm pretty sure that for 
all our architectures all files inside the container are on the same 
partition.


Paul


OpenPGP_signature
Description: OpenPGP digital signature