Bug#1059995: Re: Bug#1059995: pdns: flaky autopkgtest (host dependent): pdns.service: Failed to set up IPC namespacing: Resource temporarily unavailable
* Paul Gevers [240126 22:25]: > Hi zeha, > > On 26-01-2024 10:21, Chris Hofstaedtler wrote: > > I see this "works", but now the tests fail after one try on the > > problematic worker and then are never retried. Can this please be > > fixed? > > What do you have in mind? I think you need to wait until issue 166 [1] is > fixed, which I guess isn't going to happen soon. 166 seems like an option, or auto-retry on a different worker, if thats possible? Chris
Bug#1059995: Re: Bug#1059995: pdns: flaky autopkgtest (host dependent): pdns.service: Failed to set up IPC namespacing: Resource temporarily unavailable
clone 1059995 -1 reopen -1 reassign -1 systemd found -1 systemd/254.3-1 forwarded -1 https://github.com/systemd/systemd/issues/31037 thanks Dear systemd Packagers, Paul Gevers noted that src:pdns's autopkgtests fail every so often on a large amd64 debci worker and on s390x workers. Apparently a similar problem can be seen in src:pdns-recursor's debci runs. As there is no pdns(-recursor) code running at this point, this seems to be a problem somewhere in the space of systemd <> lxc <> apparmor <> kernel. I've opened a bug with systemd upstream, unfortunately with very little info as I don't know how to provide additional info from within a debci run. Help with providing additional info would be very welcome. Thanks, Chris
Bug#1059995: Re: Bug#1059995: pdns: flaky autopkgtest (host dependent): pdns.service: Failed to set up IPC namespacing: Resource temporarily unavailable
Hi Paul, * Paul Gevers [240104 18:14]: > Can you figure out decent numbers for these? Below I printed the output of > lsipc and AFAICT SHMMAX is already pretty big ;) (and the same on all our > hosts, which is also true for MSGMAX). > > On the other hand, $(ipcs -a) doesn't show anything on the host, not even if > I let it run in a while-loop (1 second interval) while I schedule the test > of pdns. So, could this be a bug in systemd (which you claim below should be > handeling this) or is this just not really supported in lxc and do you need > a full VM. Because it works elsewhere, I feel more like a bug, and it would > not be the first instance where code fails to properly handle 64 cores or > 256GB or RAM. Likely, but it is probably in systemd or in lxc or in apparmor or elsewhere. > > > > I wouldn't know what to do about this, its not really under the > > > > control of src:pdns. > > > > > > Well, maybe check for it and fail gracefully? > > > > But how? systemd sets up the IPC namespace. > > exit with 77 when you detect problems and add the skippable restriction. I see this "works", but now the tests fail after one try on the problematic worker and then are never retried. Can this please be fixed? Thanks, Chris