Bug#1059995: Re: Bug#1059995: pdns: flaky autopkgtest (host dependent): pdns.service: Failed to set up IPC namespacing: Resource temporarily unavailable

2024-01-29 Thread Chris Hofstaedtler
* Paul Gevers  [240126 22:25]:
> Hi zeha,
> 
> On 26-01-2024 10:21, Chris Hofstaedtler wrote:
> > I see this "works", but now the tests fail after one try on the
> > problematic worker and then are never retried. Can this please be
> > fixed?
> 
> What do you have in mind? I think you need to wait until issue 166 [1] is
> fixed, which I guess isn't going to happen soon.

166 seems like an option, or auto-retry on a different worker, if
thats possible?

Chris



Bug#1059995: Re: Bug#1059995: pdns: flaky autopkgtest (host dependent): pdns.service: Failed to set up IPC namespacing: Resource temporarily unavailable

2024-01-26 Thread Chris Hofstaedtler
clone 1059995 -1
reopen -1
reassign -1 systemd
found -1 systemd/254.3-1
forwarded -1 https://github.com/systemd/systemd/issues/31037
thanks

Dear systemd Packagers,

Paul Gevers noted that src:pdns's autopkgtests fail every so often
on a large amd64 debci worker and on s390x workers. Apparently a
similar problem can be seen in src:pdns-recursor's debci runs.

As there is no pdns(-recursor) code running at this point, this
seems to be a problem somewhere in the space of systemd <> lxc <>
apparmor <> kernel.

I've opened a bug with systemd upstream, unfortunately with very
little info as I don't know how to provide additional info from
within a debci run. Help with providing additional info would be
very welcome.

Thanks,
Chris



Bug#1059995: Re: Bug#1059995: pdns: flaky autopkgtest (host dependent): pdns.service: Failed to set up IPC namespacing: Resource temporarily unavailable

2024-01-26 Thread Chris Hofstaedtler
Hi Paul,

* Paul Gevers  [240104 18:14]:
> Can you figure out decent numbers for these? Below I printed the output of
> lsipc and AFAICT SHMMAX is already pretty big ;) (and the same on all our
> hosts, which is also true for MSGMAX).
> 
> On the other hand, $(ipcs -a) doesn't show anything on the host, not even if
> I let it run in a while-loop (1 second interval) while I schedule the test
> of pdns. So, could this be a bug in systemd (which you claim below should be
> handeling this) or is this just not really supported in lxc and do you need
> a full VM. Because it works elsewhere, I feel more like a bug, and it would
> not be the first instance where code fails to properly handle 64 cores or
> 256GB or RAM.

Likely, but it is probably in systemd or in lxc or in apparmor or
elsewhere.

> > > > I wouldn't know what to do about this, its not really under the
> > > > control of src:pdns.
> > > 
> > > Well, maybe check for it and fail gracefully?
> > 
> > But how? systemd sets up the IPC namespace.
> 
> exit with 77 when you detect problems and add the skippable restriction.

I see this "works", but now the tests fail after one try on the
problematic worker and then are never retried. Can this please be
fixed?

Thanks,
Chris