Package: linux-image-amd64
Version: 6.1.76+1
Source: linux
Source-Version: 6.1.76+1
Severity: important
Control: notfound -1 6.6.15-2

Dear Maintainers,

We discovered a bug affecting dlm that prevents any TCP communication by dlm when booted with Debian kernel 6.1.76-1.

dlm startup works (corosync-cpgtool shows the dlm:controld group with all expected nodes), but as soon as we try to add a lockspace, dmesg shows:
```
dlm: Using TCP for communications
dlm: cannot start dlm midcomms -97
```

It seems that commit "dlm: use kernel_connect() and kernel_bind()" (e9cdebbe) was merged to 6.1.

Checking the code, it seems that the changed function dlm_tcp_listen_bind() fails with error code -97 (-EAFNOSUPPORT).
It is called via the following path:

dlm/lockspace.c: threads_start() -> dlm_midcomms_start()
dlm/midcomms.c: dlm_midcomms_start() -> dlm_lowcomms_start()
dlm/lowcomms.c: dlm_lowcomms_start() -> dlm_listen_for_all() -> dlm_proto_ops->listen_bind() = dlm_tcp_listen_bind()

The error code is returned all the way up to threads_start(), where the error message is emitted.
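
For anyone who wants to double-check the errno mapping, here is a minimal sketch that decodes the value from the dmesg line into its symbolic name. It assumes it is run on a Linux host (errno numbers differ between platforms), and decode_kernel_errno() is a hypothetical helper written for this report, not part of dlm or any kernel tooling.

```python
import errno
import os


def decode_kernel_errno(code):
    """Return (symbolic name, description) for a kernel-style errno value.

    Hypothetical helper: kernel functions return -errno, while userspace
    tables use the positive number, so the sign is dropped first.
    """
    number = abs(code)
    name = errno.errorcode.get(number, "UNKNOWN")
    return name, os.strerror(number)


if __name__ == "__main__":
    # -97 is the value from "dlm: cannot start dlm midcomms -97"
    print(decode_kernel_errno(-97))
```

On a Linux system this should report EAFNOSUPPORT for -97, which matches the reading of the dmesg output above.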

Booting with the unsigned kernel from testing (6.6.15-2), which also contains this commit, works without issues.

I'm not sure what additional changes are required to get this working or if rolling back this change is an option.

We'd be happy to test patches that might fix this issue.

Thanks for your help,
Valentin
