disk-io lockup in 4.14.13 kernel

2018-02-22 Thread Jaco Kroon
Hi, We've been seeing sporadic IO lockups on recent kernels. Currently installed on the server is 4.14.13. Previously we ran 4.0.9, due to various problems from 4.1 onward with respect to RAID problems, and netfilter etc ... with 4.13 being the first usable kernel again that doesn't require addi

Re: disk-io lockup in 4.14.13 kernel

2018-02-23 Thread Jaco Kroon
Hi Bart, Thank you for your response. On 22/02/2018 18:46, Bart Van Assche wrote: > On 02/22/18 02:58, Jaco Kroon wrote: >> We've been seeing sporadic IO lockups on recent kernels. > > Are you using the legacy I/O stack or blk-mq? If you are not yet using > blk-mq, c

Re: disk-io lockup in 4.14.13 kernel

2018-03-10 Thread Jaco Kroon
Hi Bart, On 11/03/2018 05:08, Bart Van Assche wrote: > On Sat, 2018-03-10 at 22:56 +0200, Jaco Kroon wrote: >> On 22/02/2018 18:46, Bart Van Assche wrote: >>> On 02/22/18 02:58, Jaco Kroon wrote: >>>> We've been seeing sporadic IO lockups on recent kernels. >

Re: disk-io lockup in 4.14.13 kernel

2018-03-13 Thread Jaco Kroon
Hi Bart, On 11/03/2018 07:00, Bart Van Assche wrote: > On Sun, 2018-03-11 at 06:33 +0200, Jaco Kroon wrote: >> crowsnest ~ # find /sys -name sdm >> /sys/kernel/debug/block/sdm >> /sys/devices/pci:00/:00:01.0/:01:00.0/host0/port-0:0/expander-0:0/port-0:0:0/ex

Re: disk-io lockup in 4.14.13 kernel

2018-03-13 Thread Jaco Kroon
Hi Bart, On 13/03/2018 16:10, Bart Van Assche wrote: > On Tue, 2018-03-13 at 11:30 +0200, Jaco Kroon wrote: >> On 11/03/2018 07:00, Bart Van Assche wrote: >>> Did I see correctly that /dev/sdm is behind a MPT SAS controller? You may >>> want to contact the authors of t

Re: disk-io lockup in 4.14.13 kernel

2018-03-13 Thread Jaco Kroon
Hi Bart, On 13/03/2018 19:24, Bart Van Assche wrote: > On Tue, 2018-03-13 at 19:16 +0200, Jaco Kroon wrote: >> The server in question is the destination of numerous rsync/ssh cases >> (used primarily for backups) and is not intended as a real-time system. >> I'm happy t

Re: disk-io lockup in 4.14.13 kernel

2018-03-24 Thread Jaco Kroon
on or something, and had to reboot, so not sure if that can be done still. Kind Regards, Jaco On 13/03/2018 19:24, Bart Van Assche wrote: > On Tue, 2018-03-13 at 19:16 +0200, Jaco Kroon wrote: >> The server in question is the destination of numerous rsync/ssh cases >> (used primarily for

Re: disk-io lockup in 4.14.13 kernel

2018-03-27 Thread Jaco Kroon
Hi Bart, > The above call trace means that SysRq-l was triggered, either via the keyboard > or through procfs. I don't think that there is any information in the above > that reveals the root cause of why a reboot was necessary. I triggered it hoping to get a stack trace of the process which is dea