[lustre-discuss] RDMA too fragmented, OSTs unavailable (permanently)

2016-09-10 Thread Thomas Roth
Hi all, we are running Lustre 2.5.3 on Infiniband. We have massive problems with clients being unable to communicate with any number of OSTs, rendering the entire cluster quite unusable. Clients show > LNetError: 1399:0:(o2iblnd_cb.c:1140:kiblnd_init_rdma()) RDMA too fragmented for

Re: [lustre-discuss] RDMA too fragmented, OSTs unavailable (permanently)

2016-09-10 Thread Patrick Farrell
Thomas, It is somewhat sideways from your questions, but when Cray has seen this problem historically, it has almost always been due to lots of small direct I/O from a user code. - Patrick From: lustre-discuss on