Re: [lustre-discuss] RDMA too fragmented, OSTs unavailable (permanently)

2016-10-14 Thread Thomas Roth
Hi Doug, Thomas Stibor has already added our findings to LU-5718 (he is our Dr. Lustre here at GSI). Just to have this on the mailing list as well: the error occurs when a user is close to their quota limit while the RPC size is at its default value. The workaround is to set "max_pages_per_rpc=64"
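
The workaround named above can be sketched as follows. This is only an illustration, not the exact procedure used at GSI: it assumes the value is applied on the clients through `lctl set_param` on the `osc.*.max_pages_per_rpc` tunable, and it assumes a 4 KiB page size (both are assumptions, not statements from the thread). The helper names `rpc_size_bytes` and `set_param_command` are hypothetical.

```python
# Sketch of the max_pages_per_rpc workaround.
# Assumption: 4 KiB pages on the client; check the real page size locally.
PAGE_SIZE_BYTES = 4096


def rpc_size_bytes(max_pages_per_rpc, page_size=PAGE_SIZE_BYTES):
    """Return the largest bulk RPC payload implied by a page count."""
    return max_pages_per_rpc * page_size


def set_param_command(max_pages_per_rpc, osc_pattern="*"):
    """Build (but do not run) the lctl command that applies the setting.

    Assumption: the tunable is set per OSC device on the client via
    'lctl set_param'; osc_pattern selects which OSC devices are affected.
    """
    param = f"osc.{osc_pattern}.max_pages_per_rpc={max_pages_per_rpc}"
    return ["lctl", "set_param", param]


if __name__ == "__main__":
    pages = 64  # value quoted in the thread
    print(rpc_size_bytes(pages) // 1024, "KiB per RPC")
    print(" ".join(set_param_command(pages)))
```

With the assumed 4 KiB page size, 64 pages correspond to 256 KiB per RPC. The command is only printed here; running it requires root on a Lustre client, and the current value can be read beforehand with `lctl get_param osc.*.max_pages_per_rpc`.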

Re: [lustre-discuss] RDMA too fragmented, OSTs unavailable (permanently)

2016-09-22 Thread Oucharek, Doug S
Hi Thomas, It is interesting that you have encountered this error without a router. Good information. I have updated LU-5718 with a link to this discussion. The original fix posted to LU-5718 by Liang will fix this problem for you (it does not assume a router is the cause). That fix does

Re: [lustre-discuss] RDMA too fragmented, OSTs unavailable (permanently)

2016-09-10 Thread Patrick Farrell
org> on behalf of Thomas Roth <t.r...@gsi.de> Sent: Saturday, September 10, 2016 2:38:37 AM To: lustre-discuss@lists.lustre.org Subject: [lustre-discuss] RDMA too fragmented, OSTs unavailable (permanently) Hi all, we are running Lustre 2.5.3 on InfiniBand. We have massive problems with clients

[lustre-discuss] RDMA too fragmented, OSTs unavailable (permanently)

2016-09-10 Thread Thomas Roth
Hi all, we are running Lustre 2.5.3 on InfiniBand. We have massive problems with clients being unable to communicate with any number of OSTs, rendering the entire cluster quite unusable. Clients show > LNetError: 1399:0:(o2iblnd_cb.c:1140:kiblnd_init_rdma()) RDMA too fragmented for
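
To find affected clients, log lines like the one quoted above can be picked out of a console or syslog capture. The following is a minimal sketch that assumes the line layout seen in this one sample; the field names (`pid`, `field2`, and so on) and the function `parse_lnet_error` are labels chosen here for illustration, not names defined by Lustre, and the meaning of the second number is not stated in the thread.

```python
import re

# Pattern derived from the single sample line in this thread.
# Assumption: other LNetError lines follow the same layout.
LNET_ERROR_RE = re.compile(
    r"LNetError: (?P<pid>\d+):(?P<field2>\d+):"
    r"\((?P<file>[\w.]+):(?P<line>\d+):(?P<func>\w+)\(\)\)\s*(?P<msg>.*)"
)


def parse_lnet_error(text):
    """Return a dict of fields for an LNetError line, or None if no match."""
    match = LNET_ERROR_RE.search(text)
    if match is None:
        return None
    fields = match.groupdict()
    fields["pid"] = int(fields["pid"])
    fields["line"] = int(fields["line"])
    return fields


if __name__ == "__main__":
    # Sample copied from the message; it is cut off in the archive.
    sample = ("LNetError: 1399:0:(o2iblnd_cb.c:1140:kiblnd_init_rdma()) "
              "RDMA too fragmented for")
    parsed = parse_lnet_error(sample)
    if parsed and parsed["func"] == "kiblnd_init_rdma":
        print("RDMA fragmentation error reported from", parsed["file"])
```

Filtering on the function name `kiblnd_init_rdma` rather than on the message text avoids depending on the part of the message that the archive truncates.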