Hi Doug,
Thomas Stibor has already added our findings to LU-5718 (he is our Dr. Lustre
here at GSI).
Just to have this also on the mailing list: the error occurs if a user is close to their quota limit,
while the RPC size is at default value.
Workaround is setting "max_pages_per_rpc=64"
Hi Thomas,
It is interesting that you have encountered this error without a router. Good
information. I have updated LU-5718 with a link to this discussion.
The original fix posted to LU-5718 by Liang will fix his problem for you (it
does not assume a router is the cause). That fix does
org> on behalf of
Thomas Roth <t.r...@gsi.de>
Sent: Saturday, September 10, 2016 2:38:37 AM
To: lustre-discuss@lists.lustre.org
Subject: [lustre-discuss] RDMA too fragmented, OSTs unavailable (permanently)
Hi all,
we are running Lustre 2.5.3 on Infiniband. We have massive problems with
clients
Hi all,
we are running Lustre 2.5.3 on Infiniband. We have massive problems with clients being unable to communicate with any number of OSTs, rendering the
entire cluster quite unusable.
Clients show
> LNetError: 1399:0:(o2iblnd_cb.c:1140:kiblnd_init_rdma()) RDMA too fragmented
for