Thanks for sharing your experience.
I am running with peer_credits set to 63, as it was for mlx4.
I have not noticed any problems with these settings. However, to saturate the EDR bandwidth when writing and reading through Lustre, I have to use two different clients; that way I can reach up to 9 GB/s per OSS. From a single client I cannot reach more than 5 GB/s, and it is not clear to me why.

On 11/8/18 4:58 AM, Martin Hecht wrote:
On 11/7/18 9:44 PM, Riccardo Veraldi wrote:
Anyway, I was wondering whether something different is needed for mlx5,
and what the suggested values are in that case.

Does anyone have experience with mlx5 LNET performance tuning?
Hi Riccardo,

We have recently integrated mlx5 nodes into our fabric, and we had to
reduce the values to

peer_credits = 16
concurrent_sends = 16

because mlx5 doesn't support larger values for some reason. The peer_credits
value must be the same in all connected LNets, even across routers (at least
it used to be like this; I believe we are currently running some Lustre 2.5.x
derivatives on the server side, and newer versions on the various clients).

kind regards,
Martin
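
The requirement that peer_credits match on every connected node can be checked mechanically. The following is only a minimal sketch, not something from the thread: it assumes the values are set through modprobe-style "options ko2iblnd ..." lines, and the host names, option lines, and helper function names are all hypothetical, chosen for illustration.

```python
def parse_ko2iblnd_options(line):
    """Parse a modprobe-style 'options ko2iblnd key=value ...' line into a dict."""
    tokens = line.split()
    if tokens[:2] != ["options", "ko2iblnd"]:
        return {}
    # Assumes every remaining token has the form key=<integer>.
    return {k: int(v) for k, v in (t.split("=", 1) for t in tokens[2:])}


def mismatched_peer_credits(configs):
    """Return {host: peer_credits} if the hosts disagree, else an empty dict."""
    values = {
        host: parse_ko2iblnd_options(line).get("peer_credits")
        for host, line in configs.items()
    }
    return values if len(set(values.values())) > 1 else {}


# Hypothetical hosts and option lines, for illustration only.
configs = {
    "oss01": "options ko2iblnd peer_credits=16 concurrent_sends=16",
    "client01": "options ko2iblnd peer_credits=16 concurrent_sends=16",
    "client02": "options ko2iblnd peer_credits=63 concurrent_sends=63",
}

print(mismatched_peer_credits(configs))
```

With the sample data above, client02 still carries the mlx4-era value of 63, so the function reports all three hosts with their values; an empty result would mean every host agrees.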


_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


