A big thanks to Karsten. I've downgraded the kernels on two OSS nodes and one of the MDS nodes to 3.10.0-1160.2.1.el7.x86_64 and placed the others in standby. Everything has run overnight with 50,000 continuous reads/writes/deletes per cycle and bulk deletes in a shell script running continuously, and this morning it's all still up and running :)
Thanks everyone for your suggestions. Next challenge: RoCE over 100G ConnectX5 cards :)

Sid Young
_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
