Jeffrey E Altman <jalt...@auristor.com> wrote on 05/07/2021 04:44:24 PM: > John, > > What are your observations of how this code behaves on congested links. > I expect that the sorting of received packets distorts the ACK clock > and packet skew measurements reducing the ability to accurately measure > the congestion window. Processing ACK packets in bulk is likely to > produce a bursty transmission pattern which can result in overflowing > the link capacity. As a result, fairness is reduced and packet loss > might be increased. > > Jeffrey Altman > AuriStor, Inc. >
With our server hardware and the grid environment used in testing I was never able to get over about 7Gb/s out of the 10Gb/s connection to the servers and wasn't seeing any packet loss/rx retransmits. I know Andrew reported more than that was possible in the presentation regarding these patches but I didn't have time to debug why our setup wasn't matching those results. My impression was that some other sites might be running these patches in production. Can anyone comment if that is the case and if they are able to saturate links and have the problem described? > On 5/6/2021 10:22 PM, John P Janosik (jpjan...@us.ibm.com) wrote: > > Hi Ben, > > > > We have been importing these patches into our IBM internal OpenAFS 1.8.X > > builds for over a year and have had our busiest cells running these > > versions since fall last year. We hit some deadlock issue early on but > > that was fixed and I believe those patches made it to gerrit as well. > > > > I did the work to get the patches to apply to the versions of OpenAFS we > > are running, but I don't feel confident calling it a review. I missed > > the deadlock issue until we actually put it into production :). > > > > John Janosik > > jpjan...@us.ibm.com > > [attachment "jaltman.vcf" deleted by John P Janosik/Rochester/IBM] > [attachment "OpenPGP_signature" deleted by John P Janosik/Rochester/IBM]