Time for an update, as I just did some unannounced maintenance while everything was very quiet.

On 07/25/2016 01:56 PM, Andrew Grimberg wrote:
> Greetings folks,
>
> Just wanting to give an infrastructure update given all the changes that
> have happened. While Thanh sent out a note last week, I want to
> re-iterate our current _known_ infrastructure issues that we're working on:
>
> * Nexus timeouts: We still believe that this is related to another issue
> we see specifically with our JJB jobs of authentication failures. I am
> still working on setting up an LDAP replica in the environment to see if
> we have a latency issue between the environment and our core
> authentication services. I expect to have this operational by EOW but
> that's dependent on a few factors outside my control.

We are now using our new LDAP replica. As such we anticipate that this
will resolve the Nexus timeouts. However, since the Release Engineering
team doesn't have the bandwidth to check every failed job, if you do
happen to encounter one this week and it's due to a timeout in Nexus, we
want to know about it!

> * Newish issue that cropped up this weekend: Since the push to make all
> the CSIT jobs run back inside the public cloud we have twice had to
> pause Jenkins to clear out stale lab instances. We are unclear as to
> what is causing this issue but are monitoring it at present until we can
> get a better handle on it.

We still haven't determined what the cause of this issue is. However,
we're _hoping_ that it's related to the LDAP timeouts we had been seeing
before the cutover to our new LDAP replica. We will continue to monitor
this situation as it's obviously a problem.

-Andy-

--
Andrew J Grimberg
Systems Administrator
Release Engineering Team Lead
The Linux Foundation
_______________________________________________
infrastructure mailing list
[email protected]
https://lists.opendaylight.org/mailman/listinfo/infrastructure
