Re: [openstack-dev] [gate] job failure rate at ~ 12% (check queue) <= issue?
Sean Dague wrote on 12/17/2015 04:48:17 PM: > From: Sean Dague > To: openstack-dev@lists.openstack.org > Date: 12/17/2015 04:48 PM > Subject: Re: [openstack-dev] [gate] job failure rate at ~ 12% (check > queue) <= issue? > > On 12/17/2015 05:52 AM, Markus Zoeller wrote: > > The job failure rates had an unusual rise at 06:30 UTC this morning [1]. > > I couldn't figure out if this is a real issue or somewhat related to > > the gerrit update ~ 18 hours ago. The only thing I found was a time > > frame of ~ 1h where the jobs failed to update the apt repos [2]. As > > this issue is not present anymore in logstash, I expected that the job > > failure rate would drop, but that didn't happen. Long story short, > > do we have an issue? Or is this the aftermath of bug 1526675? > > > > [1] http://grafana.openstack.org/dashboard/db/tempest-failure-rate > > [2] logstash query: http://bit.ly/1O8qjtn > > > > Regards, Markus Zoeller (markus_z) > > That graph is a pretty narrow time slice. What's the rolling average on > that? > >-Sean > > -- > Sean Dague > http://dague.net If I get my math right, the averages are: <30 days <7 days <2 days -- gate-tempest-dsvm-full (check) ~10% ~14% ~7% gate-tempest-dsvm-neutron-full (check) ~11% ~16% ~9% gate-grenade-dsvm (check)~8% ~15% ~7% I guess this means the value I observed is within the expected range and there is no issue. I'm going to take that into account when I try to interpret the dashboard in the future, thanks. Regards, Markus Zoeller (markus_z) __ OpenStack Development Mailing List (not for usage questions) Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
Re: [openstack-dev] [gate] job failure rate at ~ 12% (check queue) <= issue?
On 12/17/2015 05:52 AM, Markus Zoeller wrote: > The job failure rates had an unusual rise at 06:30 UTC this morning [1]. > I couldn't figure out if this is a real issue or somewhat related to > the gerrit update ~ 18 hours ago. The only thing I found was a time > frame of ~ 1h where the jobs failed to update the apt repos [2]. As > this issue is not present anymore in logstash, I expected that the job > failure rate would drop, but that didn't happen. Long story short, > do we have an issue? Or is this the aftermath of bug 1526675? > > [1] http://grafana.openstack.org/dashboard/db/tempest-failure-rate > [2] logstash query: http://bit.ly/1O8qjtn > > Regards, Markus Zoeller (markus_z) That graph is a pretty narrow time slice. What's the rolling average on that? -Sean -- Sean Dague http://dague.net __ OpenStack Development Mailing List (not for usage questions) Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
[openstack-dev] [gate] job failure rate at ~ 12% (check queue) <= issue?
The job failure rates had an unusual rise at 06:30 UTC this morning [1]. I couldn't figure out if this is a real issue or somewhat related to the gerrit update ~ 18 hours ago. The only thing I found was a time frame of ~ 1h where the jobs failed to update the apt repos [2]. As this issue is not present anymore in logstash, I expected that the job failure rate would drop, but that didn't happen. Long story short, do we have an issue? Or is this the aftermath of bug 1526675? [1] http://grafana.openstack.org/dashboard/db/tempest-failure-rate [2] logstash query: http://bit.ly/1O8qjtn Regards, Markus Zoeller (markus_z) __ OpenStack Development Mailing List (not for usage questions) Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev