Re: [openstack-dev] [gate] job failure rate at ~ 12% (check queue) <= issue?

2015-12-18 Thread Markus Zoeller
Sean Dague <s...@dague.net> wrote on 12/17/2015 04:48:17 PM:

> From: Sean Dague <s...@dague.net>
> To: openstack-dev@lists.openstack.org
> Date: 12/17/2015 04:48 PM
> Subject: Re: [openstack-dev] [gate] job failure rate at ~ 12% (check 
> queue) <= issue?
> 
> On 12/17/2015 05:52 AM, Markus Zoeller wrote:
> > The job failure rates had an unusual rise at 06:30 UTC this morning 
[1].
> > I couldn't figure out if this is a real issue or somewhat related to
> > the gerrit update ~ 18 hours ago. The only thing I found was a time
> > frame of ~ 1h where the jobs failed to update the apt repos [2]. As
> > this issue is not present anymore in logstash, I expected that the job
> > failure rate would drop, but that didn't happen. Long story short,
> > do we have an issue? Or is this the aftermath of bug 1526675? 
> > 
> > [1] http://grafana.openstack.org/dashboard/db/tempest-failure-rate
> > [2] logstash query: http://bit.ly/1O8qjtn
> > 
> > Regards, Markus Zoeller (markus_z)
> 
> That graph is a pretty narrow time slice. What's the rolling average on
> that?
> 
>-Sean
> 
> -- 
> Sean Dague
> http://dague.net

 
If I get my math right, the averages are:

<30 days  <7 days  <2 days
--
gate-tempest-dsvm-full (check)  ~10% ~14%  ~7%
gate-tempest-dsvm-neutron-full (check)  ~11% ~16%  ~9%
gate-grenade-dsvm (check)~8% ~15%  ~7%
 
I guess this means the value I observed is within the expected range and
there is no issue. I'm going to take that into account when I try to 
interpret the dashboard in the future, thanks.

Regards, Markus Zoeller (markus_z)


__
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev


Re: [openstack-dev] [gate] job failure rate at ~ 12% (check queue) <= issue?

2015-12-17 Thread Sean Dague
On 12/17/2015 05:52 AM, Markus Zoeller wrote:
> The job failure rates had an unusual rise at 06:30 UTC this morning [1].
> I couldn't figure out if this is a real issue or somewhat related to
> the gerrit update ~ 18 hours ago. The only thing I found was a time
> frame of ~ 1h where the jobs failed to update the apt repos [2]. As
> this issue is not present anymore in logstash, I expected that the job
> failure rate would drop, but that didn't happen. Long story short,
> do we have an issue? Or is this the aftermath of bug 1526675? 
> 
> [1] http://grafana.openstack.org/dashboard/db/tempest-failure-rate
> [2] logstash query: http://bit.ly/1O8qjtn
> 
> Regards, Markus Zoeller (markus_z)

That graph is a pretty narrow time slice. What's the rolling average on
that?

-Sean

-- 
Sean Dague
http://dague.net

__
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev


[openstack-dev] [gate] job failure rate at ~ 12% (check queue) <= issue?

2015-12-17 Thread Markus Zoeller
The job failure rates had an unusual rise at 06:30 UTC this morning [1].
I couldn't figure out if this is a real issue or somewhat related to
the gerrit update ~ 18 hours ago. The only thing I found was a time
frame of ~ 1h where the jobs failed to update the apt repos [2]. As
this issue is not present anymore in logstash, I expected that the job
failure rate would drop, but that didn't happen. Long story short,
do we have an issue? Or is this the aftermath of bug 1526675? 

[1] http://grafana.openstack.org/dashboard/db/tempest-failure-rate
[2] logstash query: http://bit.ly/1O8qjtn

Regards, Markus Zoeller (markus_z)


__
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev