> And good ol' snowball06 just did it again.

If some boards are doing it and some aren't with the same software,
it really does sound like a h/w issue to me.
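
For what it's worth, the "around 16%" figure quoted further down can be
sanity-checked from the numbers in that same message (25 health check
failures, 8 on snowball, about half of those looking like the eth0
problem). A minimal sketch; the function name and its parameters are
made up for illustration and are not part of LAVA or any existing tooling:

```python
def share_of_failures(total_failures, board_failures, fraction_matching):
    """Return the share of all health check failures attributed to one issue.

    total_failures    -- all health check failures in the period
    board_failures    -- failures on the board type of interest
    fraction_matching -- fraction of those that look like the issue
    """
    matching = board_failures * fraction_matching
    return matching / total_failures


# Figures as quoted in the thread: 25 failures, 8 on snowball,
# roughly half of which look like the eth0 problem.
share = share_of_failures(25, 8, 0.5)
print(f"{share:.0%}")
```

The same helper could be rerun as the failure log grows, so that the
per-board comparison rests on counts rather than impressions.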

> > sounds good. can we also pick one of the boards that we believe is
> > good/better and do the same there?
> > 
> > On Tue, Oct 16, 2012 at 7:50 PM, Dave Pigott <[email protected]> wrote:
> >> 
> >> On 16 Oct 2012, at 17:22, Alexander Sack <[email protected]> wrote:
> >> 
> >>> +anmar
> >>> 
> >>> On Tue, Oct 16, 2012 at 5:59 PM, Andy Doan <[email protected]> wrote:
> >>>> On 10/16/2012 02:26 AM, Lee Jones wrote:
> >>>>> 
> >>>>> On Mon, 15 Oct 2012, Andy Doan wrote:
> >>>>> 
> >>>>>> On 10/15/2012 01:04 PM, Alexander Sack wrote:
> >>>>>>>>>>> 
> >>>>>>>>>>> --------------------
> >>>>>>>>>>> snowball06/08
> >>>>>>>>>>> --------------------
> >>>>>>>>>>> http://192.168.1.10/lava-server/scheduler/job/35179
> >>>>>>>>>>> 
> >>>>>>>>>>> eth0 failed to come up. We see this a lot with snowballs.
> >>>>>>>>> 
> >>>>>>>>> 
> >>>>>>>>> "We see this a lot" -- do we have actual numbers?  To everyone:
> >>>>>>>>> assuming not, what can we do to get some?
> >>>>>> 
> >>>>>> 
> >>>>>> I keep the log of health check failures at:
> >>>>>> 
> >>>>>> 
> >>>>>> 
> >>>>>> https://docs.google.com/a/linaro.org/spreadsheet/ccc?key=0AnxpY5uv-BlNdG9zYTdDLWZWRVFGaWFxQzRLNWtaNmc#gid=8
> >>>>>> 
> >>>>>> In the past 5 days it's happened 4 times on snowball.
> >>>>>> 
> >>>>>> Prior to that, in a span of 25 health failures snowball accounted
> >>>>>> for 8 of the failures. Half of those failures look like this
> >>>>>> problem. So this snowball issue is accounting for around 16% of our
> >>>>>> health check failures.
> >>>>> 
> >>>>> 
> >>>>> So it works sometimes, but not others? Sounds like a h/w bug.
> >>> 
> >>> could be hwbug, but driver bugs can also give nondeterministic
> >>> behaviour in full system stacks in my experience (racy things
> >>> etc.). Since we are in software business I feel we should look closer
> >>> at the software side before disregarding something as hwbug ...
> >>> 
> >>> How can we nail the source of this? Maybe we have a kernel that we
> >>> have a gut feeling is better than the 12.02 and could give that a
> >>> stress test try?
> >> 
> >> Idea for a plan: We take snowball06 and run loop tests on 12.{03-09} for a 
> >> few days and see if any one seems to behave better than the others?
> >> 
> >> Dave
> > 
> > 
> > 
> > -- 
> > Alexander Sack
> > Technical Director, Linaro Platform Teams
> > http://www.linaro.org | Open source software for ARM SoCs
> > http://twitter.com/#!/linaroorg - http://www.linaro.org/linaro-blog
> 

-- 
Lee Jones
Linaro ST-Ericsson Landing Team Lead
Linaro.org │ Open source software for ARM SoCs

_______________________________________________
linaro-validation mailing list
[email protected]
http://lists.linaro.org/mailman/listinfo/linaro-validation
