On 2 Jul 2012, at 17:58, Andy Doan wrote:

> On 07/01/2012 07:03 PM, Michael Hudson-Doyle wrote:
>> Andy Doan <[email protected]> writes:
>> 
>>> We've seen a significant reduction in health job failures, but I still
>>> wanted to send out a report on these so people could see how things are
>>> still breaking.
>>> 
>>> We've had 25 real health failures over the past 2 weeks.
>>> 
>>> By device type:
>>> 
>>>   6 snowball
>>>   1 imx53
>>>   1 vexpress
>>>   2 beagle
>>>   6 origen
>>>   9 panda
>>> 
>>> By failure type:
>>> 
>>>   2 SD cards died: (both on Origen)
>> 
>> Yay!  That's the sort of problem we are _supposed_ to be finding :)
>> 
>>>   7 Serial Console Related:
>>>    - 5 connection never established at start of job
>> 
>> I'd dearly love to know what's going on here.  I could implement a kind
>> of ~exponential back off where we wait 5 seconds, 1 minute, 5 minutes
>> between attempts to reset the port?
> 
> Maybe Dave has some thoughts. I haven't played around enough with that stuff 
> to have a very informed opinion, but that does sound worth trying on the 
> surface.

I'm not completely sure what's going on, but I know that essentially when it 
happens you have to power cycle the board to get back to it. One thing I 
haven't tried as yet is to reset the serial port when it's stuck like this, but 
next time it happens I'll play around a bit more, rather than just doing a 
quick fix to get the board back online.

Dave
_______________________________________________
linaro-validation mailing list
[email protected]
http://lists.linaro.org/mailman/listinfo/linaro-validation

Reply via email to