> >There was a small program posted by Roger Larsson that monitored
> >processes running with SCHED_FIFO and if one of them hogged cpu for more
> >than a number of seconds it would get downgraded to SCHED_OTHER. If the
> >bug is a lockup due to SCHED_FIFO this may revive the machine after it
> >hangs (it did that for me when I was poking around trying to find the
> >cause of lockups that latter were solved by Andrew Morton). 
> 
> true, but JACK almost does this by itself. when run with -R, a highpri
> SCHED_FIFO task runs every 5 (?) seconds, and requires that the engine
> has checked in since the last time. this prevents loops within JACK
> from stalling the entire system.

"should prevent" :-), in my case the machine would just lock. Whatever
was making the timer happy was still happening. I think that when the
extra program brought down the priority of jack then the watchdog timer
would kick in, but a little too late. 

> anyway, the fact that ctrl-alt-sysrq doesn't work either is a good
> indication of a deeper problem than a runaway SCHED_FIFO task. i think.

Yup, that sounds much worse... althugh I seem to remember some similar
issue with my lockups as well (those were caused by the ext3 bug), if
you did not break out to a debugger or rebooted with sysrq soon,
eventually the machine would not respond to anything and the only way
out was to power off. 

I'm building current alsa cvs and will try to find the time to test this
on an smp machine. 

-- Fernando




-------------------------------------------------------
This SF.Net email sponsored by: Free pre-built ASP.NET sites including
Data Reports, E-commerce, Portals, and Forums are available now.
Download today and enter to win an XBOX or Visual Studio .NET.
http://aspnet.click-url.com/go/psa00100003ave/direct;at.aspnet_072303_01/01
_______________________________________________
Alsa-devel mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/alsa-devel

Reply via email to