Takao Indoh (on Fri, 15 Sep 2006 14:34:16 +0900) wrote:
>I have one question. When one cpu sends IPI and other cpus receive INIT
>interrupt, which cpus become monarch? Is there possibility that all cpus
>which receive INIT become slave cpu? If so, these cpus must wait until
>monarch cpu comes. 
>
>        if (!sos->monarch) {
>                ia64_mc_info.imi_rendez_checkin[cpu] = 
> IA64_MCA_RENDEZ_CHECKIN_INIT;
>                while (monarch_cpu == -1)
>                       cpu_relax();     /* spin until monarch enters */
>
>It means these cpus enter infinite loop because the remaining one cpu
>never comes to ia64_init_handler.

Correct, and it does happen.  KDB has already solved this problem, the
monarch cpu is the one that started the debug[*] process, all the other
cpus have to be treated as slaves, whether they are interrupted by IPI
or NMI/INIT.  It gets complicated by broken proms that treat some INIT
events as monarch when they are really slaves.

I am back from vacation and am now working on a common 'crash stop'
synchronization interface that can be used by all the debug[*] tools,
instead of everybody inventing their own and tripping over each other.
Obviously it is based on the KDB patch, but it will be generic enough
that anybody can use it.

[*] Using the term 'debug' generically here.  It can be a debugger like
    kdb, kgdb, nlkd or a crash dump tool like lkcd, crash or kexec.
    They all have the same synchronization requirements, stop all cpus
    using NMI/INIT where necessary and save the state of all the cpus
    for later diagnosis.

_______________________________________________
fastboot mailing list
[email protected]
https://lists.osdl.org/mailman/listinfo/fastboot

Reply via email to