Hello,

I'm currently investigating a Live-Patch application failure in core-
scheduling mode and this is an example of what I usually get:
(it's easily reproducible)

    (XEN) [  342.528305] livepatch: lp: CPU8 - IPIing the other 15 CPUs
    (XEN) [  342.558340] livepatch: lp: Timed out on semaphore in CPU quiesce 
phase 13/15
    (XEN) [  342.558343] bad cpus: 6 9

    (XEN) [  342.559293] CPU:    6
    (XEN) [  342.559562] Xen call trace:
    (XEN) [  342.559565]    [<ffff82d08023f304>] R 
common/schedule.c#sched_wait_rendezvous_in+0xa4/0x270
    (XEN) [  342.559568]    [<ffff82d08023f8aa>] F 
common/schedule.c#schedule+0x17a/0x260
    (XEN) [  342.559571]    [<ffff82d080240d5a>] F 
common/softirq.c#__do_softirq+0x5a/0x90
    (XEN) [  342.559574]    [<ffff82d080278ec5>] F 
arch/x86/domain.c#guest_idle_loop+0x35/0x60

    (XEN) [  342.559761] CPU:    9
    (XEN) [  342.560026] Xen call trace:
    (XEN) [  342.560029]    [<ffff82d080241661>] R _spin_lock_irq+0x11/0x40
    (XEN) [  342.560032]    [<ffff82d08023f323>] F 
common/schedule.c#sched_wait_rendezvous_in+0xc3/0x270
    (XEN) [  342.560036]    [<ffff82d08023f8aa>] F 
common/schedule.c#schedule+0x17a/0x260
    (XEN) [  342.560039]    [<ffff82d080240d5a>] F 
common/softirq.c#__do_softirq+0x5a/0x90
    (XEN) [  342.560042]    [<ffff82d080279db5>] F 
arch/x86/domain.c#idle_loop+0x55/0xb0

The first HT sibling is waiting for the second in the LP-application
context while the second waits for the first in the scheduler context.

Any suggestions on how to improve this situation are welcome.

--
Thanks,
Sergey

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xenproject.org
https://lists.xenproject.org/mailman/listinfo/xen-devel

Reply via email to