Public bug reported:

Problem Description: 
=========================
Tried to take dump using virsh dumpxml command and it fails with Timed out 
'held by remoteDispatchDomainCoreDump' error.

Steps to re-create:
============================
1. boslcp3g4 is installed with 4.15.0-15-generic kernel.
2. LTP & memory map tests were running inside guest.
3. After some time guest in hung state.
4. Tried to take dump using virsh dumpxml.

root@boslcp3:~# virsh dump boslcp3g4 boslcp3g4_mmap_ltp --memory-only

error: Failed to core dump domain boslcp3g4 to boslcp3g4_mmap_ltp
error: Disconnected from qemu:///system due to keepalive timeout
error: Timed out during operation: cannot acquire state change lock (held by 
remoteDispatchDomainCoreDump)

root@boslcp3:~# virsh list --all
 Id    Name                           State
 1     boslcp3g3                      running
 2     boslcp3g4                      paused
 4     boslcp3g1                      running

5. It fails with Timed out during opearation & with held by 
remoteDispatchDomainCoreDump error
6. /var/log/syslog dumps 

Apr 18 03:29:14 boslcp3 libvirtd[5538]: 2018-04-18 08:29:13.956+0000: 5576: 
warning : qemuDomainObjBeginJobInternal:4863 : Cannot start job (query, none) 
for domain boslcp3g4; current job is (async nested, dump) owned by (5574 
remoteDispatchDomainCoreDump, 5574 remoteDispatchDomainCoreDump) for (701s, 
701s)
Apr 18 03:29:14 boslcp3 libvirtd[5538]: 2018-04-18 08:29:13.958+0000: 5576: 
error : qemuDomainObjBeginJobInternal:4875 : Timed out during operation: cannot 
acquire state change lock (held by remoteDispatchDomainCoreDump)
Apr 18 03:29:44 boslcp3 libvirtd[5538]: 2018-04-18 08:29:44.492+0000: 5573: 
warning : qemuDomainObjBeginJobInternal:4863 : Cannot start job (query, none) 
for domain boslcp3g4; current job is (async nested, dump) owned by (5574 
remoteDispatchDomainCoreDump, 5574 remoteDispatchDomainCoreDump) for (731s, 
732s)

7. Attached syslog & sosreport

== Comment: #3 - Application Cdeadmin <cdead...@us.ibm.com> - 2018-04-18 
08:11:01 ==
When i tried for second time same command it was successful but syslog dumps 
below warnings continuously

warning : :4863 : Cannot start job (query, none) for domain boslcp3g4;
current job is (async nested, dump) owned by (5574
remoteDispatchDomainCoreDump, 5574

root@boslcp3:~# virsh dump boslcp3g4 boslcp3g4_mmapltp --memory-only
Domain boslcp3g4 dumped to boslcp3g4_mmapltp

vmcore located at:
vmcore at kte111:/LOGS/boslcp3g4/boslcp3g4_mmapltp
Access kte111 using debug@9.3.111.155  (don2rry)

== Comment: #8 - Application Cdeadmin <cdead...@us.ibm.com> - 2018-04-19 
05:26:32 ==
Tried to start the guest boslcp3g1 guest which has qlogic disk as boot & IO disk
root@boslcp3:~# virsh list --all
 Id    Name                           State
 1     boslcp3g4                      running
 3     boslcp3g3                      running
 -     boslcp3g1                      shut off

root@boslcp3:~# echo 10240 > /proc/sys/vm/nr_hugepages
root@boslcp3:~# virsh start --console boslcp3g1

--> Than saw guest went to paused state.
root@boslcp3:/home# virsh list --all
 Id    Name                           State
 1     boslcp3g4                      running
 3     boslcp3g3                      running
 5     boslcp3g1                      paused

Then tried to destroy the guest and its fails with Timed out during
operation: cannot acquire state change lock. Even resume command also
failing as below

Corresponding syslog from /var/log:
Apr 19 05:17:09 boslcp3 libvirtd[5576]: 2018-04-19 10:17:09.056+0000: 5635: 
error : virProcessKillPainfully:401 : Failed to terminate process 142520 with 
SIGKILL: Device or resource busy

== Comment: #26 - Shivaprasad G. Bhat <shiva...@in.ibm.com> - 2018-05-17 
08:57:25 ==
Got to test the patches independently. The below commits from upstream fix the 
false alarms and allows the dump to go through clean.

a5bc7130f3
e712579200
150930e309
9a1755b7fe
501e3c3c96
88c2360753
3455a7359c
fd1a9e5c56
2a4d847e77
9d73df98c2
93412bb827
a8ef7b69dc
5870f95a7a
3f99bb06d1

** Affects: libvirt (Ubuntu)
     Importance: Undecided
     Assignee: Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage)
         Status: New


** Tags: architecture-ppc64le bugnameltc-166912 severity-medium 
targetmilestone-inin1804

** Tags added: architecture-ppc64le bugnameltc-166912 severity-medium
targetmilestone-inin1804

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1771827

Title:
  Timed out message while taking dump using virsh dumpxml command &
  fails with 'held by remoteDispatchDomainCoreDump' error

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/libvirt/+bug/1771827/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to