Re: [ovirt-users] Diagnosing Migration Error
Hi Guys, Yep, that was the problem, thank you very much. :) I had the DNS entries setup as virt-X but the hostnames were registered as ovirt-X so it failed. Migration now works perfectly. :) On 16-03-18 04:47 PM, Nir Soffer wrote: On Fri, Mar 18, 2016 at 9:37 PM, Nir Sofferwrote: On Fri, Mar 18, 2016 at 9:27 PM, Nir Soffer wrote: On Fri, Mar 18, 2016 at 7:59 PM, Charles Tassell wrote: Hi Martin, Thanks, I checked that log and found the error " AttributeError: 'SourceThread' object has no attribute '_destServer'" which from what I see from my googling was supposedly fixed... I'm running vdsm-4.17.23-0.el7.centos.noarch which I believe is the latest. The full error is: Thread-2789::DEBUG::2016-03-18 14:47:04,200::blockSD::422::Storage.Misc.excCmd::(getReadDelay) /usr/bin/taskset --cpu-list 0-23 /usr/bin/dd if=/dev/383a1e7d-8b86-4a50-8bc3-e4374c57a3e5/metadata iflag=direct of=/dev/null bs=4096 count=1 (cwd None) Thread-2789::DEBUG::2016-03-18 14:47:04,211::blockSD::422::Storage.Misc.excCmd::(getReadDelay) SUCCESS: = '1+0 records in\n1+0 records out\n4096 bytes (4.1 kB) copied, 0.000377289 s, 10.9 MB/s\n'; = 0 mailbox.SPMMonitor::DEBUG::2016-03-18 14:47:04,999::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) /usr/bin/taskset --cpu-list 0-23 dd if=/rhev/data-center/0001-0001-0001-0001-02e9/mastersd/dom_md/inbox iflag=direct,fullblock count=1 bs=1024000 (cwd None) mailbox.SPMMonitor::DEBUG::2016-03-18 14:47:05,015::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) SUCCESS: = '1+0 records in\n1+0 records out\n1024000 bytes (1.0 MB) copied, 0.0057206 s, 179 MB/s\n'; = 0 jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,551::__init__::503::jsonrpc.JsonRpcServer::(_serveRequest) Calling 'VM.migrate' in bridge with {u'params': {u'tunneled': u'false', u'autoConverge': u'false', u'src': u'ovirt-01.virt.roblib.upei.ca', u'dst': u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', u'compressed': u'false', u'method': u'online'}, u'vmID': u'2be4938e-f4a3-4322-bae3-8a9628b81835'} jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,552::API::546::vds::(migrate) {u'tunneled': u'false', u'autoConverge': u'false', u'src': u'ovirt-01.virt.roblib.upei.ca', u'dst': u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', u'compressed': u'false', u'method': u'online'} jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,555::__init__::206::jsonrpc.Notification::(emit) Sending event {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,555::__init__::533::jsonrpc.JsonRpcServer::(_serveRequest) Return 'VM.migrate' in bridge with True Thread-41096::ERROR::2016-03-18 14:47:06,559::migration::208::virt.vm::(_recover) vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::[Errno -2] Name or service not known Thread-41096::ERROR::2016-03-18 14:47:06,559::migration::213::virt.vm::(_recover) vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to destroy remote VM Traceback (most recent call last): File "/usr/share/vdsm/virt/migration.py", line 211, in _recover self._destServer.destroy(self._vm.id) AttributeError: 'SourceThread' object has no attribute '_destServer' Thread-41096::DEBUG::2016-03-18 14:47:06,560::__init__::206::jsonrpc.Notification::(emit) Sending event {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} Thread-41096::ERROR::2016-03-18 14:47:06,560::migration::309::virt.vm::(run) vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to migrate Traceback (most recent call last): File "/usr/share/vdsm/virt/migration.py", line 278, in run self._setupVdsConnection() File "/usr/share/vdsm/virt/migration.py", line 143, in _setupVdsConnection client = self._createClient(port) File "/usr/share/vdsm/virt/migration.py", line 130, in _createClient self.remoteHost, int(port), sslctx) File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 1269, in create_connected_socket sock.connect((host, port)) File "/usr/lib64/python2.7/site-packages/M2Crypto/SSL/Connection.py", line 181, in connect self.socket.connect(addr) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) gaierror: [Errno -2] Name or service not known This looks like a bug in the current code (looking in 4.0 development code). Correction, this is fixed in 4.0, but broken in latest 3.6 (v4.17.24). This issue was fixed in https://gerrit.ovirt.org/52897, but not backported to 3.6. Update: there is patch here if you want to test it:
Re: [ovirt-users] Diagnosing Migration Error
On Fri, Mar 18, 2016 at 9:37 PM, Nir Sofferwrote: > On Fri, Mar 18, 2016 at 9:27 PM, Nir Soffer wrote: >> On Fri, Mar 18, 2016 at 7:59 PM, Charles Tassell >> wrote: >>> Hi Martin, >>> >>> Thanks, I checked that log and found the error " AttributeError: >>> 'SourceThread' object has no attribute '_destServer'" which from what I see >>> from my googling was supposedly fixed... I'm running >>> vdsm-4.17.23-0.el7.centos.noarch which I believe is the latest. The full >>> error is: >>> >>> Thread-2789::DEBUG::2016-03-18 >>> 14:47:04,200::blockSD::422::Storage.Misc.excCmd::(getReadDelay) >>> /usr/bin/taskset --cpu-list 0-23 /usr/bin/dd >>> if=/dev/383a1e7d-8b86-4a50-8bc3-e4374c57a3e5/metadata iflag=direct >>> of=/dev/null bs=4096 count=1 (cwd None) >>> Thread-2789::DEBUG::2016-03-18 >>> 14:47:04,211::blockSD::422::Storage.Misc.excCmd::(getReadDelay) SUCCESS: >>> = '1+0 records in\n1+0 records out\n4096 bytes (4.1 kB) copied, >>> 0.000377289 s, 10.9 MB/s\n'; = 0 >>> mailbox.SPMMonitor::DEBUG::2016-03-18 >>> 14:47:04,999::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) >>> /usr/bin/taskset --cpu-list 0-23 dd >>> if=/rhev/data-center/0001-0001-0001-0001-02e9/mastersd/dom_md/inbox >>> iflag=direct,fullblock count=1 bs=1024000 (cwd None) >>> mailbox.SPMMonitor::DEBUG::2016-03-18 >>> 14:47:05,015::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) >>> SUCCESS: = '1+0 records in\n1+0 records out\n1024000 bytes (1.0 MB) >>> copied, 0.0057206 s, 179 MB/s\n'; = 0 >>> jsonrpc.Executor/0::DEBUG::2016-03-18 >>> 14:47:06,551::__init__::503::jsonrpc.JsonRpcServer::(_serveRequest) Calling >>> 'VM.migrate' in bridge with {u'params': {u'tunneled': u'false', >>> u'autoConverge': u'false', u'src': u'ovirt-01.virt.roblib.upei.ca', u'dst': >>> u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': >>> u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', >>> u'compressed': u'false', u'method': u'online'}, u'vmID': >>> u'2be4938e-f4a3-4322-bae3-8a9628b81835'} >>> jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,552::API::546::vds::(migrate) >>> {u'tunneled': u'false', u'autoConverge': u'false', u'src': >>> u'ovirt-01.virt.roblib.upei.ca', u'dst': >>> u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': >>> u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', >>> u'compressed': u'false', u'method': u'online'} >>> jsonrpc.Executor/0::DEBUG::2016-03-18 >>> 14:47:06,555::__init__::206::jsonrpc.Notification::(emit) Sending event >>> {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration >>> Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": >>> "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} >>> jsonrpc.Executor/0::DEBUG::2016-03-18 >>> 14:47:06,555::__init__::533::jsonrpc.JsonRpcServer::(_serveRequest) Return >>> 'VM.migrate' in bridge with True >>> Thread-41096::ERROR::2016-03-18 >>> 14:47:06,559::migration::208::virt.vm::(_recover) >>> vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::[Errno -2] Name or service not >>> known >>> Thread-41096::ERROR::2016-03-18 >>> 14:47:06,559::migration::213::virt.vm::(_recover) >>> vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to destroy remote VM >>> Traceback (most recent call last): >>> File "/usr/share/vdsm/virt/migration.py", line 211, in _recover >>> self._destServer.destroy(self._vm.id) >>> AttributeError: 'SourceThread' object has no attribute '_destServer' >>> Thread-41096::DEBUG::2016-03-18 >>> 14:47:06,560::__init__::206::jsonrpc.Notification::(emit) Sending event >>> {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration >>> Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": >>> "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} >>> Thread-41096::ERROR::2016-03-18 14:47:06,560::migration::309::virt.vm::(run) >>> vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to migrate >>> Traceback (most recent call last): >>> File "/usr/share/vdsm/virt/migration.py", line 278, in run >>> self._setupVdsConnection() >>> File "/usr/share/vdsm/virt/migration.py", line 143, in _setupVdsConnection >>> client = self._createClient(port) >>> File "/usr/share/vdsm/virt/migration.py", line 130, in _createClient >>> self.remoteHost, int(port), sslctx) >>> File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 1269, in >>> create_connected_socket >>> sock.connect((host, port)) >>> File "/usr/lib64/python2.7/site-packages/M2Crypto/SSL/Connection.py", line >>> 181, in connect >>> self.socket.connect(addr) >>> File "/usr/lib64/python2.7/socket.py", line 224, in meth >>> return getattr(self._sock,name)(*args) >>> gaierror: [Errno -2] Name or service not known >> >> This looks like a bug in the current code (looking in 4.0 development code). > > Correction, this is fixed in 4.0, but broken in latest 3.6 (v4.17.24). > > This issue was fixed in
Re: [ovirt-users] Diagnosing Migration Error
Hi Martin, Thanks, I checked that log and found the error " AttributeError: 'SourceThread' object has no attribute '_destServer'" which from what I see from my googling was supposedly fixed... I'm running vdsm-4.17.23-0.el7.centos.noarch which I believe is the latest. The full error is: Thread-2789::DEBUG::2016-03-18 14:47:04,200::blockSD::422::Storage.Misc.excCmd::(getReadDelay) /usr/bin/taskset --cpu-list 0-23 /usr/bin/dd if=/dev/383a1e7d-8b86-4a50-8bc3-e4374c57a3e5/metadata iflag=direct of=/dev/null bs=4096 count=1 (cwd None) Thread-2789::DEBUG::2016-03-18 14:47:04,211::blockSD::422::Storage.Misc.excCmd::(getReadDelay) SUCCESS: = '1+0 records in\n1+0 records out\n4096 bytes (4.1 kB) copied, 0.000377289 s, 10.9 MB/s\n'; = 0 mailbox.SPMMonitor::DEBUG::2016-03-18 14:47:04,999::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) /usr/bin/taskset --cpu-list 0-23 dd if=/rhev/data-center/0001-0001-0001-0001-02e9/mastersd/dom_md/inbox iflag=direct,fullblock count=1 bs=1024000 (cwd None) mailbox.SPMMonitor::DEBUG::2016-03-18 14:47:05,015::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) SUCCESS: = '1+0 records in\n1+0 records out\n1024000 bytes (1.0 MB) copied, 0.0057206 s, 179 MB/s\n'; = 0 jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,551::__init__::503::jsonrpc.JsonRpcServer::(_serveRequest) Calling 'VM.migrate' in bridge with {u'params': {u'tunneled': u'false', u'autoConverge': u'false', u'src': u'ovirt-01.virt.roblib.upei.ca', u'dst': u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', u'compressed': u'false', u'method': u'online'}, u'vmID': u'2be4938e-f4a3-4322-bae3-8a9628b81835'} jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,552::API::546::vds::(migrate) {u'tunneled': u'false', u'autoConverge': u'false', u'src': u'ovirt-01.virt.roblib.upei.ca', u'dst': u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', u'compressed': u'false', u'method': u'online'} jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,555::__init__::206::jsonrpc.Notification::(emit) Sending event {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,555::__init__::533::jsonrpc.JsonRpcServer::(_serveRequest) Return 'VM.migrate' in bridge with True Thread-41096::ERROR::2016-03-18 14:47:06,559::migration::208::virt.vm::(_recover) vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::[Errno -2] Name or service not known Thread-41096::ERROR::2016-03-18 14:47:06,559::migration::213::virt.vm::(_recover) vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to destroy remote VM Traceback (most recent call last): File "/usr/share/vdsm/virt/migration.py", line 211, in _recover self._destServer.destroy(self._vm.id) AttributeError: 'SourceThread' object has no attribute '_destServer' Thread-41096::DEBUG::2016-03-18 14:47:06,560::__init__::206::jsonrpc.Notification::(emit) Sending event {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} Thread-41096::ERROR::2016-03-18 14:47:06,560::migration::309::virt.vm::(run) vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to migrate Traceback (most recent call last): File "/usr/share/vdsm/virt/migration.py", line 278, in run self._setupVdsConnection() File "/usr/share/vdsm/virt/migration.py", line 143, in _setupVdsConnection client = self._createClient(port) File "/usr/share/vdsm/virt/migration.py", line 130, in _createClient self.remoteHost, int(port), sslctx) File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 1269, in create_connected_socket sock.connect((host, port)) File "/usr/lib64/python2.7/site-packages/M2Crypto/SSL/Connection.py", line 181, in connect self.socket.connect(addr) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) gaierror: [Errno -2] Name or service not known On 16-03-18 01:22 PM, Martin Sivak wrote: Hi Charles, you can probably get much more detailed explanation from the vdsm and libvirt logs from the two hosts. Take a look at /var/log/vdsm/vdsm.log. Best regards Martin Sivak SLA / oVirt On Fri, Mar 18, 2016 at 3:37 PM, Charles Tassellwrote: Hello, I'm trying to test migrating a VM from one host to another and it's failing, but with no apparent error message. I took a look at the server logs (which I've copied below) but there doesn't seem to be anything in here that would help track down the problem. Should I be looking somewhere besides engine.log on the hosted engine? This is the latest version of oVirt
Re: [ovirt-users] Diagnosing Migration Error
Hi Charles, you can probably get much more detailed explanation from the vdsm and libvirt logs from the two hosts. Take a look at /var/log/vdsm/vdsm.log. Best regards Martin Sivak SLA / oVirt On Fri, Mar 18, 2016 at 3:37 PM, Charles Tassellwrote: > Hello, > > I'm trying to test migrating a VM from one host to another and it's > failing, but with no apparent error message. I took a look at the server > logs (which I've copied below) but there doesn't seem to be anything in here > that would help track down the problem. Should I be looking somewhere > besides engine.log on the hosted engine? This is the latest version of oVirt > (3.6.3.4-1.el7.centos) that I just upgraded this morning. The VM has an ISO > attached over NFS storage and the disk image is on fiber channel. > > 2016-03-18 14:31:18,913 INFO > [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (default task-25) > [549545f2] Lock Acquired to object > 'EngineLock:{exclusiveLocks='[2be4938e-f4a3-4322-bae3-8a9628b81835= ACTION_TYPE_FAILED_VM_IS_BEING_MIGRATED$VmName cdTest02>]', > sharedLocks='null'}' > 2016-03-18 14:31:19,131 INFO > [org.ovirt.engine.core.bll.MigrateVmToServerCommand] > (org.ovirt.thread.pool-8-thread-45) [549545f2] Running command: > MigrateVmToServerCommand internal: false. Entities affected : ID: > 2be4938e-f4a3-4322-bae3-8a9628b81835 Type: VMAction group MIGRATE_VM with > role type USER > 2016-03-18 14:31:19,186 INFO > [org.ovirt.engine.core.vdsbroker.MigrateVDSCommand] > (org.ovirt.thread.pool-8-thread-45) [549545f2] START, MigrateVDSCommand( > MigrateVDSCommandParameters:{runAsync='true', > hostId='cbfd733b-8ced-487d-8754-a2217ce1210f', > vmId='2be4938e-f4a3-4322-bae3-8a9628b81835', > srcHost='ovirt-01.virt.roblib.upei.ca', > dstVdsId='1200a78f-6d05-4e5e-9ef7-6798cf741310', > dstHost='ovirt-02.virt.roblib.upei.ca:54321', migrationMethod='ONLINE', > tunnelMigration='false', migrationDowntime='0', autoConverge='false', > migrateCompressed='false', consoleAddress='null'}), log id: 8f303ba > 2016-03-18 14:31:19,188 INFO > [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateBrokerVDSCommand] > (org.ovirt.thread.pool-8-thread-45) [549545f2] START, > MigrateBrokerVDSCommand(HostName = oVirt-01, > MigrateVDSCommandParameters:{runAsync='true', > hostId='cbfd733b-8ced-487d-8754-a2217ce1210f', > vmId='2be4938e-f4a3-4322-bae3-8a9628b81835', > srcHost='ovirt-01.virt.roblib.upei.ca', > dstVdsId='1200a78f-6d05-4e5e-9ef7-6798cf741310', > dstHost='ovirt-02.virt.roblib.upei.ca:54321', migrationMethod='ONLINE', > tunnelMigration='false', migrationDowntime='0', autoConverge='false', > migrateCompressed='false', consoleAddress='null'}), log id: 792d7e9e > 2016-03-18 14:31:19,455 INFO > [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateBrokerVDSCommand] > (org.ovirt.thread.pool-8-thread-45) [549545f2] FINISH, > MigrateBrokerVDSCommand, log id: 792d7e9e > 2016-03-18 14:31:19,470 INFO > [org.ovirt.engine.core.vdsbroker.MigrateVDSCommand] > (org.ovirt.thread.pool-8-thread-45) [549545f2] FINISH, MigrateVDSCommand, > return: MigratingFrom, log id: 8f303ba > 2016-03-18 14:31:19,498 INFO > [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] > (org.ovirt.thread.pool-8-thread-45) [549545f2] Correlation ID: 549545f2, Job > ID: 3f97f737-d32a-4ffa-b213-122dcd3ca048, Call Stack: null, Custom Event ID: > -1, Message: Migration started (VM: cdTest02, Source: oVirt-01, Destination: > oVirt-02, User: admin@internal). > 2016-03-18 14:31:28,055 INFO [org.ovirt.engine.core.vdsbroker.VmAnalyzer] > (DefaultQuartzScheduler_Worker-39) [] VM > '2be4938e-f4a3-4322-bae3-8a9628b81835'(cdTest02) moved from 'MigratingFrom' > --> 'Up' > 2016-03-18 14:31:28,055 INFO [org.ovirt.engine.core.vdsbroker.VmAnalyzer] > (DefaultQuartzScheduler_Worker-39) [] Adding VM > '2be4938e-f4a3-4322-bae3-8a9628b81835' to re-run list > 2016-03-18 14:31:28,114 ERROR > [org.ovirt.engine.core.vdsbroker.VmsMonitoring] > (DefaultQuartzScheduler_Worker-39) [] Rerun VM > '2be4938e-f4a3-4322-bae3-8a9628b81835'. Called from VDS 'oVirt-01' > 2016-03-18 14:31:28,202 INFO > [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateStatusVDSCommand] > (org.ovirt.thread.pool-8-thread-46) [] START, > MigrateStatusVDSCommand(HostName = oVirt-01, > MigrateStatusVDSCommandParameters:{runAsync='true', > hostId='cbfd733b-8ced-487d-8754-a2217ce1210f', > vmId='2be4938e-f4a3-4322-bae3-8a9628b81835'}), log id: 618baaa6 > 2016-03-18 14:31:29,209 INFO > [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateStatusVDSCommand] > (org.ovirt.thread.pool-8-thread-46) [] FINISH, MigrateStatusVDSCommand, log > id: 618baaa6 > 2016-03-18 14:31:29,250 ERROR > [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] > (org.ovirt.thread.pool-8-thread-46) [] Correlation ID: 549545f2, Job ID: > 3f97f737-d32a-4ffa-b213-122dcd3ca048, Call Stack: null, Custom Event ID: -1, > Message: Migration failed (VM: cdTest02, Source: oVirt-01, Destination: > oVirt-02). > 2016-03-18
Re: [ovirt-users] Diagnosing Migration Error
On Fri, Mar 18, 2016 at 7:59 PM, Charles Tassellwrote: > Hi Martin, > > Thanks, I checked that log and found the error " AttributeError: > 'SourceThread' object has no attribute '_destServer'" which from what I see > from my googling was supposedly fixed... I'm running > vdsm-4.17.23-0.el7.centos.noarch which I believe is the latest. The full > error is: > > Thread-2789::DEBUG::2016-03-18 > 14:47:04,200::blockSD::422::Storage.Misc.excCmd::(getReadDelay) > /usr/bin/taskset --cpu-list 0-23 /usr/bin/dd > if=/dev/383a1e7d-8b86-4a50-8bc3-e4374c57a3e5/metadata iflag=direct > of=/dev/null bs=4096 count=1 (cwd None) > Thread-2789::DEBUG::2016-03-18 > 14:47:04,211::blockSD::422::Storage.Misc.excCmd::(getReadDelay) SUCCESS: > = '1+0 records in\n1+0 records out\n4096 bytes (4.1 kB) copied, > 0.000377289 s, 10.9 MB/s\n'; = 0 > mailbox.SPMMonitor::DEBUG::2016-03-18 > 14:47:04,999::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) > /usr/bin/taskset --cpu-list 0-23 dd > if=/rhev/data-center/0001-0001-0001-0001-02e9/mastersd/dom_md/inbox > iflag=direct,fullblock count=1 bs=1024000 (cwd None) > mailbox.SPMMonitor::DEBUG::2016-03-18 > 14:47:05,015::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) > SUCCESS: = '1+0 records in\n1+0 records out\n1024000 bytes (1.0 MB) > copied, 0.0057206 s, 179 MB/s\n'; = 0 > jsonrpc.Executor/0::DEBUG::2016-03-18 > 14:47:06,551::__init__::503::jsonrpc.JsonRpcServer::(_serveRequest) Calling > 'VM.migrate' in bridge with {u'params': {u'tunneled': u'false', > u'autoConverge': u'false', u'src': u'ovirt-01.virt.roblib.upei.ca', u'dst': > u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': > u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', > u'compressed': u'false', u'method': u'online'}, u'vmID': > u'2be4938e-f4a3-4322-bae3-8a9628b81835'} > jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,552::API::546::vds::(migrate) > {u'tunneled': u'false', u'autoConverge': u'false', u'src': > u'ovirt-01.virt.roblib.upei.ca', u'dst': > u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': > u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', > u'compressed': u'false', u'method': u'online'} > jsonrpc.Executor/0::DEBUG::2016-03-18 > 14:47:06,555::__init__::206::jsonrpc.Notification::(emit) Sending event > {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration > Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": > "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} > jsonrpc.Executor/0::DEBUG::2016-03-18 > 14:47:06,555::__init__::533::jsonrpc.JsonRpcServer::(_serveRequest) Return > 'VM.migrate' in bridge with True > Thread-41096::ERROR::2016-03-18 > 14:47:06,559::migration::208::virt.vm::(_recover) > vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::[Errno -2] Name or service not > known > Thread-41096::ERROR::2016-03-18 > 14:47:06,559::migration::213::virt.vm::(_recover) > vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to destroy remote VM > Traceback (most recent call last): > File "/usr/share/vdsm/virt/migration.py", line 211, in _recover > self._destServer.destroy(self._vm.id) > AttributeError: 'SourceThread' object has no attribute '_destServer' > Thread-41096::DEBUG::2016-03-18 > 14:47:06,560::__init__::206::jsonrpc.Notification::(emit) Sending event > {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration > Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": > "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} > Thread-41096::ERROR::2016-03-18 14:47:06,560::migration::309::virt.vm::(run) > vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to migrate > Traceback (most recent call last): > File "/usr/share/vdsm/virt/migration.py", line 278, in run > self._setupVdsConnection() > File "/usr/share/vdsm/virt/migration.py", line 143, in _setupVdsConnection > client = self._createClient(port) > File "/usr/share/vdsm/virt/migration.py", line 130, in _createClient > self.remoteHost, int(port), sslctx) > File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 1269, in > create_connected_socket > sock.connect((host, port)) > File "/usr/lib64/python2.7/site-packages/M2Crypto/SSL/Connection.py", line > 181, in connect > self.socket.connect(addr) > File "/usr/lib64/python2.7/socket.py", line 224, in meth > return getattr(self._sock,name)(*args) > gaierror: [Errno -2] Name or service not known This looks like a bug in the current code (looking in 4.0 development code). 1. We try to connect to the remote host: 2. Connectioin fails with DNS resolving error 3. self._destServer is not initialized becuase of 2 4. Code in _recover fail when access self._destServer We should fix the initialization to avoid this error, but it will fix your issue, which is DNS error. Maybe the address of the destination vm is wrong, or you have DNS issues in your environment? Adding Francesco. Nir > > > On 16-03-18 01:22 PM, Martin
[ovirt-users] Diagnosing Migration Error
Hello, I'm trying to test migrating a VM from one host to another and it's failing, but with no apparent error message. I took a look at the server logs (which I've copied below) but there doesn't seem to be anything in here that would help track down the problem. Should I be looking somewhere besides engine.log on the hosted engine? This is the latest version of oVirt (3.6.3.4-1.el7.centos) that I just upgraded this morning. The VM has an ISO attached over NFS storage and the disk image is on fiber channel. 2016-03-18 14:31:18,913 INFO [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (default task-25) [549545f2] Lock Acquired to object 'EngineLock:{exclusiveLocks='[2be4938e-f4a3-4322-bae3-8a9628b81835=]', sharedLocks='null'}' 2016-03-18 14:31:19,131 INFO [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (org.ovirt.thread.pool-8-thread-45) [549545f2] Running command: MigrateVmToServerCommand internal: false. Entities affected : ID: 2be4938e-f4a3-4322-bae3-8a9628b81835 Type: VMAction group MIGRATE_VM with role type USER 2016-03-18 14:31:19,186 INFO [org.ovirt.engine.core.vdsbroker.MigrateVDSCommand] (org.ovirt.thread.pool-8-thread-45) [549545f2] START, MigrateVDSCommand( MigrateVDSCommandParameters:{runAsync='true', hostId='cbfd733b-8ced-487d-8754-a2217ce1210f', vmId='2be4938e-f4a3-4322-bae3-8a9628b81835', srcHost='ovirt-01.virt.roblib.upei.ca', dstVdsId='1200a78f-6d05-4e5e-9ef7-6798cf741310', dstHost='ovirt-02.virt.roblib.upei.ca:54321', migrationMethod='ONLINE', tunnelMigration='false', migrationDowntime='0', autoConverge='false', migrateCompressed='false', consoleAddress='null'}), log id: 8f303ba 2016-03-18 14:31:19,188 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateBrokerVDSCommand] (org.ovirt.thread.pool-8-thread-45) [549545f2] START, MigrateBrokerVDSCommand(HostName = oVirt-01, MigrateVDSCommandParameters:{runAsync='true', hostId='cbfd733b-8ced-487d-8754-a2217ce1210f', vmId='2be4938e-f4a3-4322-bae3-8a9628b81835', srcHost='ovirt-01.virt.roblib.upei.ca', dstVdsId='1200a78f-6d05-4e5e-9ef7-6798cf741310', dstHost='ovirt-02.virt.roblib.upei.ca:54321', migrationMethod='ONLINE', tunnelMigration='false', migrationDowntime='0', autoConverge='false', migrateCompressed='false', consoleAddress='null'}), log id: 792d7e9e 2016-03-18 14:31:19,455 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateBrokerVDSCommand] (org.ovirt.thread.pool-8-thread-45) [549545f2] FINISH, MigrateBrokerVDSCommand, log id: 792d7e9e 2016-03-18 14:31:19,470 INFO [org.ovirt.engine.core.vdsbroker.MigrateVDSCommand] (org.ovirt.thread.pool-8-thread-45) [549545f2] FINISH, MigrateVDSCommand, return: MigratingFrom, log id: 8f303ba 2016-03-18 14:31:19,498 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (org.ovirt.thread.pool-8-thread-45) [549545f2] Correlation ID: 549545f2, Job ID: 3f97f737-d32a-4ffa-b213-122dcd3ca048, Call Stack: null, Custom Event ID: -1, Message: Migration started (VM: cdTest02, Source: oVirt-01, Destination: oVirt-02, User: admin@internal). 2016-03-18 14:31:28,055 INFO [org.ovirt.engine.core.vdsbroker.VmAnalyzer] (DefaultQuartzScheduler_Worker-39) [] VM '2be4938e-f4a3-4322-bae3-8a9628b81835'(cdTest02) moved from 'MigratingFrom' --> 'Up' 2016-03-18 14:31:28,055 INFO [org.ovirt.engine.core.vdsbroker.VmAnalyzer] (DefaultQuartzScheduler_Worker-39) [] Adding VM '2be4938e-f4a3-4322-bae3-8a9628b81835' to re-run list 2016-03-18 14:31:28,114 ERROR [org.ovirt.engine.core.vdsbroker.VmsMonitoring] (DefaultQuartzScheduler_Worker-39) [] Rerun VM '2be4938e-f4a3-4322-bae3-8a9628b81835'. Called from VDS 'oVirt-01' 2016-03-18 14:31:28,202 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateStatusVDSCommand] (org.ovirt.thread.pool-8-thread-46) [] START, MigrateStatusVDSCommand(HostName = oVirt-01, MigrateStatusVDSCommandParameters:{runAsync='true', hostId='cbfd733b-8ced-487d-8754-a2217ce1210f', vmId='2be4938e-f4a3-4322-bae3-8a9628b81835'}), log id: 618baaa6 2016-03-18 14:31:29,209 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateStatusVDSCommand] (org.ovirt.thread.pool-8-thread-46) [] FINISH, MigrateStatusVDSCommand, log id: 618baaa6 2016-03-18 14:31:29,250 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (org.ovirt.thread.pool-8-thread-46) [] Correlation ID: 549545f2, Job ID: 3f97f737-d32a-4ffa-b213-122dcd3ca048, Call Stack: null, Custom Event ID: -1, Message: Migration failed (VM: cdTest02, Source: oVirt-01, Destination: oVirt-02). 2016-03-18 14:31:29,262 INFO [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (org.ovirt.thread.pool-8-thread-46) [] Lock freed to object 'EngineLock:{exclusiveLocks='[2be4938e-f4a3-4322-bae3-8a9628b81835= ]', sharedLocks='null'}' ___ Users mailing list Users@ovirt.org
Re: [ovirt-users] Diagnosing Migration Error
On Fri, Mar 18, 2016 at 9:27 PM, Nir Sofferwrote: > On Fri, Mar 18, 2016 at 7:59 PM, Charles Tassell > wrote: >> Hi Martin, >> >> Thanks, I checked that log and found the error " AttributeError: >> 'SourceThread' object has no attribute '_destServer'" which from what I see >> from my googling was supposedly fixed... I'm running >> vdsm-4.17.23-0.el7.centos.noarch which I believe is the latest. The full >> error is: >> >> Thread-2789::DEBUG::2016-03-18 >> 14:47:04,200::blockSD::422::Storage.Misc.excCmd::(getReadDelay) >> /usr/bin/taskset --cpu-list 0-23 /usr/bin/dd >> if=/dev/383a1e7d-8b86-4a50-8bc3-e4374c57a3e5/metadata iflag=direct >> of=/dev/null bs=4096 count=1 (cwd None) >> Thread-2789::DEBUG::2016-03-18 >> 14:47:04,211::blockSD::422::Storage.Misc.excCmd::(getReadDelay) SUCCESS: >> = '1+0 records in\n1+0 records out\n4096 bytes (4.1 kB) copied, >> 0.000377289 s, 10.9 MB/s\n'; = 0 >> mailbox.SPMMonitor::DEBUG::2016-03-18 >> 14:47:04,999::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) >> /usr/bin/taskset --cpu-list 0-23 dd >> if=/rhev/data-center/0001-0001-0001-0001-02e9/mastersd/dom_md/inbox >> iflag=direct,fullblock count=1 bs=1024000 (cwd None) >> mailbox.SPMMonitor::DEBUG::2016-03-18 >> 14:47:05,015::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail) >> SUCCESS: = '1+0 records in\n1+0 records out\n1024000 bytes (1.0 MB) >> copied, 0.0057206 s, 179 MB/s\n'; = 0 >> jsonrpc.Executor/0::DEBUG::2016-03-18 >> 14:47:06,551::__init__::503::jsonrpc.JsonRpcServer::(_serveRequest) Calling >> 'VM.migrate' in bridge with {u'params': {u'tunneled': u'false', >> u'autoConverge': u'false', u'src': u'ovirt-01.virt.roblib.upei.ca', u'dst': >> u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': >> u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', >> u'compressed': u'false', u'method': u'online'}, u'vmID': >> u'2be4938e-f4a3-4322-bae3-8a9628b81835'} >> jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,552::API::546::vds::(migrate) >> {u'tunneled': u'false', u'autoConverge': u'false', u'src': >> u'ovirt-01.virt.roblib.upei.ca', u'dst': >> u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId': >> u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true', >> u'compressed': u'false', u'method': u'online'} >> jsonrpc.Executor/0::DEBUG::2016-03-18 >> 14:47:06,555::__init__::206::jsonrpc.Notification::(emit) Sending event >> {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration >> Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": >> "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} >> jsonrpc.Executor/0::DEBUG::2016-03-18 >> 14:47:06,555::__init__::533::jsonrpc.JsonRpcServer::(_serveRequest) Return >> 'VM.migrate' in bridge with True >> Thread-41096::ERROR::2016-03-18 >> 14:47:06,559::migration::208::virt.vm::(_recover) >> vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::[Errno -2] Name or service not >> known >> Thread-41096::ERROR::2016-03-18 >> 14:47:06,559::migration::213::virt.vm::(_recover) >> vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to destroy remote VM >> Traceback (most recent call last): >> File "/usr/share/vdsm/virt/migration.py", line 211, in _recover >> self._destServer.destroy(self._vm.id) >> AttributeError: 'SourceThread' object has no attribute '_destServer' >> Thread-41096::DEBUG::2016-03-18 >> 14:47:06,560::__init__::206::jsonrpc.Notification::(emit) Sending event >> {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration >> Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method": >> "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"} >> Thread-41096::ERROR::2016-03-18 14:47:06,560::migration::309::virt.vm::(run) >> vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to migrate >> Traceback (most recent call last): >> File "/usr/share/vdsm/virt/migration.py", line 278, in run >> self._setupVdsConnection() >> File "/usr/share/vdsm/virt/migration.py", line 143, in _setupVdsConnection >> client = self._createClient(port) >> File "/usr/share/vdsm/virt/migration.py", line 130, in _createClient >> self.remoteHost, int(port), sslctx) >> File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 1269, in >> create_connected_socket >> sock.connect((host, port)) >> File "/usr/lib64/python2.7/site-packages/M2Crypto/SSL/Connection.py", line >> 181, in connect >> self.socket.connect(addr) >> File "/usr/lib64/python2.7/socket.py", line 224, in meth >> return getattr(self._sock,name)(*args) >> gaierror: [Errno -2] Name or service not known > > This looks like a bug in the current code (looking in 4.0 development code). Correction, this is fixed in 4.0, but broken in latest 3.6 (v4.17.24). This issue was fixed in https://gerrit.ovirt.org/52897, but not backported to 3.6. > > 1. We try to connect to the remote host: > 2. Connectioin fails with DNS resolving error > 3. self._destServer is not initialized