[
https://ovirt-jira.atlassian.net/browse/OVIRT-609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17600#comment-17600
]
Evgheni Dereveanchin commented on OVIRT-609:
--------------------------------------------
Here are some relevant messages from engine.log:
{quote}
grep 1394b752 /var/log/ovirt-engine/engine.log
2016-06-23 09:06:34,099 INFO
[org.ovirt.engine.core.bll.CreateAllSnapshotsFromVmCommand]
(ajp--127.0.0.1-8702-1) [1394b752] Lock Acquired to object EngineLock
[exclusiveLocks= key: e7a7b735-0310-4f88-9ed9-4fed85835a01 value: VM
2016-06-23 09:06:35,708 INFO
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(org.ovirt.thread.pool-8-thread-15) Correlation ID: 1394b752, Job ID:
a8fab0bf-d45e-46eb-8314-e22db8e6a3f4, Call Stack: null, Custom Event ID: -1,
Message: Snapshot 'ngoldin_before_cluster_move' creation for VM
'jenkins-phx-ovirt-org' was initiated by admin.
2016-06-23 09:09:46,038 WARN
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(org.ovirt.thread.pool-8-thread-14) Correlation ID: 1394b752, Job ID:
a8fab0bf-d45e-46eb-8314-e22db8e6a3f4, Call Stack:
org.ovirt.engine.core.common.errors.VdcBLLException: VdcBLLException:
org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException:
java.util.concurrent.TimeoutException (Failed with error VDS_NETWORK_ERROR and
code 5022)
2016-06-23 09:09:47,859 ERROR
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(org.ovirt.thread.pool-8-thread-14) Correlation ID: 1394b752, Job ID:
a8fab0bf-d45e-46eb-8314-e22db8e6a3f4, Call Stack:
org.ovirt.engine.core.common.errors.VdcBLLException: VdcBLLException:
org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException:
java.util.concurrent.TimeoutException (Failed with error VDS_NETWORK_ERROR and
code 5022){quote}
Looks like VDSM was slow to respond (probably due to storage slowness) while
the snapshot is likely to have completed fine. I'll review host logs and share
my findings.
> Jenkins snapshot creation failed
> --------------------------------
>
> Key: OVIRT-609
> URL: https://ovirt-jira.atlassian.net/browse/OVIRT-609
> Project: oVirt - virtualization made easy
> Issue Type: Bug
> Reporter: Evgheni Dereveanchin
> Assignee: infra
>
> [[email protected]] issued a live snapshot creation on the Jenkins VM to
> prepare it for cluster move. This failed and it's not really clear why.
> Relevant event logs below, suggesting that the hypervisor started dumping VM
> memory to the snapshot which caused a storage slowdown.
> {quote}2016-Jun-23, 18:06 Snapshot 'ngoldin_before_cluster_move' creation for
> VM 'jenkins-phx-ovirt-org' was initiated by admin.
> 2016-Jun-23, 18:09 Failed to create live snapshot
> 'ngoldin_before_cluster_move' for VM 'jenkins-phx-ovirt-org'. VM restart is
> recommended. Note that using the created snapshot might cause data
> inconsistency.
> 2016-Jun-23, 18:13 Host ovirt-srv02 has network interface which exceeded the
> defined threshold [95%] (em1: transmit rate[100%], receive rate [0%])
> 2016-Jun-23, 18:13 Storage domain Production experienced a high latency of
> 18.7802 seconds from host ovirt-srv11. This may cause performance and
> functional issues. Please consult your Storage Administrator.{quote}
--
This message was sent by Atlassian JIRA
(v1000.98.4#100004)
_______________________________________________
Infra mailing list
[email protected]
http://lists.ovirt.org/mailman/listinfo/infra