[ 
https://ovirt-jira.atlassian.net/browse/OVIRT-1763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=35342#comment-35342
 ] 

Evgheni Dereveanchin commented on OVIRT-1763:
---------------------------------------------

>From the provided job log I see an upgrade suite failure during add-host. 
In engine.log [1] there's a long update sequence at the end, that contains 570 
packages including the kernel, systemd, glibc, rpm and looks like a full "yum 
update" is being run on the hypervisor. Is this expected? Shouldn't we just 
install VDSM and friends?

On the host itself [2] I see the upgrade progressing normally, no severe 
hangups. So I cannot see any direct proof of lack of enropy causing this. On 
successful runs the "yum update" with 570 packages takes 5 minutes, not 15 so 
will continue investigating.

In general, the timeout is not happening on lago hosts themselves but in VMs 
where OST is running. Those should have 'haveged' installed and running inside 
to provide entropy. If they don't - it needs to be installed and running. 
[[email protected]] - could you please confirm if we have haveged in lago 
VMs?

[1] 
http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/3795/artifact/exported-artifacts/upgrade-from-release-suit-master-el7/test_logs/upgrade-from-release-suite-master/post-002_bootstrap.py/lago-upgrade-from-release-suite-master-engine/_var_log/ovirt-engine/engine.log
[2] 
http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/3795/artifact/exported-artifacts/upgrade-from-release-suit-master-el7/test_logs/upgrade-from-release-suite-master/post-002_bootstrap.py/lago-upgrade-from-release-suite-master-host0/_var_log/messages

> Increase entropy for hosts
> --------------------------
>
>                 Key: OVIRT-1763
>                 URL: https://ovirt-jira.atlassian.net/browse/OVIRT-1763
>             Project: oVirt - virtualization made easy
>          Issue Type: Bug
>            Reporter: Dafna Ron
>            Assignee: infra
>
> we had a failure in ost that was really hard to debug: 
> http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/3795/
> There are no failures in the logs and the test itself was terminated by a 
> timeout.
> It took the vms a long time to download packages and install and didi seems 
> to think that this is due to limited entropy on the physical host. 
> we need to review this issue and increase the entropy on the hosts. 



--
This message was sent by Atlassian Jira
(v1001.0.0-SNAPSHOT#100071)
_______________________________________________
Infra mailing list
[email protected]
http://lists.ovirt.org/mailman/listinfo/infra

Reply via email to