Hi Piotr,

Any update on this?

Thanks.
Dafna


On Mon, May 28, 2018 at 10:59 AM, Piotr Kliczewski <
[email protected]> wrote:

> On Mon, May 28, 2018 at 11:41 AM, Barak Korren <[email protected]> wrote:
> >
> >
> > On 28 May 2018 at 12:38, Piotr Kliczewski <[email protected]>
> > wrote:
> >>
> >> On Mon, May 28, 2018 at 10:57 AM, Barak Korren <[email protected]>
> wrote:
> >> > Note: we're now seeing a very similar issue in the 4.2 branch as well
> >> > that
> >> > seems to have been introduced by the following patch:
> >>
> >> Can you point to specific job so we could take a look at the logs?
> >
> >
> > Whoops, sorry, here:
> > http://jenkins.ovirt.org/job/ovirt-4.2_change-queue-tester/2034/
> >
>
> Looks like the same issue:
>
> 2018-05-28 03:41:03,606-04 ERROR
> [org.ovirt.engine.core.uutils.ssh.SSHDialog]
> (EE-ManagedThreadFactory-engine-Thread-1) [1244c90f] SSH error running
> command root@lago-upgrade-from-prevrelease-suite-4-2-host-0:'umask
> 0077; MYTMP="$(TMPDIR="${OVIRT_TMPDIR}" mktemp -d -t
> ovirt-XXXXXXXXXX)"; trap "chmod -R u+rwX \"${MYTMP}\" > /dev/null
> 2>&1; rm -fr \"${MYTMP}\" > /dev/null 2>&1" 0; tar
> --warning=no-timestamp -C "${MYTMP}" -x &&
> "${MYTMP}"/ovirt-host-deploy DIALOG/dialect=str:machine
> DIALOG/customization=bool:True': TimeLimitExceededException: SSH
> session timeout host
> 'root@lago-upgrade-from-prevrelease-suite-4-2-host-0'
> 2018-05-28 03:41:03,606-04 ERROR
> [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (VdsDeploy)
> [1244c90f] Error during deploy dialog
> 2018-05-28 03:41:03,611-04 ERROR
> [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase]
> (EE-ManagedThreadFactory-engine-Thread-1) [1244c90f] Timeout during
> host lago-upgrade-from-prevrelease-suite-4-2-host-0 install: SSH
> session timeout host
> 'root@lago-upgrade-from-prevrelease-suite-4-2-host-0'
>
> >>
> >>
> >> >
> >> > https://gerrit.ovirt.org/c/91638/2 - core: Enable only strong ciphers
> >> > for
> >> > 4.2 hosts
> >> >
> >> > On 28 May 2018 at 10:26, Barak Korren <[email protected]> wrote:
> >> >>
> >> >>
> >> >>
> >> >> On 28 May 2018 at 10:19, Martin Perina <[email protected]> wrote:
> >> >>>
> >> >>>
> >> >>>
> >> >>> On Mon, May 28, 2018 at 9:00 AM, Piotr Kliczewski
> >> >>> <[email protected]>
> >> >>> wrote:
> >> >>>>
> >> >>>> Simone,
> >> >>>>
> >> >>>> What do you think about this failure?
> >> >>>>
> >> >>>> Thanks,
> >> >>>> Piotr
> >> >>>>
> >> >>>> On Mon, May 28, 2018 at 7:12 AM, Barak Korren <[email protected]>
> >> >>>> wrote:
> >> >>>>>
> >> >>>>>
> >> >>>>>
> >> >>>>> On 27 May 2018 at 14:59, Piotr Kliczewski <[email protected]>
> >> >>>>> wrote:
> >> >>>>>>
> >> >>>>>> Martin,
> >> >>>>>>
> >> >>>>>> I only can see:
> >> >>>>>>
> >> >>>>>> 2018-05-25 13:57:44,255-04 ERROR
> >> >>>>>> [org.ovirt.engine.core.uutils.ssh.SSHDialog]
> >> >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [55a7b15b] SSH error
> >> >>>>>> running
> >> >>>>>> command root@lago-upgrade-from-release-suite-master-host-0:'
> umask
> >> >>>>>> 0077;
> >> >>>>>> MYTMP="$(TMPDIR="${OVIRT_TMPDIR}" mktemp -d -t
> ovirt-XXXXXXXXXX)";
> >> >>>>>> trap
> >> >>>>>> "chmod -R u+rwX \"${MYTMP}\" > /dev/null 2>&1; rm -fr
> \"${MYTMP}\"
> >> >>>>>> >
> >> >>>>>> /dev/null 2>&1" 0; tar --warning=no-timestamp -C "${MYTMP}" -x &&
> >> >>>>>> "${MYTMP}"/ovirt-host-deploy DIALOG/dialect=str:machine
> >> >>>>>> DIALOG/customization=bool:True': TimeLimitExceededException: SSH
> >> >>>>>> session
> >> >>>>>> timeout host 'root@lago-upgrade-from-
> release-suite-master-host-0'
> >> >>>>>> 2018-05-25 13:57:44,259-04 ERROR
> >> >>>>>> [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase]
> >> >>>>>> (EE-ManagedThreadFactory-engine-Thread-1) [55a7b15b] Timeout
> during
> >> >>>>>> host
> >> >>>>>> lago-upgrade-from-release-suite-master-host-0 install: SSH
> session
> >> >>>>>> timeout
> >> >>>>>> host 'root@lago-upgrade-from-release-suite-master-host-0'
> >> >>>>>>
> >> >>>>>> There are no additional logs. SSH to host timeout. Are we sure
> that
> >> >>>>>> it
> >> >>>>>> is an issue caused by Ravi's change?
> >> >>>>>
> >> >>>>>
> >> >>>>> We have some quite strong circumstantial evidence:
> >> >>>>> - Issue had affected all engine patches since that patch in a
> >> >>>>> similar
> >> >>>>> fashion.
> >> >>>>> - Prior engine patch [1] passed successfully [2]
> >> >>>>> - Other subsequent OST runs without engine patches passed
> >> >>>>> successfully
> >> >>>>> as well [3].
> >> >>>>>
> >> >>>>> [1]: https://gerrit.ovirt.org/c/91595/2
> >> >>>>> [2]:
> >> >>>>> http://jenkins.ovirt.org/job/ovirt-master_change-queue-
> tester/7777/
> >> >>>>> [3]:
> >> >>>>> http://jenkins.ovirt.org/job/ovirt-master_change-queue-
> tester/7778/
> >> >>>>>
> >> >>>>>
> >> >>>>> Please note - the issue is affecting a test that is run by an
> >> >>>>> upgrade
> >> >>>>> suit on the post-upgrade system. It has no affect on the basic
> suit.
> >> >>>>> So it
> >> >>>>> probably has to do with some behaviour that is specific to
> upgraded
> >> >>>>> systems.
> >> >>>
> >> >>>
> >> >>> I will try to reproduce later today in dev env, but I agree with
> >> >>> Piotr's
> >> >>> investigation, engine was not able to connect to the host using SSH
> >> >>> and
> >> >>> that's why no host-deploy logs were fetched.
> >> >>
> >> >>
> >> >> Lago fetches the logs from the host too (And it can take then from
> the
> >> >> VM
> >> >> image directly if the host is not responsive over SSH), can we get at
> >> >> the
> >> >> host-deploy logs that way?
> >> >>
> >> >>
> >> >>>>>
> >> >>>>>
> >> >>>>>
> >> >>>>>>
> >> >>>>>>
> >> >>>>>> Thanks,
> >> >>>>>> Piotr
> >> >>>>>>
> >> >>>>>> On Sun, May 27, 2018 at 11:21 AM, Martin Perina
> >> >>>>>> <[email protected]>
> >> >>>>>> wrote:
> >> >>>>>>>
> >> >>>>>>> Adding also Piotr to the thread
> >> >>>>>>>
> >> >>>>>>>
> >> >>>>>>> On Sun, 27 May 2018, 08:46 Barak Korren, <[email protected]>
> >> >>>>>>> wrote:
> >> >>>>>>>>
> >> >>>>>>>> Test failed: [ AddHost (in upgrade-from-release-suite) ]
> >> >>>>>>>>
> >> >>>>>>>> Link to suspected patches:
> >> >>>>>>>> https://gerrit.ovirt.org/#/c/91445/5 - Disable TLS versions <
> 1.2
> >> >>>>>>>> for hosts with cluster level>=4.1
> >> >>>>>>>>
> >> >>>>>>>> Link to Job:
> >> >>>>>>>>
> >> >>>>>>>> http://jenkins.ovirt.org/job/ovirt-master_change-queue-
> tester/7776/
> >> >>>>>>>>
> >> >>>>>>>> Link to all logs:
> >> >>>>>>>>
> >> >>>>>>>>
> >> >>>>>>>> http://jenkins.ovirt.org/job/ovirt-master_change-queue-
> tester/7776/artifact/exported-artifacts/upgrade-from-
> release-suit-master-el7/test_logs/upgrade-from-release-
> suite-master/post-002_bootstrap.py/
> >> >>>>>>>>
> >> >>>>>>>> Error snippet from log:
> >> >>>>>>>>
> >> >>>>>>>> From nosetst log:
> >> >>>>>>>> <error>
> >> >>>>>>>>
> >> >>>>>>>> AssertionError: False != True after 1200 seconds
> >> >>>>>>>>
> >> >>>>>>>> </error>
> >> >>>>>>>>
> >> >>>>>>>> Not finding a host deploy log in /var/log/ovirt-engine for some
> >> >>>>>>>> reason.
> >> >>>>>>>> This seems to have cause consistent failure in all other engine
> >> >>>>>>>> patches that followed it.
> >> >>>>>>>>
> >> >>>>>>>>
> >> >>>>>>>> --
> >> >>>>>>>> Barak Korren
> >> >>>>>>>> RHV DevOps team , RHCE, RHCi
> >> >>>>>>>> Red Hat EMEA
> >> >>>>>>>> redhat.com | TRIED. TESTED. TRUSTED. | redhat.com/trusted
> >> >>>>>>
> >> >>>>>>
> >> >>>>>
> >> >>>>>
> >> >>>>>
> >> >>>>> --
> >> >>>>> Barak Korren
> >> >>>>> RHV DevOps team , RHCE, RHCi
> >> >>>>> Red Hat EMEA
> >> >>>>> redhat.com | TRIED. TESTED. TRUSTED. | redhat.com/trusted
> >> >>>>
> >> >>>>
> >> >>>
> >> >>>
> >> >>>
> >> >>> --
> >> >>> Martin Perina
> >> >>> Associate Manager, Software Engineering
> >> >>> Red Hat Czech s.r.o.
> >> >>
> >> >>
> >> >>
> >> >>
> >> >> --
> >> >> Barak Korren
> >> >> RHV DevOps team , RHCE, RHCi
> >> >> Red Hat EMEA
> >> >> redhat.com | TRIED. TESTED. TRUSTED. | redhat.com/trusted
> >> >
> >> >
> >> >
> >> >
> >> > --
> >> > Barak Korren
> >> > RHV DevOps team , RHCE, RHCi
> >> > Red Hat EMEA
> >> > redhat.com | TRIED. TESTED. TRUSTED. | redhat.com/trusted
> >> >
> >> > _______________________________________________
> >> > Devel mailing list -- [email protected]
> >> > To unsubscribe send an email to [email protected]
> >> > Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> >> > oVirt Code of Conduct:
> >> > https://www.ovirt.org/community/about/community-guidelines/
> >> > List Archives:
> >> >
> >> > https://lists.ovirt.org/archives/list/[email protected]/message/
> QIZ5L4FKII7X5FHQ4OXBBR2SLUIK5C74/
> >> >
> >
> >
> >
> >
> > --
> > Barak Korren
> > RHV DevOps team , RHCE, RHCi
> > Red Hat EMEA
> > redhat.com | TRIED. TESTED. TRUSTED. | redhat.com/trusted
> _______________________________________________
> Devel mailing list -- [email protected]
> To unsubscribe send an email to [email protected]
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct: https://www.ovirt.org/community/about/community-
> guidelines/
> List Archives: https://lists.ovirt.org/archives/list/[email protected]/
> message/RDK42TYJKMX3M2DNUFKZO7CGNNOYWMJI/
>
_______________________________________________
Devel mailing list -- [email protected]
To unsubscribe send an email to [email protected]
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/[email protected]/message/PXT3KM543JNBZS72MA7QLDRYXYZBQZWU/

Reply via email to