[ovirt-users] Re: [Gluster-users] VMs paused - unknown storage error - Stale file handle - distribute 2 - replica 3 volume with sharding

2018-12-13 Thread Marco Lorenzo Crociani

Hi,
is there a way to recover file from "Stale file handle" errors?

Here some of the tests we have done:

- compared the extended attributes of all of the three replicas of the 
involved shard. Found identical attributes.


- compared SHA512 message digest of all of the three replicas of the 
involved shard. Found identical digests.


- tried to delete the shard from a replica set, one at a time, along 
with its hard link. Shard is always rebuilt correctly but error from 
client persists.


Regards,

--
Marco Crociani

Il 22/11/18 13:19, Marco Lorenzo Crociani ha scritto:

Hi,
I opened a bug on gluster because I have reading errors on files on a 
gluster volume:

https://bugzilla.redhat.com/show_bug.cgi?id=1652548

The files are many of the VMs images of the oVirt DATA storage domain. 
oVirt pause the vms because unknown storage errors.
It's impossibile to copy/clone, manage some snapshots of these vms. The 
errors on the low level are "stale file handle".

Volume is distribute 2 replicate 3 with sharding.

Should I open a bug also on oVirt?

Gluster 3.12.15-1.el7
oVirt 4.2.6.4-1.el7

Regards,



___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/QILZGHZP5LUKLB4EM54J5KXAAZQTPXLX/


[ovirt-users] VMs paused - unknown storage error - Stale file handle - distribute 2 - replica 3 volume with sharding

2018-11-22 Thread Marco Lorenzo Crociani

Hi,
I opened a bug on gluster because I have reading errors on files on a 
gluster volume:

https://bugzilla.redhat.com/show_bug.cgi?id=1652548

The files are many of the VMs images of the oVirt DATA storage domain. 
oVirt pause the vms because unknown storage errors.
It's impossibile to copy/clone, manage some snapshots of these vms. The 
errors on the low level are "stale file handle".

Volume is distribute 2 replicate 3 with sharding.

Should I open a bug also on oVirt?

Gluster 3.12.15-1.el7
oVirt 4.2.6.4-1.el7

Regards,

--
Marco Crociani
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/L3JGKXNC45STFUSMPFP7GI5PZ3RACQJY/


[ovirt-users] Re: VM stuck in paused mode with Cluster Compatibility Version 3.6 on 4.2 cluster

2018-11-19 Thread Marco Lorenzo Crociani

Hi, Darrell
thanks, it worked.

Regards,

Marco

On 20/09/2018 18:47, Darrell Budic wrote:
I had something similar happen while upgrading. Didn’t find a way to fix 
the configs on the fly, but was able to un-pause the VMs using virsh, 
then proceed to handle the ovirt portions. Probably work for you as well.




*From:* Marco Lorenzo Crociani <mailto:mar...@prismatelecomtesting.com>>
*Subject:* [ovirt-users] VM stuck in paused mode with Cluster 
Compatibility Version 3.6 on 4.2 cluster

*Date:* September 20, 2018 at 11:10:48 AM CDT
*To:* users

Hi,
we upgraded ovirt from version 4.1 to 4.2.6. Rebooted all vms.
We missed two vms that were at Cluster Compatibility Version 3.6.
There was a gluster/network IO problem and vms got paused. We were 
able to recover all the other vms from the paused state but we have 
two vms that won't run because:


"Cannot run VM. The Custom Compatibility Version of VM VM_NAME (3.6) 
is not supported in Data Center compatibility version 4.1."


Can we force the CCV of the paused vm to 4.1?

Regards,

--
Marco Crociani
___
Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
To unsubscribe send an email to users-le...@ovirt.org 
<mailto:users-le...@ovirt.org>

Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/5XH3H6ADEY3WFYNEVVREEGCA57NPDAQY/



___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/TL5ETCRU6L7DCGUJVKMBG2XHGCMR44S4/


[ovirt-users] GetGlusterLocalLogicalVolumeListVDSCommand execution failed: null

2018-10-25 Thread Marco Lorenzo Crociani

Hi,
I'm updating ovirt to 4.2.6.4-1.el7

Engine is updated
Compute nodes are updated
Storage nodes are not yet updated because ovirt 4.2.6.4-1.el7 depends on 
gluster 3.12 while now I have gluster 3.10.


Compatibility Versions:
Compute datacenter 4.2
Storage 4.1 (because I don't have yet updated the storage cluster)
so Data Centers is still 4.1

I was about to upgrade the storage nodes when I noticed that 
/var/log/ovirt-engine/engine.log is flooded with errors like:


2018-10-25 19:36:17,164+02 ERROR 
[org.ovirt.engine.core.vdsbroker.gluster.GetGlusterLocalLogicalVolumeListVDSCommand] 
(DefaultQuartzScheduler2) [55e89794] Command 
'GetGlusterLocalLogicalVolumeListVDSCommand(HostName = s23, 
VdsIdVDSCommandParametersBase:{hostId='84a33357-1d04-44db-b0e3-4638ebc39d6c'})' 
execution failed: null


6 errors (one for each storage server) every 6 seconds.

How could I fix them?
Can I continue to upgrade the storage nodes?

Best regards,

--
Marco Crociani
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/EOAR2VMN44I3CGOHGWIQYRAOAUVYZUNI/


[ovirt-users] VM stuck in paused mode with Cluster Compatibility Version 3.6 on 4.2 cluster

2018-09-20 Thread Marco Lorenzo Crociani

Hi,
we upgraded ovirt from version 4.1 to 4.2.6. Rebooted all vms.
We missed two vms that were at Cluster Compatibility Version 3.6.
There was a gluster/network IO problem and vms got paused. We were able 
to recover all the other vms from the paused state but we have two vms 
that won't run because:


"Cannot run VM. The Custom Compatibility Version of VM VM_NAME (3.6) is 
not supported in Data Center compatibility version 4.1."


Can we force the CCV of the paused vm to 4.1?

Regards,

--
Marco Crociani
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/5XH3H6ADEY3WFYNEVVREEGCA57NPDAQY/


Re: [ovirt-users] ovirt 4.1 - skylake - no avx512 support in virtual machines

2018-02-28 Thread Marco Lorenzo Crociani

Skylake-Client does _not_ have AVX512 (I tried now on a Kaby Lake Core
i7 laptop).  Only Skylake-Server has it and it will be in RHEL 7.5.

Thanks,

Paolo



Ok, we'll stay with pass-through until RHEL 7.5.
Thanks,

--
Marco Crociani
Prisma Telecom Testing S.r.l.
via Petrocchi, 4  20127 MILANO  ITALY
Phone:  +39 02 26113507
Fax:  +39 02 26113597
e-mail:  mar...@prismatelecomtesting.com
web:  http://www.prismatelecomtesting.com
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] ovirt 4.1 - skylake - no avx512 support in virtual machines

2018-02-26 Thread Marco Lorenzo Crociani

Hi,
I can't access avx512* instruction set from virtual machines.
I have made a one server compute cluster to test new hardware:

oVirt 4.1.9
CentOS 7
Cluster CPU Type: Intel Skylake Family
Compatibility Version: 4.1

HOST:
CPU Model Name:Intel(R) Xeon(R) Platinum 8180 CPU @ 2.50GHz
Family: Family
CPU Type: Intel Skylake Family

Virtual Machine settings
A) VM:
Custom CPU Type: Use cluster default(Intel Skylake Family)
General Tab shows: Guest CPU Type: Skylake-Client

avx512: NO

B) VM:
Custom CPU Type: Skylake-Client
General Tab shows: Guest CPU Type: Skylake-Client

avx512: NO

C) VM:
Custom CPU Type: Use cluster default(Intel Skylake Family) [grey - 
cannot modify]

Migration mode: Do not allow migration
Pass-Through Host CPU
General Tab shows: Guest CPU Type: Skylake-Client

avx512: YES   ( cat /proc/cpuinfo  |grep avx512: avx512f avx512dq 
avx512cd avx512bw avx512vl )


Using pass-through host cpu (disabling vm migration) is the only way to 
access avx512 in a VM, is it a bug or am I missing something?


Regards,

--
Marco Crociani
Prisma Telecom Testing S.r.l.
via Petrocchi, 4  20127 MILANO  ITALY
Phone:  +39 02 26113507
Fax:  +39 02 26113597
e-mail:  mar...@prismatelecomtesting.com
web:  http://www.prismatelecomtesting.com
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users