Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Piotr Kliczewski
On Mon, Oct 16, 2017 at 4:51 PM, Erekle Magradze wrote: > That's the problem, nobody restarted the server at that time. Please provide the engine log from this time so we can see whether it was triggered by it. > > Is there any scenario when the hypervisor is

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Erekle Magradze
That's the problem, nobody restarted the server at that time. Is there any scenario in which the hypervisor is restarted by the engine? Cheers Erekle On 10/16/2017 04:45 PM, Piotr Kliczewski wrote: Erekle, For the time period you mentioned I do not see anything wrong on vdsm side except for a

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Piotr Kliczewski
Erekle, For the time period you mentioned I do not see anything wrong on the vdsm side except for a restart at 2017-10-15 16:28:50,993+0200. It looks like a manual restart. The engine log starts at 2017-10-16 03:49:04,092+02, so I am not able to say whether there was anything else except for the heartbeat issue

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Erekle Magradze
Hi Piotr, I've restarted the vdsm daemon on certain nodes several times; that could be the reason. The failure I mentioned happened yesterday between 15:00 and 17:00. Cheers Erekle On 10/16/2017 04:13 PM, Piotr Kliczewski wrote: Erekle, In the logs you provided I see: IOError: [Errno 5]

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Piotr Kliczewski
Erekle, In the logs you provided I see: IOError: [Errno 5] _handleRequests._checkForMail - Could not read mailbox: /rhev/data-center/6d52512e-1c02-4509-880a-bf57cbad4bdf/mastersd/dom_md/inbox and StorageDomainMasterError: Error validating master storage domain: ('MD read error',) which seems

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Dafna Ron
Hi, Can you please tell us what issue you are actually facing? :) It would be easier to debug an issue than an error message that can be caused by several things. Also, can you provide the engine and the vdsm logs? Thank you, Dafna On 10/16/2017 02:30 PM, Erekle Magradze wrote:

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Erekle Magradze
There was a typo in the failure message; this is what I was getting: *VDSM hostname command GetStatsVDS failed: Connection reset by peer* On 10/16/2017 03:21 PM, Erekle Magradze wrote: Hi, It's getting clear now, indeed momd service is disabled ● momd.service - Memory Overcommitment Manager

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Erekle Magradze
Hi, It's getting clearer now; indeed the momd service is disabled ● momd.service - Memory Overcommitment Manager Daemon Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled) Active: inactive (dead) mom-vdsm is enabled and running. ● mom-vdsm.service - MOM

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Martin Sivak
Hi, How do you start MOM? MOM is supposed to talk to vdsm; we do not talk to libvirt directly. The line you posted comes from vdsm, and vdsm is telling you it can't talk to MOM. Which MOM service is enabled? There are two, momd and mom-vdsm, and the second one is the one that should be
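The question of which MOM unit is enabled can be answered from the `Loaded:` line of `systemctl status` output. The following is only a minimal sketch, assuming the status format quoted elsewhere in this thread; the function name `unit_file_state` is hypothetical and is not part of vdsm or MOM.

```python
import re

# Matches e.g.
# "Loaded: loaded (/usr/lib/systemd/system/momd.service; static; vendor preset: disabled)"
# and captures the second field inside the parentheses (the unit file state).
LOADED_RE = re.compile(r"Loaded:\s+\S+\s+\((?P<path>[^;]+);\s*(?P<state>[^;)]+)")


def unit_file_state(status_text):
    """Return the unit file state (e.g. 'enabled', 'static'), or None if absent."""
    match = LOADED_RE.search(status_text)
    return match.group("state").strip() if match else None


# Sample text taken from the status output posted in this thread.
sample = (
    "● momd.service - Memory Overcommitment Manager Daemon "
    "Loaded: loaded (/usr/lib/systemd/system/momd.service; static; "
    "vendor preset: disabled) Active: inactive (dead)"
)
print(unit_file_state(sample))
```

On a live host, `systemctl is-enabled mom-vdsm` reports the same state directly, without any parsing.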

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Erekle Magradze
Hi Martin, Thanks for the answer. Unfortunately this warning message persists. Does it mean that mom cannot communicate with libvirt? How critical is it? Best Erekle On 10/16/2017 03:03 PM, Martin Sivak wrote: Hi, it is just a warning, there is nothing you have to solve unless it does

Re: [ovirt-users] MoM is failing!!!

2017-10-16 Thread Martin Sivak
Hi, It is just a warning; there is nothing you have to solve unless it does not resolve itself within a minute or so. If it happens only once or twice after a vdsm or mom restart, then you are fine. Best regards -- Martin Sivak SLA / oVirt On Mon, Oct 16, 2017 at 2:44 PM, Erekle Magradze
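The rule of thumb above (the warning is harmless unless it keeps appearing for more than a minute or so) can be checked against journal output. This is a rough sketch, not anything vdsm ships: the helper names are hypothetical, the 60-second threshold is one reading of "a minute or so", the year has to be supplied because syslog-style timestamps omit it, and month names are assumed to be in English.

```python
from datetime import datetime

WARNING = "MOM not available"


def warning_span_seconds(journal_lines, year=2017):
    """Seconds between the first and last 'MOM not available' warning, or None."""
    stamps = []
    for line in journal_lines:
        if WARNING in line:
            # Syslog-style prefix, e.g. "Oct 16 14:26:52"; the year is passed in.
            stamp = " ".join(line.split()[:3])
            stamps.append(datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S"))
    if not stamps:
        return None
    return (max(stamps) - min(stamps)).total_seconds()


def is_persistent(journal_lines, threshold_seconds=60):
    """True if the warnings span more than threshold_seconds."""
    span = warning_span_seconds(journal_lines)
    return span is not None and span > threshold_seconds


# The two warning lines from the original report in this thread.
sample = [
    "Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available.",
    "Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, "
    "KSM stats will be missing.",
]
print(is_persistent(sample))  # False: both warnings share one timestamp
```

Feeding it a longer journal excerpt shows whether the warnings stopped shortly after a restart or kept recurring.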

[ovirt-users] MoM is failing!!!

2017-10-16 Thread Erekle Magradze
Hi, After running systemctl status vdsm I see that it's running, with these messages at the end: Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available. Oct 16 14:26:52 hostname vdsmd[2392]: vdsm throttled WARN MOM not available, KSM stats will be missing. Oct 16