Usually libvirt's log might provide hints (yet , no clues) of any issues.

For example: 
/var/log/libvirt/qemu/<VM_NAME>.log

Anything changed recently (maybe oVirt version was increased) ?

Best Regards,
Strahil Nikolov






В понеделник, 21 септември 2020 г., 23:28:13 Гринуич+3, Vinícius Ferrão 
<fer...@versatushpc.com.br> написа: 





Hi Strahil, 



Both disks are VirtIO-SCSI and are Preallocated:














Thanks,








>  
> On 21 Sep 2020, at 17:09, Strahil Nikolov <hunter86...@yahoo.com> wrote:
> 
> 
>  
> What type of disks are you using ? Any change you use thin disks ?
> 
> Best Regards,
> Strahil Nikolov
> 
> 
> 
> 
> 
> 
> В понеделник, 21 септември 2020 г., 07:20:23 Гринуич+3, Vinícius Ferrão via 
> Users <users@ovirt.org> написа: 
> 
> 
> 
> 
> 
> Hi, sorry to bump the thread.
> 
> But I still with this issue on the VM. This crashes are still happening, and 
> I really don’t know what to do. Since there’s nothing on logs, except from 
> that message on `dmesg` of the host machine I started changing setting to see 
> if anything changes or if I at least I get a pattern.
> 
> What I’ve tried:
> 1. Disabled I/O Threading on VM.
> 2. Increased I/O Threading to 2 form 1.
> 3. Disabled Memory Balooning.
> 4. Reduced VM resources form 10 CPU’s and 48GB of RAM to 6 CPU’s and 24GB of 
> RAM.
> 5. Moved the VM to another host.
> 6. Dedicated a host specific to this VM.
> 7. Check on the storage system to see if there’s any resource starvation, but 
> everything seems to be fine.
> 8. Checked both iSCSI switches to see if there’s something wrong with the 
> fabrics: 0 errors.
> 
> I’m really running out of ideas. The VM was working normally and suddenly 
> this started.
> 
> Thanks,
> 
> PS: When I was typing this message it crashed again:
> 
> [427483.126725] *** Guest State ***
> [427483.127661] CR0: actual=0x0000000000050032, shadow=0x0000000000050032, 
> gh_mask=fffffffffffffff7
> [427483.128505] CR4: actual=0x0000000000002050, shadow=0x0000000000000000, 
> gh_mask=fffffffffffff871
> [427483.129342] CR3 = 0x00000001849ff002
> [427483.130177] RSP = 0xffffb10186ffffb0  RIP = 0x0000000000008000
> [427483.131014] RFLAGS=0x00000002        DR7 = 0x0000000000000400
> [427483.131859] Sysenter RSP=0000000000000000 CS:RIP=0000:0000000000000000
> [427483.132708] CS:  sel=0x9b00, attr=0x08093, limit=0xffffffff, 
> base=0x000000007ff9b000
> [427483.133559] DS:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
> base=0x0000000000000000
> [427483.134413] SS:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
> base=0x0000000000000000
> [427483.135237] ES:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
> base=0x0000000000000000
> [427483.136040] FS:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
> base=0x0000000000000000
> [427483.136842] GS:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
> base=0x0000000000000000
> [427483.137629] GDTR:                          limit=0x00000057, 
> base=0xffffb10186eb4fb0
> [427483.138409] LDTR: sel=0x0000, attr=0x10000, limit=0x000fffff, 
> base=0x0000000000000000
> [427483.139202] IDTR:                          limit=0x00000000, 
> base=0x0000000000000000
> [427483.139998] TR:  sel=0x0040, attr=0x0008b, limit=0x00000067, 
> base=0xffffb10186eb3000
> [427483.140816] EFER =    0x0000000000000000  PAT = 0x0007010600070106
> [427483.141650] DebugCtl = 0x0000000000000000  DebugExceptions = 
> 0x0000000000000000
> [427483.142503] Interruptibility = 00000009  ActivityState = 00000000
> [427483.143353] *** Host State ***
> [427483.144194] RIP = 0xffffffffc0c65024  RSP = 0xffff9253c0b9bc90
> [427483.145043] CS=0010 SS=0018 DS=0000 ES=0000 FS=0000 GS=0000 TR=0040
> [427483.145903] FSBase=00007fcc13816700 GSBase=ffff925adf240000 
> TRBase=ffff925adf244000
> [427483.146766] GDTBase=ffff925adf24c000 IDTBase=ffffffffff528000
> [427483.147630] CR0=0000000080050033 CR3=00000010597b6000 CR4=00000000001627e0
> [427483.148498] Sysenter RSP=0000000000000000 CS:RIP=0010:ffffffff8f196cc0
> [427483.149365] EFER = 0x0000000000000d01  PAT = 0x0007050600070106
> [427483.150231] *** Control State ***
> [427483.151077] PinBased=0000003f CPUBased=b6a1edfa SecondaryExec=00000ceb
> [427483.151942] EntryControls=0000d1ff ExitControls=002fefff
> [427483.152800] ExceptionBitmap=00060042 PFECmask=00000000 PFECmatch=00000000
> [427483.153661] VMEntry: intr_info=00000000 errcode=00000006 ilen=00000000
> [427483.154521] VMExit: intr_info=00000000 errcode=00000000 ilen=00000004
> [427483.155376]        reason=80000021 qualification=0000000000000000
> [427483.156230] IDTVectoring: info=00000000 errcode=00000000
> [427483.157068] TSC Offset = 0xfffccfc261506dd9
> [427483.157905] TPR Threshold = 0x0d
> [427483.158728] EPT pointer = 0x00000009b437701e
> [427483.159550] PLE Gap=00000080 Window=00080000
> [427483.160370] Virtual processor ID = 0x0004
> 
> 
> 
>> On 16 Sep 2020, at 17:11, Vinícius Ferrão <fer...@versatushpc.com.br> wrote:
>> 
>> Hello,
>> 
>> I’m an Exchange Server VM that’s going down to suspend without possibility 
>> of recovery. I need to click on shutdown and them power on. I can’t find 
>> anything useful on the logs, except on “dmesg” of the host:
>> 
>> [47807.747606] *** Guest State ***
>> [47807.747633] CR0: actual=0x0000000000050032, shadow=0x0000000000050032, 
>> gh_mask=fffffffffffffff7
>> [47807.747671] CR4: actual=0x0000000000002050, shadow=0x0000000000000000, 
>> gh_mask=fffffffffffff871
>> [47807.747721] CR3 = 0x00000000001ad002
>> [47807.747739] RSP = 0xffffc20904fa3770  RIP = 0x0000000000008000
>> [47807.747766] RFLAGS=0x00000002        DR7 = 0x0000000000000400
>> [47807.747792] Sysenter RSP=0000000000000000 CS:RIP=0000:0000000000000000
>> [47807.747821] CS:  sel=0x9100, attr=0x08093, limit=0xffffffff, 
>> base=0x000000007ff91000
>> [47807.747855] DS:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
>> base=0x0000000000000000
>> [47807.747889] SS:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
>> base=0x0000000000000000
>> [47807.747923] ES:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
>> base=0x0000000000000000
>> [47807.747957] FS:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
>> base=0x0000000000000000
>> [47807.747991] GS:  sel=0x0000, attr=0x08093, limit=0xffffffff, 
>> base=0x0000000000000000
>> [47807.748025] GDTR:                          limit=0x00000057, 
>> base=0xffff80817e7d5fb0
>> [47807.748059] LDTR: sel=0x0000, attr=0x10000, limit=0x000fffff, 
>> base=0x0000000000000000
>> [47807.748093] IDTR:                          limit=0x00000000, 
>> base=0x0000000000000000
>> [47807.748128] TR:  sel=0x0040, attr=0x0008b, limit=0x00000067, 
>> base=0xffff80817e7d4000
>> [47807.748162] EFER =    0x0000000000000000  PAT = 0x0007010600070106
>> [47807.748189] DebugCtl = 0x0000000000000000  DebugExceptions = 
>> 0x0000000000000000
>> [47807.748221] Interruptibility = 00000009  ActivityState = 00000000
>> [47807.748248] *** Host State ***
>> [47807.748263] RIP = 0xffffffffc0c65024  RSP = 0xffff9252bda5fc90
>> [47807.748290] CS=0010 SS=0018 DS=0000 ES=0000 FS=0000 GS=0000 TR=0040
>> [47807.748318] FSBase=00007f46d462a700 GSBase=ffff9252ffac0000 
>> TRBase=ffff9252ffac4000
>> [47807.748351] GDTBase=ffff9252ffacc000 IDTBase=ffffffffff528000
>> [47807.748377] CR0=0000000080050033 CR3=000000105ac8c000 CR4=00000000001627e0
>> [47807.748407] Sysenter RSP=0000000000000000 CS:RIP=0010:ffffffff8f196cc0
>> [47807.748435] EFER = 0x0000000000000d01  PAT = 0x0007050600070106
>> [47807.748461] *** Control State ***
>> [47807.748478] PinBased=0000003f CPUBased=b6a1edfa SecondaryExec=00000ceb
>> [47807.748507] EntryControls=0000d1ff ExitControls=002fefff
>> [47807.748531] ExceptionBitmap=00060042 PFECmask=00000000 PFECmatch=00000000
>> [47807.748561] VMEntry: intr_info=00000000 errcode=00000006 ilen=00000000
>> [47807.748589] VMExit: intr_info=00000000 errcode=00000000 ilen=00000001
>> [47807.748618]        reason=80000021 qualification=0000000000000000
>> [47807.748645] IDTVectoring: info=00000000 errcode=00000000
>> [47807.748669] TSC Offset = 0xfffff9b8c8d943b6
>> [47807.748699] TPR Threshold = 0x00
>> [47807.748715] EPT pointer = 0x000000105cd5601e
>> [47807.748735] PLE Gap=00000080 Window=00001000
>> [47807.748755] Virtual processor ID = 0x0003
>> 
>> So something really went crazy. The VM is going down at least two times a 
>> day for the last 5 days.
>> 
>> At first I thought it would be an hardware issue, so I restarted the VM on 
>> other host, and the same thing happened.
>> 
>> About the VM it’s configured with 10 CPUs, 48GB of RAM running oVirt 4.3.10 
>> with iSCSI storage to a FreeNAS box, where the VM disks are running; there 
>> are a 300GB disc for C:\ and 2TB disk for D:\.
>> 
>> Any ideia on how to start troubleshooting it?
>> 
>> Thanks,
>> 
>> 
> 
> _______________________________________________
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct:  
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/OIAIVV6I2MUVPVR4FBJQTZW5OL4MNS5Q/
> 
> 
> 
> 



_______________________________________________
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CKTQPJHR4T7FEI5TAK4P3G2PVP44X6VJ/

Reply via email to