[ovirt-users] guest pxe boot fails
Hi,

Somehow guest PXE booting has stopped working for me. PXE boot fails while getting an IP via DHCP with the error 'connection timed out', and I cannot find any reason why.

In the first few seconds of the VM's life it actually manages to send some network packets: I can see the DHCP request on the DHCP server, and I even see a DHCP reply on both the host's network interface and the VM's vnetX interface. However, gPXE still says "connection timeout". During this initial period I can also see some network traffic via tcpdump on the vnetX interface (some ARPs, multicasts, etc. from other VMs). But after a while all activity on the vnetX interface ceases: no packets at all, no DHCP requests from the VM, no replies, no ARP, nothing.

What also caught my eye: 'brctl showmacs ' doesn't show the VM's MAC at all; it lists it only after the OS is booted. The behaviour is the same with any NIC driver (virtio/e1000/rtl).

However, once the VM is booted (if cloned, imported via glance, or installed from CD), the network works just fine, with no problems with DHCP/ARP or anything else.

I tried to re-create the VM, to add/remove/change the VM's NICs, to change the MAC, to restart vdsm and engine, and to start the VM on different hosts; nothing helps. The only recent change in my environment was the update ovirt-3.4.3 -> ovirt-3.4.4.

Please tell me, what went wrong?

ovirt 3.4.4-el6, hosts on centos 6.6, guests - centos 6.6

--
Yuriy Demchenko
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
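The 'brctl showmacs' observation above can be checked repeatedly while the guest sits in its PXE stage. What follows is only a minimal sketch in Python, assuming the usual four-column output of bridge-utils (port number, MAC, "is local?", ageing timer). The helper names, the sample table and the MAC addresses in it are made up for illustration and do not come from the report.

```python
import subprocess


def parse_showmacs(output):
    """Parse `brctl showmacs <bridge>` text into a list of dict entries."""
    entries = []
    for line in output.splitlines():
        fields = line.split()
        # Data rows have exactly four fields and start with a numeric port
        # number; this also skips the "port no  mac addr ..." header line.
        if len(fields) != 4 or not fields[0].isdigit():
            continue
        port, mac, is_local, ageing = fields
        entries.append({
            "port": int(port),
            "mac": mac.lower(),
            "local": is_local == "yes",
            "ageing": float(ageing),
        })
    return entries


def bridge_knows_mac(output, mac):
    """Return True if the bridge forwarding table lists the given MAC."""
    return any(entry["mac"] == mac.lower() for entry in parse_showmacs(output))


def read_showmacs(bridge):
    """Run brctl on a live host (needs bridge-utils; not called below)."""
    return subprocess.check_output(["brctl", "showmacs", bridge], text=True)


# Made-up sample table, for illustration only.
SAMPLE = (
    "port no\tmac addr\t\tis local?\tageing timer\n"
    "  1\t00:25:90:aa:bb:cc\tyes\t\t   0.00\n"
    "  2\t00:1a:4a:16:01:51\tno\t\t  12.34\n"
)

print(bridge_knows_mac(SAMPLE, "00:1A:4A:16:01:51"))
```

Calling bridge_knows_mac(read_showmacs(bridge), mac) in a loop during the first seconds of the VM's life would show whether the bridge ever learns the guest's MAC before gPXE gives up.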
Re: [ovirt-users] Proper way to change and persist vdsm configuration options
Hi,

sorry for digging up an old thread, but I have a problem with the proposed way to preserve vdsm.conf between host redeployments.

I've created a file /etc/ovirt-host-deploy.conf.d/migration-bw.conf on ovirt-engine with the contents:

    [environment:enforce]
    VDSM_CONFIG/vars/migration_max_bandwidth=str:300

Then I tried to add a new host via the GUI. It was added, but with the default vdsm.conf, i.e. without my custom option "migration_max_bandwidth". I also tried restarting ovirt-engine and reinstalling the host, with the same result: no custom options, only the default vdsm.conf content.

Am I missing something?

Freshly installed ovirt 3.5-pre, centos 6.5 as engine, centos7 as hosts

Yuriy Demchenko

On 08/05/2014 10:45 PM, Alon Bar-Lev wrote:
> - Original Message -
> From: "Trey Dockendorf"
> To: "ybronhei"
> Cc: "users", "Fabian Deutsch", "Dan Kenigsberg", "Itamar Heim", "Douglas Landgraf", "Alon Bar-Lev"
> Sent: Tuesday, August 5, 2014 9:36:24 PM
> Subject: Re: [ovirt-users] Proper way to change and persist vdsm configuration options
>
>> On Tue, Aug 5, 2014 at 12:32 PM, ybronhei wrote:
>>> Hey,
>>>
>>> Just noticed something that I forgot about.. before filing new BZ, see in ovirt-host-deploy README.environment [1] the section:
>>>
>>> VDSM/configOverride(bool) [True]
>>> Override vdsm configuration file.
>>>
>>> changing it to false will keep your vdsm.conf file as is after deploying the host again (what happens after node upgrade)
>>>
>>> [1] https://github.com/oVirt/ovirt-host-deploy/blob/master/README.environment
>>>
>>> please check if that is what you meant..
>>>
>>> Thanks,
>>> Yaniv Bronhaim.
>>
>> I was unaware of that package. I will check that out as that seems to be what I am looking for.
>>
>> I have not filed this in BZ and will hold off pending ovirt-host-deploy. If you feel a BZ is still necessary then please do file one and I would be happy to provide input if it would help.
>>
>> Right now this is my workflow.
>>
>> 1. Foreman provisions bare-metal server with CentOS 6.5
>> 2. Once provisioned and system rebooted Puppet applies puppet-ovirt [1] module that adds the necessary yum repos
>
> and should stop here..
>
>> , and installs packages. Part of my Puppet deployment is basic things like sudo management (vdsm's sudo is accounted for), sssd configuration, and other aspects that are needed by every system in my infrastructure. Part of the ovirt::node Puppet class is managing vdsm.conf, and in my case that means ensuring iSER is enabled for iSCSI over IB.
>
> you can create a file /etc/ovirt-host-deploy.conf.d/40-xxx.conf
> ---
> VDSM_CONFIG/section/key=str:content
> ---
>
> this will create a proper vdsm.conf when host-deploy is initiated.
>
> you should now use the rest api to initiate host-deploy.
>
>> 3. Once host is online and has had the full Puppet catalog applied I log into ovirt-engine web interface and add the host (pulling its data via the Foreman provider).
>
> right, but you should let this process install packages and manage configuration.
>
>> What I've noticed is that after step #3, after a host is added by ovirt-engine, the vdsm.conf file is reset to default and I have to reapply Puppet before it can be used, as one of my Data Storage Domains requires iSER (not available over TCP).
>
> right, see above.
>
>> What would be the workflow using ovirt-host-deploy? Thus far I've had to piece together my workflow based on the documentation and filling in blanks where possible since I do require customizations to vdsm.conf and the documented workflow of adding a host via web UI does not allow for such customization.
>>
>> Thanks,
>> - Trey
>>
>> [1] - https://github.com/treydock/puppet-ovirt (README not fully updated as still working out how to use Puppet with oVirt)
>>
>>> On 08/05/2014 08:12 AM, Trey Dockendorf wrote:
>>>> I'll file BZ. As far as I can recall this has been an issue since 3.3.x as I have been using Puppet to modify values and have had to rerun Puppet after installing a node via GUI and when performing update from GUI. Given that it has occurred when VDSM version didn't change on the node it seems likely to be something being done by Python code that bootstraps a node and performs the other tasks. I won't have any systems available to test with for a few days. New hardware specifically for our oVirt deployment is on order so should be able to more thoroughly debug and capture logs at that time.
>>>>
>>>> Would using vdsm-reg be a better solution for adding new nodes? I only tried using vdsm-reg once and it went very poorly... lots of missing dependencies not pulled in from yum install that I had to install manually via yum. Then the node was auto added to newest cluster with no ability to change the cluster. Be happy to debug that too if there's some docs that outline the expected behavior.
>>>>
>>>> Using vdsm-reg or something similar seems like a better fit for puppet deployed nodes, as opposed to requiring GUI steps to add the node.
>>>>
>>>> Thanks
>>>> - Trey
>>>>
>>>> On Aug 4, 2014 5:53 A
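For anyone generating such a drop-in from a configuration management tool, here is a minimal sketch in Python of rendering the file content in the form quoted in this thread (an [environment:enforce] section followed by VDSM_CONFIG/section/key=str:content lines). The function name and the dict layout are assumptions made for illustration. Whether host-deploy actually applies the file is exactly the open question of this thread, so the sketch only reproduces the format and does not claim a working setup.

```python
def render_host_deploy_conf(options, section="environment:enforce"):
    """Render ovirt-host-deploy drop-in text from {(vdsm_section, key): value}.

    Only the "VDSM_CONFIG/<section>/<key>=str:<value>" form quoted in the
    thread is produced; other value types are outside this sketch.
    """
    lines = ["[%s]" % section]
    for (vdsm_section, key), value in sorted(options.items()):
        lines.append("VDSM_CONFIG/%s/%s=str:%s" % (vdsm_section, key, value))
    return "\n".join(lines) + "\n"


# The single option from the message above.
conf_text = render_host_deploy_conf({("vars", "migration_max_bandwidth"): "300"})
print(conf_text, end="")
```

The rendered text would then be written to a file under /etc/ovirt-host-deploy.conf.d/ on the engine machine, as described in the message above.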
[ovirt-users] [hosted-engine] cluster network setup deadlock
Hi,

I'm testing hosted-engine in my lab and have some issues with the network configuration of my cluster.

I have 2 hosts + storage (iscsi/nfs), each with 2 NICs. My aim is a highly available platform, so it must tolerate the loss of one NIC; thus I want to use bonding + VLANs, with a scheme like this: a bond device used as the ovirtmgmt network (mgmt/display/migration, not a VM network) with no VLAN tagging, and several VLAN-tagged networks for VMs and the storage connection on top of it. I use that kind of scheme all the time with non-self-hosted oVirt, as it provides reliability, requires only 2 NICs, allows PXE kickstarting and requires no switch reconfiguration during or after host setup.

However, with the self-hosted engine I'm stuck: at the deploy stage the installer sets up the engine VM in the ovirtmgmt network, making it a VM network, and I can't change that afterwards. When I uncheck "vm network" for ovirtmgmt in the web interface, the change cannot be applied on the host that is running the engine VM (OK, that's expected). But if I set up another host with the desired network configuration (i.e. ovirtmgmt not a VM network, plus separate VLAN-tagged networks for VMs), the engine VM cannot migrate to it or start on it, as that host lacks the "ovirtmgmt" bridge interface.

So, is there any way to change the engine VM's network settings to run it on a different bridge, not ovirtmgmt?

--
Yuriy Demchenko
Re: [ovirt-users] power outage: HA vms not restarted
Hi,

sorry for the delay; I guess I'll plan to upgrade to 3.4 soon.
Eli, Artyom, Omer - big thanks for your valuable help, it was important to me to understand what went wrong in that incident.

Yuriy Demchenko

On 05/19/2014 06:26 PM, Artyom Lukianov wrote:
> Bug already fixed in 3.3 https://bugzilla.redhat.com/show_bug.cgi?id=1074478 and 3.4 https://bugzilla.redhat.com/show_bug.cgi?id=1078553
> Thanks.
>
> - Original Message -
> From: "Yuriy Demchenko"
> To: "Eli Mesika"
> Cc: users@ovirt.org
> Sent: Monday, May 19, 2014 4:29:54 PM
> Subject: Re: [ovirt-users] power outage: HA vms not restarted
>
>> On 05/19/2014 05:13 PM, Eli Mesika wrote:
>>> - Original Message -
>>> From: "Yuriy Demchenko"
>>> To: "Eli Mesika"
>>> Cc: users@ovirt.org
>>> Sent: Monday, May 19, 2014 4:01:04 PM
>>> Subject: Re: [ovirt-users] power outage: HA vms not restarted
>>>
>>>> On 05/19/2014 04:56 PM, Eli Mesika wrote:
>>>>>> but shouldn't engine restart the corresponding vms after the host holding them came up? (without manual fence)
>>>>>> because they are up - so engine can query them about running/not running vms and get actual state of vms - running or not
>>>>>> the only host that was down at that point is srv5, which held only 1 vm - and it was correctly put in 'unknown' state, other vms were just 'down' until we manually started them
>>>>> Are you sure that those VMs are defined as Highly Available VMs ???
>>>> yes, i'm sure. double checked in webinterface, plus log entries like:
>>> May this be related, I think that in your case host came up very fast while the fencing operation already started
>>> https://bugzilla.redhat.com/show_bug.cgi?id=1064860
>> doesn't seem so, as the vm wasn't put into 'unknown' state and srv19 was already up when engine booted, so no fence attempt was ever made for it
>>>> 2014-05-17 00:23:10,565 INFO [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (DefaultQuartzScheduler_Worker-14) vm prod.gui running in db and not running in vds - add to rerun treatment. vds srv19
>>>> 2014-05-17 00:23:10,909 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-14) [2989840c] Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: Highly Available VM prod.gui failed. It will be restarted automatically.
>>>> 2014-05-17 00:23:10,911 INFO [org.ovirt.engine.core.bll.VdsEventListener] (DefaultQuartzScheduler_Worker-14) [2989840c] Highly Available VM went down. Attempting to restart. VM Name: prod.gui, VM Id:bbb7a605-d511-461d-99d2-c5a5bf8d9958
Re: [ovirt-users] power outage: HA vms not restarted
On 05/19/2014 05:10 PM, Eli Mesika wrote: - Original Message - From: "Yuriy Demchenko" To: "Eli Mesika" Cc: users@ovirt.org Sent: Monday, May 19, 2014 3:56:41 PM Subject: Re: [ovirt-users] power outage: HA vms not restarted saw another strange moment in logs: for example, vm 'prod.gui', according to logs engine tried to restart it on host srv19 at 00:23:11 and it failed with error 'Failed to run VM prod.gui: VdcBLLException: java.lang.NullPointerException (Failed with error ENGINE and code 5001)' This is a bug and you should open a BZ for that and attach related engine & vdsm logs done https://bugzilla.redhat.com/show_bug.cgi?id=1099134 in vdsm.log on srv19 i can see this error: StoragePoolUnknown: Unknown pool id, pool not connected: ('5849b030-626e-47cb-ad90-3ce782d831b3',) Thread-18::DEBUG::2014-05-17 00:23:13,022::task::869::TaskManager.Task::(_run) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::Task._run: 27a323de-ac42-4d68-992f-eb5c84d8baea ('5849b030-626e-47cb-ad90-3ce782d831b3',) {} failed - stopping task Thread-18::DEBUG::2014-05-17 00:23:13,022::task::1194::TaskManager.Task::(stop) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::stopping in state preparing (force False) Thread-18::DEBUG::2014-05-17 00:23:13,023::task::974::TaskManager.Task::(_decref) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::ref 1 aborting True Thread-18::INFO::2014-05-17 00:23:13,023::task::1151::TaskManager.Task::(prepare) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::aborting: Task is aborted: 'Unknown pool id, pool not connected' - code 309 Thread-18::DEBUG::2014-05-17 00:23:13,024::task::1156::TaskManager.Task::(prepare) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::Prepare: aborted: Unknown pool id, pool not connected Thread-18::DEBUG::2014-05-17 00:23:13,024::task::974::TaskManager.Task::(_decref) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::ref 0 aborting True according to engine database - storage pool '5849b030-626e-47cb-ad90-3ce782d831b3' is datacenter 'CODD'. 
but - according to engine logs SPM start for datacenter CODD was run only at 00:23:27, i.e. _after_ vm restart attemts How did you concluded that ? erm, am i wrong here? restart attempt according to logs above were at 00:23:11 SPM start according to logs below were at 00:23:27 or i'm wrong in assumption that spm is needed for vm start? 2014-05-17 00:23:27,585 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-43) [1bb8f3c0] starting spm on vds srv12, storage pool CODD, prevId 9, LVER 11 2014-05-17 00:23:27,610 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStartVDSCommand] (DefaultQuartzScheduler_Worker-43) [1bb8f3c0] START, SpmStartVDSCommand(HostName = srv12, HostId = eac8d084-1be0-4e8d-a589-385f71fe3afa, storagePoolId = 5849b030-626e-47cb-ad90-3ce782d831b3, prevId=9, prevLVER=11, storagePoolFormatType=V3, recoveryMode=Manual, SCSIFencing=false), log id: 1844d391 i thought no VM can be started until SPM host selected, why in my case vm restart occurred before that? and what 'pool not connected' means? no spm running, storage unavailable from some host or something else? In this case, storage unavailable from some host shouldn't then that host be set into 'non-operational' state and restart attempt carried by another host? Yuriy Demchenko On 05/19/2014 03:13 PM, Yuriy Demchenko wrote: On 05/19/2014 02:36 PM, Eli Mesika wrote: Hi, i'm running ovirt-3.2.2-el6 on 18 el6 hosts with FC san storage, 46 HA vms in 2 datacenters (3 hosts uses different storage with no connectivity to first storage, that's why second DC) Recently (2014-05-17) i had a double power outage: first blackout at 00:16, went back at ~00:19, second blackout at 00:26, went back at 10:06 When finally all went up (after approx. 10:16) - only 2 vms were restarted from 46. 
From browsing engine log i saw failed restart attemts of almost all vms after first blackout with error 'Failed with error ENGINE and code 5001', but after second blackout i saw no attempts to restart vms, and only error was 'connect timeout' (probably to srv5 - that host physically died after blackouts). And i cant figure why HA vms were not restarted? Please advice engine and (supposedly) spm host logs in attach. Hi Yuriy What I see is that the log for 2014-05-17 is started at 2014-05-17 00:23:03 so I can not track the first interval you had mentioned (00:19 to 00:26) 00:23 is the time when engine booted up after first outage, that's why logs started at 00:23:03 However, I can clearly see that at 2014-05-17 00:23:03 the engine was restarted and at 2014-05-17 00:23:09,423 we had started to get connection errors. We had tried to solve the problem by doing Soft-Fencing (actually vdsmd service restart) on the problematic hosts, but ssh to the host failed so we had tried to hard-fence
Re: [ovirt-users] power outage: HA vms not restarted
On 05/19/2014 05:13 PM, Eli Mesika wrote:
> - Original Message -
> From: "Yuriy Demchenko"
> To: "Eli Mesika"
> Cc: users@ovirt.org
> Sent: Monday, May 19, 2014 4:01:04 PM
> Subject: Re: [ovirt-users] power outage: HA vms not restarted
>
>> On 05/19/2014 04:56 PM, Eli Mesika wrote:
>>>> but shouldn't engine restart the corresponding vms after the host holding them came up? (without manual fence)
>>>> because they are up - so engine can query them about running/not running vms and get actual state of vms - running or not
>>>> the only host that was down at that point is srv5, which held only 1 vm - and it was correctly put in 'unknown' state, other vms were just 'down' until we manually started them
>>> Are you sure that those VMs are defined as Highly Available VMs ???
>> yes, i'm sure. double checked in webinterface, plus log entries like:
> May this be related, I think that in your case host came up very fast while the fencing operation already started
> https://bugzilla.redhat.com/show_bug.cgi?id=1064860

doesn't seem so, as the vm wasn't put into 'unknown' state and srv19 was already up when engine booted, so no fence attempt was ever made for it

>> 2014-05-17 00:23:10,565 INFO [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (DefaultQuartzScheduler_Worker-14) vm prod.gui running in db and not running in vds - add to rerun treatment. vds srv19
>> 2014-05-17 00:23:10,909 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-14) [2989840c] Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: Highly Available VM prod.gui failed. It will be restarted automatically.
>> 2014-05-17 00:23:10,911 INFO [org.ovirt.engine.core.bll.VdsEventListener] (DefaultQuartzScheduler_Worker-14) [2989840c] Highly Available VM went down. Attempting to restart. VM Name: prod.gui, VM Id:bbb7a605-d511-461d-99d2-c5a5bf8d9958
Re: [ovirt-users] power outage: HA vms not restarted
On 05/19/2014 04:56 PM, Eli Mesika wrote:
>> but shouldn't engine restart the corresponding vms after the host holding them came up? (without manual fence)
>> because they are up - so engine can query them about running/not running vms and get actual state of vms - running or not
>> the only host that was down at that point is srv5, which held only 1 vm - and it was correctly put in 'unknown' state, other vms were just 'down' until we manually started them
> Are you sure that those VMs are defined as Highly Available VMs ???

yes, I'm sure. I double-checked in the web interface, plus there are log entries like:

2014-05-17 00:23:10,565 INFO [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (DefaultQuartzScheduler_Worker-14) vm prod.gui running in db and not running in vds - add to rerun treatment. vds srv19
2014-05-17 00:23:10,909 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-14) [2989840c] Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: Highly Available VM prod.gui failed. It will be restarted automatically.
2014-05-17 00:23:10,911 INFO [org.ovirt.engine.core.bll.VdsEventListener] (DefaultQuartzScheduler_Worker-14) [2989840c] Highly Available VM went down. Attempting to restart. VM Name: prod.gui, VM Id:bbb7a605-d511-461d-99d2-c5a5bf8d9958
Re: [ovirt-users] power outage: HA vms not restarted
saw another strange moment in logs: for example, vm 'prod.gui', according to logs engine tried to restart it on host srv19 at 00:23:11 and it failed with error 'Failed to run VM prod.gui: VdcBLLException: java.lang.NullPointerException (Failed with error ENGINE and code 5001)' in vdsm.log on srv19 i can see this error: StoragePoolUnknown: Unknown pool id, pool not connected: ('5849b030-626e-47cb-ad90-3ce782d831b3',) Thread-18::DEBUG::2014-05-17 00:23:13,022::task::869::TaskManager.Task::(_run) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::Task._run: 27a323de-ac42-4d68-992f-eb5c84d8baea ('5849b030-626e-47cb-ad90-3ce782d831b3',) {} failed - stopping task Thread-18::DEBUG::2014-05-17 00:23:13,022::task::1194::TaskManager.Task::(stop) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::stopping in state preparing (force False) Thread-18::DEBUG::2014-05-17 00:23:13,023::task::974::TaskManager.Task::(_decref) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::ref 1 aborting True Thread-18::INFO::2014-05-17 00:23:13,023::task::1151::TaskManager.Task::(prepare) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::aborting: Task is aborted: 'Unknown pool id, pool not connected' - code 309 Thread-18::DEBUG::2014-05-17 00:23:13,024::task::1156::TaskManager.Task::(prepare) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::Prepare: aborted: Unknown pool id, pool not connected Thread-18::DEBUG::2014-05-17 00:23:13,024::task::974::TaskManager.Task::(_decref) Task=`27a323de-ac42-4d68-992f-eb5c84d8baea`::ref 0 aborting True according to engine database - storage pool '5849b030-626e-47cb-ad90-3ce782d831b3' is datacenter 'CODD'. but - according to engine logs SPM start for datacenter CODD was run only at 00:23:27, i.e. 
_after_ vm restart attemts 2014-05-17 00:23:27,585 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (DefaultQuartzScheduler_Worker-43) [1bb8f3c0] starting spm on vds srv12, storage pool CODD, prevId 9, LVER 11 2014-05-17 00:23:27,610 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStartVDSCommand] (DefaultQuartzScheduler_Worker-43) [1bb8f3c0] START, SpmStartVDSCommand(HostName = srv12, HostId = eac8d084-1be0-4e8d-a589-385f71fe3afa, storagePoolId = 5849b030-626e-47cb-ad90-3ce782d831b3, prevId=9, prevLVER=11, storagePoolFormatType=V3, recoveryMode=Manual, SCSIFencing=false), log id: 1844d391 i thought no VM can be started until SPM host selected, why in my case vm restart occurred before that? and what 'pool not connected' means? no spm running, storage unavailable from some host or something else? Yuriy Demchenko On 05/19/2014 03:13 PM, Yuriy Demchenko wrote: On 05/19/2014 02:36 PM, Eli Mesika wrote: Hi, i'm running ovirt-3.2.2-el6 on 18 el6 hosts with FC san storage, 46 HA vms in 2 datacenters (3 hosts uses different storage with no connectivity to first storage, that's why second DC) Recently (2014-05-17) i had a double power outage: first blackout at 00:16, went back at ~00:19, second blackout at 00:26, went back at 10:06 When finally all went up (after approx. 10:16) - only 2 vms were restarted from 46. From browsing engine log i saw failed restart attemts of almost all vms after first blackout with error 'Failed with error ENGINE and code 5001', but after second blackout i saw no attempts to restart vms, and only error was 'connect timeout' (probably to srv5 - that host physically died after blackouts). And i cant figure why HA vms were not restarted? Please advice engine and (supposedly) spm host logs in attach. 
Hi Yuriy What I see is that the log for 2014-05-17 is started at 2014-05-17 00:23:03 so I can not track the first interval you had mentioned (00:19 to 00:26) 00:23 is the time when engine booted up after first outage, that's why logs started at 00:23:03 However, I can clearly see that at 2014-05-17 00:23:03 the engine was restarted and at 2014-05-17 00:23:09,423 we had started to get connection errors. We had tried to solve the problem by doing Soft-Fencing (actually vdsmd service restart) on the problematic hosts, but ssh to the host failed so we had tried to hard-fence the host (restart/reboot), but this was due the configurable "quite time" in which we are preventing fencing operation after an engine restart which is set by default to 5 min (DisableFenceAtStartupInSec key in engine-config) and therefor we had skipped the fencing operation... my hosts usually boots slower than engine, long bios checks + random power on delay (120s), thats why at first engine reports connect errors however, in logs i see, that some of them were allready up and engine successfully contacted them: So maybe you should put DisableFenceAtStartupInSec to a higher value instead of the default 5 min... 2014-05-17 00:23:10,450 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Wo
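The ordering question raised in this message (HA restart attempt versus SPM start) comes down to comparing two engine.log timestamps. A minimal sketch in Python, assuming the timestamp is always the first 23 characters of an engine.log line in the form shown in the excerpts; the helper name is hypothetical and the two lines are copied from the quoted log entries in this thread.

```python
from datetime import datetime

# engine.log timestamps look like "2014-05-17 00:23:10,911"
ENGINE_TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def engine_timestamp(line):
    """Parse the leading timestamp of an engine.log line."""
    return datetime.strptime(line[:23], ENGINE_TS_FORMAT)


restart_line = (
    "2014-05-17 00:23:10,911 INFO [org.ovirt.engine.core.bll.VdsEventListener] "
    "(DefaultQuartzScheduler_Worker-14) [2989840c] Highly Available VM went down. "
    "Attempting to restart. VM Name: prod.gui, "
    "VM Id:bbb7a605-d511-461d-99d2-c5a5bf8d9958"
)
spm_line = (
    "2014-05-17 00:23:27,585 INFO "
    "[org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] "
    "(DefaultQuartzScheduler_Worker-43) [1bb8f3c0] starting spm on vds srv12, "
    "storage pool CODD, prevId 9, LVER 11"
)

# Positive value: the SPM start was logged after the restart attempt.
gap_seconds = (engine_timestamp(spm_line) - engine_timestamp(restart_line)).total_seconds()
print("SPM start logged %.3f s after the HA restart attempt" % gap_seconds)
```

On these two lines the restart attempt precedes the SPM start by a little under 17 seconds, which matches the ordering described in the message; the sketch says nothing about whether an SPM is actually required for starting a VM.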
Re: [ovirt-users] power outage: HA vms not restarted
On 05/19/2014 02:36 PM, Eli Mesika wrote: Hi, i'm running ovirt-3.2.2-el6 on 18 el6 hosts with FC san storage, 46 HA vms in 2 datacenters (3 hosts uses different storage with no connectivity to first storage, that's why second DC) Recently (2014-05-17) i had a double power outage: first blackout at 00:16, went back at ~00:19, second blackout at 00:26, went back at 10:06 When finally all went up (after approx. 10:16) - only 2 vms were restarted from 46. From browsing engine log i saw failed restart attemts of almost all vms after first blackout with error 'Failed with error ENGINE and code 5001', but after second blackout i saw no attempts to restart vms, and only error was 'connect timeout' (probably to srv5 - that host physically died after blackouts). And i cant figure why HA vms were not restarted? Please advice engine and (supposedly) spm host logs in attach. Hi Yuriy What I see is that the log for 2014-05-17 is started at 2014-05-17 00:23:03 so I can not track the first interval you had mentioned (00:19 to 00:26) 00:23 is the time when engine booted up after first outage, that's why logs started at 00:23:03 However, I can clearly see that at 2014-05-17 00:23:03 the engine was restarted and at 2014-05-17 00:23:09,423 we had started to get connection errors. We had tried to solve the problem by doing Soft-Fencing (actually vdsmd service restart) on the problematic hosts, but ssh to the host failed so we had tried to hard-fence the host (restart/reboot), but this was due the configurable "quite time" in which we are preventing fencing operation after an engine restart which is set by default to 5 min (DisableFenceAtStartupInSec key in engine-config) and therefor we had skipped the fencing operation... 
my hosts usually boots slower than engine, long bios checks + random power on delay (120s), thats why at first engine reports connect errors however, in logs i see, that some of them were allready up and engine successfully contacted them: So maybe you should put DisableFenceAtStartupInSec to a higher value instead of the default 5 min... 2014-05-17 00:23:10,450 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-18) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: State was set to Up for host srv11. 2014-05-17 00:23:10,456 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-4) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: State was set to Up for host srv4. 2014-05-17 00:23:10,458 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-11) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: State was set to Up for host srv7. 2014-05-17 00:23:10,460 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-20) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: State was set to Up for host srv9. and after 00:23:11 i saw no fencing-related messages, only vm restart attemts that failed with strange errors like: 'Failed with error ENGINE and code 5001' 'Candidate host srv1 (2a89e565-aa4e-4a19-82e3-e72e4edee111) was filtered out by VAR__FILTERTYPE__INTERNAL filter Memory' 'CanDoAction of action RunVm failed. Reasons:VAR__ACTION__RUN,VAR__TYPE__VM,ACTION_TYPE_FAILED_VDS_VM_MEMORY,VAR__ACTION__RUN,VAR__TYPE__VM,ACTION_TYPE_FAILED_VDS_VM_MEMORY,SCHEDULING_ALL_HOSTS_FILTERED_OUT,VAR__FILTERTYPE__INTERNAL,$hostName srv1,$filterName Memory,SCHEDULING_HOST_FILTERED_REASON' Strange that you didn't got the message itself (this is only the message key) The original message is : Cannot Run VM. 
There are no available running Hosts with sufficient memory in VM's Cluster . So, it failed on the RunVM command validation where no host with enough memory to run the VM was found yea, but how that can be? all my hosts have 256gb ram and only 2 vms are ram-hungry (128gb and 240gb), other uses much less (2-64gb) For the first period as I said I only can guess that one of your hosts fencing attempts was after those 5 minutes and therefor it was rebooted and the HA VMs were freed to run on other host. For the second period on which I have logs, the host fencing failed due to the required "quite time" and in this situation the only hing you can do in order to have the HA VMs running again is to right-click on each host and press "Confirm that host has been rebooted" but i see in logs - after second period, at 10:14+ all hosts but one (srv5) were up and power management verified successfully, isn't that should be enough for engine to verify all ha vms down and restart them? How should engine know that the host was rebooted?? The fact that power management verified successfully is not enough in order to run the VMs on another host. As I see the fence commands that intend to reboot the host holding the VMs were failed ... bu
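The startup quiet time discussed in this thread can be illustrated with a few lines of Python. This is only a sketch of the behaviour as Eli describes it (fencing is skipped until DisableFenceAtStartupInSec seconds have passed since engine start, 5 minutes by default) and is not the engine's actual code; the function name is an assumption, while the timestamps are the ones quoted from the logs.

```python
from datetime import datetime, timedelta


def fencing_allowed(engine_start, now, quiet_seconds=300):
    """Return True once the startup quiet time has elapsed.

    quiet_seconds stands in for DisableFenceAtStartupInSec (default 5 min
    according to the thread).
    """
    return now - engine_start >= timedelta(seconds=quiet_seconds)


engine_start = datetime(2014, 5, 17, 0, 23, 3)               # engine restarted
first_conn_errors = datetime(2014, 5, 17, 0, 23, 9, 423000)  # connection errors begin
quiet_time_end = engine_start + timedelta(seconds=300)

print(fencing_allowed(engine_start, first_conn_errors))  # still inside the window
print(quiet_time_end.time())                              # when the window ends
```

With these values the quiet time would have ended at 00:28:03, after the second blackout reported at 00:26, so under this reading no fencing could have been attempted between the two outages.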
Re: [ovirt-users] power outage: HA vms not restarted
Yuriy DemchenkoOn 05/19/2014 01:27 PM, Eli Mesika wrote: - Original Message - From: "Yuriy Demchenko" To: users@ovirt.org Sent: Monday, May 19, 2014 11:34:15 AM Subject: [ovirt-users] power outage: HA vms not restarted Hi, i'm running ovirt-3.2.2-el6 on 18 el6 hosts with FC san storage, 46 HA vms in 2 datacenters (3 hosts uses different storage with no connectivity to first storage, that's why second DC) Recently (2014-05-17) i had a double power outage: first blackout at 00:16, went back at ~00:19, second blackout at 00:26, went back at 10:06 When finally all went up (after approx. 10:16) - only 2 vms were restarted from 46. From browsing engine log i saw failed restart attemts of almost all vms after first blackout with error 'Failed with error ENGINE and code 5001', but after second blackout i saw no attempts to restart vms, and only error was 'connect timeout' (probably to srv5 - that host physically died after blackouts). And i cant figure why HA vms were not restarted? Please advice engine and (supposedly) spm host logs in attach. Hi Yuriy What I see is that the log for 2014-05-17 is started at 2014-05-17 00:23:03 so I can not track the first interval you had mentioned (00:19 to 00:26) 00:23 is the time when engine booted up after first outage, that's why logs started at 00:23:03 However, I can clearly see that at 2014-05-17 00:23:03 the engine was restarted and at 2014-05-17 00:23:09,423 we had started to get connection errors. We had tried to solve the problem by doing Soft-Fencing (actually vdsmd service restart) on the problematic hosts, but ssh to the host failed so we had tried to hard-fence the host (restart/reboot), but this was due the configurable "quite time" in which we are preventing fencing operation after an engine restart which is set by default to 5 min (DisableFenceAtStartupInSec key in engine-config) and therefor we had skipped the fencing operation... 
my hosts usually boots slower than engine, long bios checks + random power on delay (120s), thats why at first engine reports connect errors however, in logs i see, that some of them were allready up and engine successfully contacted them: 2014-05-17 00:23:10,450 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-18) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: State was set to Up for host srv11. 2014-05-17 00:23:10,456 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-4) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: State was set to Up for host srv4. 2014-05-17 00:23:10,458 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-11) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: State was set to Up for host srv7. 2014-05-17 00:23:10,460 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-20) Correlation ID: null, Call Stack: null, Custom Event ID: -1, Message: State was set to Up for host srv9. and after 00:23:11 i saw no fencing-related messages, only vm restart attemts that failed with strange errors like: 'Failed with error ENGINE and code 5001' 'Candidate host srv1 (2a89e565-aa4e-4a19-82e3-e72e4edee111) was filtered out by VAR__FILTERTYPE__INTERNAL filter Memory' 'CanDoAction of action RunVm failed. Reasons:VAR__ACTION__RUN,VAR__TYPE__VM,ACTION_TYPE_FAILED_VDS_VM_MEMORY,VAR__ACTION__RUN,VAR__TYPE__VM,ACTION_TYPE_FAILED_VDS_VM_MEMORY,SCHEDULING_ALL_HOSTS_FILTERED_OUT,VAR__FILTERTYPE__INTERNAL,$hostName srv1,$filterName Memory,SCHEDULING_HOST_FILTERED_REASON' For the first period as I said I only can guess that one of your hosts fencing attempts was after those 5 minutes and therefor it was rebooted and the HA VMs were freed to run on other host. 
For the second period, for which I have logs, host fencing failed due to the required "quiet time", and in this situation the only thing you can do to get the HA VMs running again is to right-click on each host and press "Confirm that host has been rebooted". But I see in the logs that after the second period, at 10:14+, all hosts but one (srv5) were up and power management was verified successfully; shouldn't that be enough for the engine to verify that all HA VMs are down and restart them? 2014-05-17 10:11:56,946 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-75) [dbc315d] Correlation ID: 73469dd6, Call Stack: null, Custom Event ID: -1, Message: Host srv17 power management was verified successfully. In fact, at ~17:55 my colleague restarted the engine, forced SPM selection and started all VMs - all started without errors, and he didn't have to click 'confirm host has been
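The quiet-time rule discussed in this thread can be sketched in a few lines. This is only an illustration of the arithmetic, assuming the 5-minute default mentioned for DisableFenceAtStartupInSec and using the timestamps quoted from the engine log; the function name and the exact comparison are assumptions, not the engine's actual code.

```python
from datetime import datetime, timedelta

# Default quoted in the thread for DisableFenceAtStartupInSec (5 minutes).
QUIET_SECONDS = 300


def fencing_allowed(engine_start, event_time, quiet_seconds=QUIET_SECONDS):
    """Hypothetical helper: True once the post-restart quiet window has elapsed."""
    return event_time - engine_start >= timedelta(seconds=quiet_seconds)


# Timestamps taken from the messages above.
engine_start = datetime(2014, 5, 17, 0, 23, 3)   # engine restarted
first_errors = datetime(2014, 5, 17, 0, 23, 9)   # first connection errors
window_end = engine_start + timedelta(seconds=QUIET_SECONDS)

print(fencing_allowed(engine_start, first_errors))
print(window_end.time())
```

With these timestamps the window would only close at 00:28:03, which is after the second blackout at 00:26, so under this reading no fencing could have happened between the two outages.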
Re: [Users] [3.2-el6] corrupted disk after snapshot removal
Well, I restored the VM from backup. I couldn't find the cause of the problem in the logs and haven't yet had a chance to run more tests to reproduce it, as my test env is currently occupied... Yuriy Demchenko On 11/05/2013 01:30 PM, Itamar Heim wrote: On 10/15/2013 02:58 PM, Yuriy Demchenko wrote: Hi, I've run into a disturbing situation recently with my production ovirt-3.2-el6. I planned an upgrade for an app on one of my VMs, so I took a live snapshot and did the application upgrade. The upgrade went fine, so I no longer needed the snapshot - I shut down the VM and deleted the snapshot via the admin interface. When the task completed (it took about 40 mins) I tried to start the VM - but its OS didn't start - "no boot disk found". The Disks tab in the admin interface shows 'actual size' = 1Gb, and fdisk from a livecd shows no partitions at all. It seems snapshot removal somehow corrupted the VM's disk. What went wrong and how do I avoid this in future? Logs attached. ovirt-engine.noarch 3.2.2-1.1.el6 vdsm.x86_64 4.10.3-16.el6 Storage domain type is FC was this resolved?
Re: [Users] Customizing node configuration
Hi, if I may, I believe you've hit this bug: https://bugzilla.redhat.com/show_bug.cgi?id=927874 for 3.2 there's workaround: issue "engine-config -s EmulatedMachine=rhel6.4.0 --cver=3.2" and restart engine Yuriy Demchenko On 10/08/2013 02:11 PM, Sven Kieske wrote: Hi, ok I was able to reproduce this bug, here is some information: ManagementNode: oVirt Engine 3.2.3-1.el6 ManagementNode OS: CentOS 6.4 x64_86 We have various "local storage" datacenters on this management node, one being "LocalCenter_v3" with a single host in it. This Host runs CentOS 6.4 x64 minimal, we installed the ovirt-stable repo on it and installed then vdsm through the managementnode webadmin. this was successful, here are the vdsm-versions from the host: rpm -qa | grep vdsm vdsm-python-4.12.1-2.el6.x86_64 vdsm-xmlrpc-4.12.1-2.el6.noarch vdsm-4.12.1-2.el6.x86_64 vdsm-cli-4.12.1-2.el6.noarch vdsm-python-cpopen-4.12.1-2.el6.x86_64 So I created a VM with the following settings through webadmin portal: Memory 2048MB, Total Virtual CPUs: 2 Operating System "Red Hat Enterprise Linux 6.x x64" Console Protocol VNC Boot Options: Attach CD: CentOS 6.4-x86_64-LiveCD.os All other values were left on default. I created and attached a 10 GB virtual disk (VirtIO) and a VirtIO nic connected to default "ovirtmgmt". When I try to start this vm via Webadmin I get: VM TestSven_v3_Test1 was started by admin@internal (Host: server3). VM TestSven_v3_Test1 is down. Exit message: internal error process exited while connecting to monitor: Supported machines are: pc RHEL 6.4.0 PC (alias of rhel6.4.0) rhel6.4.0 RHEL 6.4.0 PC (default) rhel6.3.0 RHEL 6.3.0 PC rhel6.2.0 RHEL 6.2.0 PC rhel6.1.0 RHEL 6.1.0 PC rhel6.0.0 RHEL 6.0.0 PC rhel5.5.0 RHEL 5.5.0 PC rhel5.4.4 RHEL 5.4.4 PC rhel5.4.0 RHEL 5.4.0 PC . Failed to run VM TestSven_v3_Test1 on Host server3 Please let me know what logs you need. On older vdsm-versions (ovirt-node 2.6.1., different datacenter, same mangementnode) everything works fine (creating and starting vms). 
The same VM exported via export-domain to this datacenter fails with the same error. Thanks in advance Sven PS: The iso-domain resides on the management node. On 07.10.2013 19:07, Itamar Heim wrote: On 10/07/2013 06:55 PM, Sven Kieske wrote: Hi, thanks again for the answer, we were able to install ovirt-node onto centos 6.4 minimal successfully! to avoid confusion, we'd call this a "host" and install vdsm onto it. we call ovirt-node only the trimmed down one. mike/fabian - which centos is ovirt-node based on? this seems 6.2, not 6.4?
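Before applying the EmulatedMachine workaround, it may help to check that the value is actually among the machine types the host reports. The sketch below pulls the names out of an exit message like the one quoted above; the helper name and the regular expression are assumptions based only on the format seen in this thread, and the sample message is shortened.

```python
import re


def supported_machines(exit_message):
    """Hypothetical helper: machine type names listed after 'Supported machines are:'."""
    _, _, tail = exit_message.partition("Supported machines are:")
    # Each entry looks like "<name> RHEL x.y.z PC"; aliases in parentheses are skipped.
    return re.findall(r"\b(pc|rhel\d+\.\d+\.\d+)\s+RHEL", tail)


# Shortened copy of the exit message from the report above.
msg = ("internal error process exited while connecting to monitor: "
       "Supported machines are: pc RHEL 6.4.0 PC (alias of rhel6.4.0) "
       "rhel6.4.0 RHEL 6.4.0 PC (default) rhel6.3.0 RHEL 6.3.0 PC")

machines = supported_machines(msg)
wanted = "rhel6.4.0"  # value used in the engine-config workaround
print(wanted in machines)
```

If the wanted value is not in the list, setting it in the engine would presumably lead to the same failure on that host.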
Re: [Users] cant add storage connection via api
Oh, I see. Will wait till next release than. Thanks for help! Yuriy Demchenko On 09/23/2013 12:34 PM, Michael Pasternak wrote: On 09/23/2013 11:22 AM, Yuriy Demchenko wrote: On 09/23/2013 12:13 PM, Alissa Bonas wrote: - Original Message - From: "Yuriy Demchenko" To: "Alissa Bonas" Cc: "Ofer Schreiber" , users@ovirt.org, "Michael Pasternak" , "Allon Mureinik" Sent: Monday, September 23, 2013 10:57:12 AM Subject: Re: [Users] cant add storage connection via api On 09/23/2013 11:43 AM, Alissa Bonas wrote: - Original Message - From: "Michael Pasternak" To: "Yuriy Demchenko" , "Alissa Bonas" , "Alon Bar-Lev" Cc: "Ofer Schreiber" , users@ovirt.org Sent: Monday, September 23, 2013 10:22:49 AM Subject: Re: [Users] cant add storage connection via api On 09/23/2013 10:05 AM, Yuriy Demchenko wrote: Hi, url [1] (https://ovirt.spb.stone.local/api?rsdl) not working for me - gets http 500 error with "java.lang.NullPointerException" java.lang.NullPointerException org.ovirt.engine.api.restapi.resource.BackendApiResource.addSystemVersion(BackendApiResource.java:358) org.ovirt.engine.api.restapi.resource.BackendApiResource.get(BackendApiResource.java:316) sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) java.lang.reflect.Method.invoke(Method.java:606) org.jboss.resteasy.core.MethodInjectorImpl.invoke(MethodInjectorImpl.java:155) org.jboss.resteasy.core.ResourceMethod.invokeOnTarget(ResourceMethod.java:257) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:222) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:211) org.jboss.resteasy.core.SynchronousDispatcher.getResponse(SynchronousDispatcher.java:525) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:502) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:119) 
org.jboss.resteasy.plugins.server.servlet.ServletContainerDispatcher.service(ServletContainerDispatcher.java:208) org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:55) org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:50) javax.servlet.http.HttpServlet.service(HttpServlet.java:847) api page (without "?rsdl") opens just fine. this is known issue caused by branding infra, i believe it was fixed in the following release (Alon correct me if i'm wrong) Feature "manage storage connections" was mentioned in oVirt-3.3 release notes (http://www.ovirt.org/OVirt_3.3_release_notes), under "storage" section. alissa? Storage connections management should be part of 3.3. Yuriy, when did you install the 3.3 and which exact build did you take? That was upgrade from 3.2.2 to 3.3, did that on Tuesday 17, from 3.3 release repo (http://resources.ovirt.org/releases/3.3/rpm/EL/6/) ovirt-engine-3.3.0-4.el6 noarch 9/17/13 3:05:40 PM MUT ovirt-engine-backend-3.3.0-4.el6 noarch 9/17/13 3:05:34 PM MUT ovirt-engine-cli-3.2.0.12-1.el6 noarch 6/17/13 11:24:48 AM MUT ovirt-engine-dbscripts-3.3.0-4.el6 noarch 9/17/13 3:05:39 PM MUT ovirt-engine-lib-3.3.0-4.el6 noarch 9/17/13 2:57:00 PM MUT ovirt-engine-restapi-3.3.0-4.el6 noarch 9/17/13 3:05:30 PM MUT ovirt-engine-sdk-3.2.0.11-1.el6 noarch 6/17/13 11:24:33 AM MUT ovirt-engine-setup-3.3.0-4.el6 noarch 9/17/13 2:57:03 PM MUT ovirt-engine-tools-3.3.0-4.el6 noarch 9/17/13 3:05:35 PM MUT ovirt-engine-userportal-3.3.0-4.el6 noarch 9/17/13 3:05:29 PM MUT ovirt-engine-webadmin-portal-3.3.0-4.el6 noarch 9/17/13 3:06:01 PM MUT ovirt-engine-websocket-proxy-3.3.0-4.el6 noarch 9/17/13 3:21:14 PM MUT ovirt-host-deploy-1.1.1-1.el6 noarch 9/17/13 3:05:15 PM MUT ovirt-host-deploy-java-1.1.1-1.el6 noarch 9/17/13 3:05:16 PM MUT ovirt-image-uploader-3.3.0-1.el6 noarch 9/17/13 3:05:14 PM MUT ovirt-iso-uploader-3.3.0-1.el6 noarch 9/17/13 3:05:17 PM MUT 
ovirt-log-collector-3.3.0-1.el6 noarch 9/17/13 3:05:18 PM MUT Also, can you access the following url (just a regular GET request) ? https://ovirt.spb.stone.local/api/storageconnections Yes I can. And I get list of all current connections there. OK, thank you. Please let me know 2 more things: 1. Is the connections subresource for a storage domain available? It is accessed in the link below with a GET request (please replace domainId placeholder with an id of one of your storage domains, doesn't matter which) https://ovirt.spb.stone.local/api/storagedomains//storageconnections nope, no "storagec
Re: [Users] cant add storage connection via api
On 09/23/2013 12:13 PM, Alissa Bonas wrote: - Original Message - From: "Yuriy Demchenko" To: "Alissa Bonas" Cc: "Ofer Schreiber" , users@ovirt.org, "Michael Pasternak" , "Allon Mureinik" Sent: Monday, September 23, 2013 10:57:12 AM Subject: Re: [Users] cant add storage connection via api On 09/23/2013 11:43 AM, Alissa Bonas wrote: - Original Message - From: "Michael Pasternak" To: "Yuriy Demchenko" , "Alissa Bonas" , "Alon Bar-Lev" Cc: "Ofer Schreiber" , users@ovirt.org Sent: Monday, September 23, 2013 10:22:49 AM Subject: Re: [Users] cant add storage connection via api On 09/23/2013 10:05 AM, Yuriy Demchenko wrote: Hi, url [1] (https://ovirt.spb.stone.local/api?rsdl) not working for me - gets http 500 error with "java.lang.NullPointerException" java.lang.NullPointerException org.ovirt.engine.api.restapi.resource.BackendApiResource.addSystemVersion(BackendApiResource.java:358) org.ovirt.engine.api.restapi.resource.BackendApiResource.get(BackendApiResource.java:316) sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) java.lang.reflect.Method.invoke(Method.java:606) org.jboss.resteasy.core.MethodInjectorImpl.invoke(MethodInjectorImpl.java:155) org.jboss.resteasy.core.ResourceMethod.invokeOnTarget(ResourceMethod.java:257) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:222) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:211) org.jboss.resteasy.core.SynchronousDispatcher.getResponse(SynchronousDispatcher.java:525) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:502) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:119) org.jboss.resteasy.plugins.server.servlet.ServletContainerDispatcher.service(ServletContainerDispatcher.java:208) 
org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:55) org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:50) javax.servlet.http.HttpServlet.service(HttpServlet.java:847) api page (without "?rsdl") opens just fine. this is known issue caused by branding infra, i believe it was fixed in the following release (Alon correct me if i'm wrong) Feature "manage storage connections" was mentioned in oVirt-3.3 release notes (http://www.ovirt.org/OVirt_3.3_release_notes), under "storage" section. alissa? Storage connections management should be part of 3.3. Yuriy, when did you install the 3.3 and which exact build did you take? That was upgrade from 3.2.2 to 3.3, did that on Tuesday 17, from 3.3 release repo (http://resources.ovirt.org/releases/3.3/rpm/EL/6/) ovirt-engine-3.3.0-4.el6 noarch 9/17/13 3:05:40 PM MUT ovirt-engine-backend-3.3.0-4.el6 noarch 9/17/13 3:05:34 PM MUT ovirt-engine-cli-3.2.0.12-1.el6 noarch 6/17/13 11:24:48 AM MUT ovirt-engine-dbscripts-3.3.0-4.el6 noarch 9/17/13 3:05:39 PM MUT ovirt-engine-lib-3.3.0-4.el6 noarch 9/17/13 2:57:00 PM MUT ovirt-engine-restapi-3.3.0-4.el6 noarch 9/17/13 3:05:30 PM MUT ovirt-engine-sdk-3.2.0.11-1.el6 noarch 6/17/13 11:24:33 AM MUT ovirt-engine-setup-3.3.0-4.el6 noarch 9/17/13 2:57:03 PM MUT ovirt-engine-tools-3.3.0-4.el6 noarch 9/17/13 3:05:35 PM MUT ovirt-engine-userportal-3.3.0-4.el6 noarch 9/17/13 3:05:29 PM MUT ovirt-engine-webadmin-portal-3.3.0-4.el6 noarch 9/17/13 3:06:01 PM MUT ovirt-engine-websocket-proxy-3.3.0-4.el6 noarch 9/17/13 3:21:14 PM MUT ovirt-host-deploy-1.1.1-1.el6 noarch 9/17/13 3:05:15 PM MUT ovirt-host-deploy-java-1.1.1-1.el6 noarch 9/17/13 3:05:16 PM MUT ovirt-image-uploader-3.3.0-1.el6 noarch 9/17/13 3:05:14 PM MUT ovirt-iso-uploader-3.3.0-1.el6 noarch 9/17/13 3:05:17 PM MUT ovirt-log-collector-3.3.0-1.el6 noarch 9/17/13 3:05:18 PM MUT Also, can you access the following url (just a regular GET request) ? 
https://ovirt.spb.stone.local/api/storageconnections Yes I can. And I get list of all current connections there. OK, thank you. Please let me know 2 more things: 1. Is the connections subresource for a storage domain available? It is accessed in the link below with a GET request (please replace domainId placeholder with an id of one of your storage domains, doesn't matter which) https://ovirt.spb.stone.local/api/storagedomains//storageconnections nope, no "storageconnections" under storagedomains//, only storage, link, type, master, description and name 2. In the main storage connections resource which you accessed via the below url -
Re: [Users] cant add storage connection via api
On 09/23/2013 12:06 PM, Michael Pasternak wrote: On 09/23/2013 10:58 AM, Alon Bar-Lev wrote: - Original Message - From: "Michael Pasternak" To: "Yuriy Demchenko" , "Alissa Bonas" , "Alon Bar-Lev" Cc: "Ofer Schreiber" , users@ovirt.org Sent: Monday, September 23, 2013 10:22:49 AM Subject: Re: [Users] cant add storage connection via api On 09/23/2013 10:05 AM, Yuriy Demchenko wrote: Hi, url [1] (https://ovirt.spb.stone.local/api?rsdl) not working for me - gets http 500 error with "java.lang.NullPointerException" java.lang.NullPointerException org.ovirt.engine.api.restapi.resource.BackendApiResource.addSystemVersion(BackendApiResource.java:358) org.ovirt.engine.api.restapi.resource.BackendApiResource.get(BackendApiResource.java:316) sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) java.lang.reflect.Method.invoke(Method.java:606) org.jboss.resteasy.core.MethodInjectorImpl.invoke(MethodInjectorImpl.java:155) org.jboss.resteasy.core.ResourceMethod.invokeOnTarget(ResourceMethod.java:257) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:222) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:211) org.jboss.resteasy.core.SynchronousDispatcher.getResponse(SynchronousDispatcher.java:525) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:502) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:119) org.jboss.resteasy.plugins.server.servlet.ServletContainerDispatcher.service(ServletContainerDispatcher.java:208) org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:55) org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:50) javax.servlet.http.HttpServlet.service(HttpServlet.java:847) api page (without "?rsdl") opens just fine. 
Yuriy, did your database upgrade succeed? Did you see any errors? Yes, the upgrade succeeded without any errors
Re: [Users] cant add storage connection via api
On 09/23/2013 11:43 AM, Alissa Bonas wrote: - Original Message - From: "Michael Pasternak" To: "Yuriy Demchenko" , "Alissa Bonas" , "Alon Bar-Lev" Cc: "Ofer Schreiber" , users@ovirt.org Sent: Monday, September 23, 2013 10:22:49 AM Subject: Re: [Users] cant add storage connection via api On 09/23/2013 10:05 AM, Yuriy Demchenko wrote: Hi, url [1] (https://ovirt.spb.stone.local/api?rsdl) not working for me - gets http 500 error with "java.lang.NullPointerException" java.lang.NullPointerException org.ovirt.engine.api.restapi.resource.BackendApiResource.addSystemVersion(BackendApiResource.java:358) org.ovirt.engine.api.restapi.resource.BackendApiResource.get(BackendApiResource.java:316) sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) java.lang.reflect.Method.invoke(Method.java:606) org.jboss.resteasy.core.MethodInjectorImpl.invoke(MethodInjectorImpl.java:155) org.jboss.resteasy.core.ResourceMethod.invokeOnTarget(ResourceMethod.java:257) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:222) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:211) org.jboss.resteasy.core.SynchronousDispatcher.getResponse(SynchronousDispatcher.java:525) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:502) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:119) org.jboss.resteasy.plugins.server.servlet.ServletContainerDispatcher.service(ServletContainerDispatcher.java:208) org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:55) org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:50) javax.servlet.http.HttpServlet.service(HttpServlet.java:847) api page (without "?rsdl") opens just fine. 
this is known issue caused by branding infra, i believe it was fixed in the following release (Alon correct me if i'm wrong) Feature "manage storage connections" was mentioned in oVirt-3.3 release notes (http://www.ovirt.org/OVirt_3.3_release_notes), under "storage" section. alissa? Storage connections management should be part of 3.3. Yuriy, when did you install the 3.3 and which exact build did you take? That was upgrade from 3.2.2 to 3.3, did that on Tuesday 17, from 3.3 release repo (http://resources.ovirt.org/releases/3.3/rpm/EL/6/) ovirt-engine-3.3.0-4.el6 noarch 9/17/13 3:05:40 PM MUT ovirt-engine-backend-3.3.0-4.el6 noarch 9/17/13 3:05:34 PM MUT ovirt-engine-cli-3.2.0.12-1.el6 noarch 6/17/13 11:24:48 AM MUT ovirt-engine-dbscripts-3.3.0-4.el6 noarch 9/17/13 3:05:39 PM MUT ovirt-engine-lib-3.3.0-4.el6 noarch 9/17/13 2:57:00 PM MUT ovirt-engine-restapi-3.3.0-4.el6 noarch 9/17/13 3:05:30 PM MUT ovirt-engine-sdk-3.2.0.11-1.el6 noarch 6/17/13 11:24:33 AM MUT ovirt-engine-setup-3.3.0-4.el6 noarch 9/17/13 2:57:03 PM MUT ovirt-engine-tools-3.3.0-4.el6 noarch 9/17/13 3:05:35 PM MUT ovirt-engine-userportal-3.3.0-4.el6 noarch 9/17/13 3:05:29 PM MUT ovirt-engine-webadmin-portal-3.3.0-4.el6 noarch 9/17/13 3:06:01 PM MUT ovirt-engine-websocket-proxy-3.3.0-4.el6 noarch 9/17/13 3:21:14 PM MUT ovirt-host-deploy-1.1.1-1.el6 noarch 9/17/13 3:05:15 PM MUT ovirt-host-deploy-java-1.1.1-1.el6 noarch 9/17/13 3:05:16 PM MUT ovirt-image-uploader-3.3.0-1.el6 noarch 9/17/13 3:05:14 PM MUT ovirt-iso-uploader-3.3.0-1.el6 noarch 9/17/13 3:05:17 PM MUT ovirt-log-collector-3.3.0-1.el6 noarch 9/17/13 3:05:18 PM MUT Also, can you access the following url (just a regular GET request) ? https://ovirt.spb.stone.local/api/storageconnections Yes I can. And I get list of all current connections there. 
Yuriy Demchenko On 09/22/2013 11:00 AM, Michael Pasternak wrote: Hi Yuriy, please see the correct way of adding new storageconnection at [1], under section, also please let us know if you find any inconsistency between the wiki and rsdl, regarding the UnmarshalException, i'm not sure this feature got in to the version you're using, Ofer, in what release mentioned feature is available? thanks. [1] https://ovirt.spb.stone.local/api?rsdl On 09/18/2013 03:21 PM, Yuriy Demchenko wrote: Hi, I've recently upgraded my test lab to ovirt-3.3 (el6) and trying to add additional target for iscsi domain. As described here - http://www.ovirt.org/Features/Manage_Storage_Connections , I'm trying first to add new connection via restapi, but operation fails with error "HTTP Status 400 - javax.xml.bind.UnmarshalException: unexpected element (uri:"", local:"storage_connection")"
Re: [Users] cant add storage connection via api
Hi, url [1] (https://ovirt.spb.stone.local/api?rsdl) not working for me - gets http 500 error with "java.lang.NullPointerException" java.lang.NullPointerException org.ovirt.engine.api.restapi.resource.BackendApiResource.addSystemVersion(BackendApiResource.java:358) org.ovirt.engine.api.restapi.resource.BackendApiResource.get(BackendApiResource.java:316) sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) java.lang.reflect.Method.invoke(Method.java:606) org.jboss.resteasy.core.MethodInjectorImpl.invoke(MethodInjectorImpl.java:155) org.jboss.resteasy.core.ResourceMethod.invokeOnTarget(ResourceMethod.java:257) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:222) org.jboss.resteasy.core.ResourceMethod.invoke(ResourceMethod.java:211) org.jboss.resteasy.core.SynchronousDispatcher.getResponse(SynchronousDispatcher.java:525) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:502) org.jboss.resteasy.core.SynchronousDispatcher.invoke(SynchronousDispatcher.java:119) org.jboss.resteasy.plugins.server.servlet.ServletContainerDispatcher.service(ServletContainerDispatcher.java:208) org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:55) org.jboss.resteasy.plugins.server.servlet.HttpServletDispatcher.service(HttpServletDispatcher.java:50) javax.servlet.http.HttpServlet.service(HttpServlet.java:847) api page (without "?rsdl") opens just fine. Feature "manage storage connections" was mentioned in oVirt-3.3 release notes (http://www.ovirt.org/OVirt_3.3_release_notes), under "storage" section. 
Yuriy Demchenko On 09/22/2013 11:00 AM, Michael Pasternak wrote: Hi Yuriy, please see the correct way of adding new storageconnection at [1], under section, also please let us know if you find any inconsistency between the wiki and rsdl, regarding the UnmarshalException, i'm not sure this feature got in to the version you're using, Ofer, in what release mentioned feature is available? thanks. [1] https://ovirt.spb.stone.local/api?rsdl On 09/18/2013 03:21 PM, Yuriy Demchenko wrote: Hi, I've recently upgraded my test lab to ovirt-3.3 (el6) and trying to add additional target for iscsi domain. As described here - http://www.ovirt.org/Features/Manage_Storage_Connections , I'm trying first to add new connection via restapi, but operation fails with error "HTTP Status 400 - javax.xml.bind.UnmarshalException: unexpected element (uri:"", local:"storage_connection")" I'm not very familiar with restapi and maybe doing something wrong, so please help me to figure it out. here's what i put and reply from server: curl -k -v -u "admin@internal:pass" -H "Content-type: application/xml" -d ' iscsi 192.168.221.5 3260 iqn.2013-09.local.stone.spb:target3.disk ' 'https://ovirt.spb.stone.local/api/storageconnections' * About to connect() to ovirt.spb.stone.local port 443 (#0) * Trying 192.168.220.13... 
* connected * Connected to ovirt.spb.stone.local (192.168.220.13) port 443 (#0) * Initializing NSS with certpath: sql:/etc/pki/nssdb * warning: ignoring value of ssl.verifyhost * skipping SSL peer certificate verification * SSL connection using TLS_DHE_RSA_WITH_AES_256_CBC_SHA * Server certificate: * subject: CN=ovirt.spb.stone.local,O=spb.stone.local,C=US * start date: Aug 28 09:28:45 2013 GMT * expire date: Aug 03 09:28:47 2018 GMT * common name: ovirt.spb.stone.local * issuer: CN=CA-ovirt.spb.stone.local.95565,O=spb.stone.local,C=US * Server auth using Basic with user 'admin@internal' POST /api/storageconnections HTTP/1.1 Authorization: Basic YWRtaW5AaW50ZXJuYWw6bXAyMjFjMg== User-Agent: curl/7.24.0 (x86_64-redhat-linux-gnu) libcurl/7.24.0 NSS/3.14.3.0 zlib/1.2.5 libidn/1.24 libssh2/1.4.1 Host: ovirt.spb.stone.local Accept: */* Content-type: application/xml Content-Length: 170 * upload completely sent off: 170 out of 170 bytes < HTTP/1.1 400 Bad Request < Date: Wed, 18 Sep 2013 12:05:51 GMT < Content-Type: text/html;charset=utf-8 < Vary: Accept-Encoding < Connection: close < Transfer-Encoding: chunked < JBoss Web/7.0.13.Final - Error report<!--H1 {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:22px;} H2 {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:16px;} H3 {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:14px;} BODY {font-family:Tahoma,Arial,sans-serif;color:black;background-color:white;} B {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;} P {font-family:Tahoma,Arial,sans-serif;background:whit
[Users] cant add storage connection via api
rted_device_types>,<{}reported_devices>,<{}request>,<{}response>,<{}role>,<{}roles>,<{}rsdl>,<{}scheduling_policies>,<{}schema>,<{}server_hook>,<{}server_hooks>,<{}sgio_options>,<{}slaves>,<{}snapshot>,<{}snapshots>,<{}special_objects>,<{}ssh>,<{}stages>,<{}statistic>, <{}statistics>,<{}status>,<{}step>,<{}step_types>,<{}steps>,<{}storage>,<{}storage_connections>,<{}storage_domain>,<{}storage_domain_states>,<{}storage_domain_types>,<{}storage_domains>,<{}storage_formats>,<{}storage_manager>,<{}storage_types>,<{}summary>,<{}system_version>,<{}tag>,<{}tags>,<{}template>,<{}template_states>,<{}templates>,<{}transparent_hugepages>,<{}transport_types>,<{}url>,<{}usages>,<{}usb>,<{}user>,<{}users>,<{}value>,<{}values>,<{}vcpu_pin>,<{}version>,<{}vlan>,<{}vm>,<{}vm_affinities>,<{}vm_device_types>,<{}vm_pause_details>,<{}vm_states>,<{}vm_types>,<{}vmpool>,<{}vmpools>,<{}vms>,<{}volume_group>,<{}watchdog>,<{}watchdog_actions>,<{}watchdog_models>,<{}watchdogs>>description The request sent by the client was syntactically incorrect (javax.xml.bind.UnmarshalException: unexpected element (uri:"", local:"storage_connection"). 
Expected elements are <{}action>,<{}agent>,<{}agents>,<{}api>,<{}application>,<{}applications>,<{}authentication_methods>,<{}body>,<{}bonding>,<{}boot_devices>,<{}boot_protocols>,<{}brick>,<{}brick_details>,<{}brick_memoryinfo>,<{}brick_states>,<{}bricks>,<{}capabilities>,<{}cdrom>,<{}cdroms>,<{}certificate>,<{}cluster>,<{}clusters>,<{}console>,<{}content_types>,<{}cpu>,<{}cpu_modes>,<{}cpu_tune>,<{}cpus>,<{}creation>,<{}creation_states>,<{}custom_properties>,<{}data_center>,<{}data_center_states>,<{}data_centers>,<{}detailedLink>,<{}detailedLinks>,<{}disk>,<{}disk_formats>,<{}disk_interfaces>,<{}disk_states>,<{}disks>,<{}display>,<{}display_types>,<{}domain>,<{}domains>,<{}error_handling>,<{}event>,<{}events>,<{}fault>,<{}feature>,<{}features>,<{}fence_types>,<{}file>,<{}files>,<{}floppies>,<{}floppy>,<{}general_metadata>,<{}gluster_client>,<{}gluster_clients>,<{}gluster_hook>,<{}gluster_volume>,<{}gluster_volume_states>,<{}gluster_volume_types>,<{}gluster_volumes>,<{}glusterhooks>,<{}group>,<{}groups>,<{}hardware_information>,<{}header>,<{}headers>,<{}hook>,<{}hook_states>,<{}hooks>,<{}host>,<{}host_nic>,<{}host_nic_states>,<{}host_nics>,<{}host_non_operational_details>,<{}host_states>,<{}host_storage>,<{}hosts>,<{}ip>,<{}ip_versions>,<{}ips>,<{}job>,<{}jobs>,<{}keyValuePair>,<{}ksm>,<{}link>,<{}linkCapabilities>,<{}logical_unit>,<{}mac>,<{}memory_pool>,<{}memory_pools>,<{}network>,<{}network_states>,<{}networks>,<{}nfs_versions>,<{}nic>,<{}nic_interfaces>,<{}nics>,<{}options>,<{}os_types>,<{}parameter>,<{}parameters_set>,<{}payload>,<{}payload_file>,<{}payloads>,<{}permission>,<{}permissions>,<{}permit>,<{}permits>,<{}pm_proxies>,<{}pm_proxy>,<{}pm_proxy_types>,<{}port_mirroring>,<{}power_management>,<{}power_management_states>,<{}power_managers>,<{}preview_vms>,<{}product_info>,<{}quota>,<{}quotas>,<{}reported_device>,<{}reported_device_types>,<{}reported_devices>,<{}request>,<{}response>,<{}role>,<{}roles>,<{}rsdl>,<{}scheduling_policies>,<{}schema>,<{}server_hook>,<{}server_hooks>,<{}sgio_options>,<{}slaves>,<{}snapshot>,<{}snapshots>,<{}special_objects>,<{}ssh>,<{}stages>,<{}statistic>,<{}statistics>,<{}status>,<{}step>,<{}step_types>,<{}steps>,<{}storage>,<{}storage_connections>,<{}storage_domain>,<{}storage_domain_states>,<{}storage_domain_types>,<{}storage_domains>,<{}storage_formats>,<{}storage_manager>,<{}storage_types>,<{}summary>,<{}system_version>,<{}tag>,<{}tags>,<{}template>,<{}template_states>,<{}templates>,<{}transparent_hugepages>,<{}transport_types>,<{}url>,<{}usages>,<{}usb>,<{}user>,<{}users>,<{}value>,<{}values>,<{}vcpu_pin>,<{}version>,<{}vlan>,<{}vm>,<{}vm_affinities>,<{}vm_device_types>,<{}vm_pause_details>,<{}vm_states>,<{}vm_types>,<{}vmpool>,<{}vmpools>,<{}vms>,<{}volume_group>,<{}watchdog>,<{}watchdog_actions>,<{}watchdog_models>,<{}watchdogs>). JBoss Web/7.0.13.Final * Closing connection #0 -- Yuriy Demchenko
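The long "Expected elements are" list in the 400 response is easier to work with once it is parsed. The sketch below extracts the element names from such a message and checks whether the root element that was sent is among them; the helper name is hypothetical and the sample message is a shortened copy of the one above.

```python
import re


def expected_elements(fault_text):
    """Hypothetical helper: element names listed after 'Expected elements are'."""
    _, _, tail = fault_text.partition("Expected elements are")
    # The message lists names as <{namespace}local>; here the namespace is empty.
    return set(re.findall(r"<\{\}([A-Za-z_]+)>", tail))


# Shortened copy of the UnmarshalException text from the response above.
sample = ('unexpected element (uri:"", local:"storage_connection"). '
          'Expected elements are <{}action>,<{}storage>,'
          '<{}storage_connections>,<{}storage_domain>')

names = expected_elements(sample)
print("storage_connection" in names)
print(sorted(n for n in names if n.startswith("storage")))
```

On the full message this would show that storage and storage_connections are listed while storage_connection is not. Which element this particular 3.3.0 build accepts for the POST is not established in the thread, so treat the result as a hint for what to try rather than as an answer.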
Re: [Users] cant remove disks from iscsi domain
If by "latest" you mean head/nightly - I'm not using it yet, I'm using packages from the stable el6 repo, so that my test env matches the production env. Errors I get: User admin@internal failed to initiate removing of disk virt-dhcp-3_Disk1 from domain iscsi-store Thread-1196::ERROR::2013-08-22 14:00:57,736::task::833::TaskManager.Task::(_setError) Task=`b691e001-7d8e-46cc-a04e-3d89ffea8fe5`::Unexpected error Traceback (most recent call last): File "/usr/share/vdsm/storage/task.py", line 840, in _run return fn(*args, **kargs) File "/usr/share/vdsm/logUtils.py", line 42, in wrapper res = f(*args, **kwargs) File "/usr/share/vdsm/storage/hsm.py", line 1434, in deleteImage raise se.ImageDoesNotExistInSD(imgUUID, sdUUID) ImageDoesNotExistInSD: Image does not exist in domain: 'image=dde47a09-81d9-4592-87e4-92398ea834ce, domain=d786e2d5-05ab-4da6-95fc-1af791a3c113' Maybe just run some SQL query to remove them from the DB? Yuriy Demchenko On 08/22/2013 01:55 PM, Dafna Ron wrote: deleting illegal objects from UI was added to the latest ovirt/vdsm are you getting any errors when trying to delete them now? On 08/22/2013 10:46 AM, Yuriy Demchenko wrote: Thanks a lot! The patch did the job, a few more disks were deleted successfully. One last question - how do I remove stale records of disks in "illegal" state? Yuriy Demchenko On 08/22/2013 01:25 PM, Eduardo Warszawski wrote: - Original Message - you said that the disks are deleted although an exception is given but the engine is reporting the delete as failed. I think that I remember a bug reported and fixed on that as well but I can't seem to find it. Adding Ayal and Eduardo The log issue was already solved in v4.11.0~380. commit ad916c79e2b0959dea20dd19a21b99bc702d65ca Author: Eduardo Warszawski Date: Mon Dec 17 14:32:51 2012 +0200 Fix typo in negative flow log in blockSD.rmDCImgDir(). Related to BZ#885489.
Change-Id: I951e582acc86e08d709da4249084015660fc4ea0 Signed-off-by: Eduardo Reviewed-on: http://gerrit.ovirt.org/10153 Reviewed-by: Yeela Kaplan Reviewed-by: Ayal Baron Tested-by: Dan Kenigsberg diff --git a/vdsm/storage/blockSD.py b/vdsm/storage/blockSD.py index b5447cd..22a434b 100644 --- a/vdsm/storage/blockSD.py +++ b/vdsm/storage/blockSD.py @@ -978,7 +978,7 @@ class BlockStorageDomain(sd.StorageDomain): try: os.rmdir(imgPath) except OSError: -self.log.warning("Can't rmdir %s. %s", imgPath, exc_info=True) +self.log.warning("Can't rmdir %s", imgPath, exc_info=True) else: self.log.debug("removed image dir: %s", imgPath) return imgPath On 08/22/2013 07:55 AM, Yuriy Demchenko wrote: I've done some more tests - and it seems quota error is not related to my issue: I tried to remove another disk and this time there were no quota errors in engine.log New logs in attach. What catches my eye in logs is this errors, but maybe that's not the root of case: Thread-60725::DEBUG::2013-08-22 10:37:45,549::lvm::485::OperationMutex::(_invali datevgs) Operation 'lvm invalidate operation' released the operation mutex Thread-60725::WARNING::2013-08-22 10:37:45,549::blockSD::931::Storage.StorageDom ain::(rmDCVolLinks) Can't unlink /rhev/data-center/mnt/blockSD/d786e2d5-05ab-4da 6-95fc-1af791a3c113/images/5344ca63-302a-43de-9193-da7937fbdfad/dfefc573-de85-40 85-8900-da271affe831. [Errno 2] No such file or directory: '/rhev/data-center/mn t/blockSD/d786e2d5-05ab-4da6-95fc-1af791a3c113/images/5344ca63-302a-43de-9193-da 7937fbdfad/dfefc573-de85-4085-8900-da271affe831' Thread-60725::WARNING::2013-08-22 10:37:45,549::blockSD::931::Storage.StorageDom ain::(rmDCVolLinks) Can't unlink /rhev/data-center/mnt/blockSD/d786e2d5-05ab-4da 6-95fc-1af791a3c113/images/5344ca63-302a-43de-9193-da7937fbdfad/c6cd6d1d-b70f-43 5d-bdc7-713b445a2326. 
[Errno 2] No such file or directory: '/rhev/data-center/mn t/blockSD/d786e2d5-05ab-4da6-95fc-1af791a3c113/images/5344ca63-302a-43de-9193-da 7937fbdfad/c6cd6d1d-b70f-435d-bdc7-713b445a2326' Thread-60725::DEBUG::2013-08-22 10:37:45,549::blockSD::934::Storage.StorageDomai n::(rmDCVolLinks) removed: [] Thread-60725::ERROR::2013-08-22 10:37:45,549::task::833::TaskManager.Task::(_set Error) Task=`83867bdc-48cd-4ba0-b453-6f8abbace13e`::Unexpected error Traceback (most recent call last): File "/usr/share/vdsm/storage/task.py", line 840, in _run return fn(*args, **kargs) File "/usr/share/vdsm/logUtils.py", line 42, in wrapper res = f(*args, **kwargs) File "/usr/share/vdsm/storage/hsm.py", line 1460, in deleteImage dom.deleteImage(sdUUID, imgUUID, volsByImg) File "/usr/share/vdsm/storage/blockSD.py", line 957, in deleteImage self.rm
Re: [Users] cant remove disks from iscsi domain
Thanks a lot! The patch did the job, a few more disks were deleted successfully.
One last question - how do I remove stale records of disks in "illegal" state?

Yuriy Demchenko

On 08/22/2013 01:25 PM, Eduardo Warszawski wrote:
- Original Message -
You said that the disks are deleted although an exception is given, but the engine is reporting the delete as failed. I think I remember a bug reported and fixed on that as well, but I can't seem to find it. Adding Ayal and Eduardo.

The log issue was already solved in v4.11.0~380.

commit ad916c79e2b0959dea20dd19a21b99bc702d65ca
Author: Eduardo Warszawski
Date:   Mon Dec 17 14:32:51 2012 +0200

    Fix typo in negative flow log in blockSD.rmDCImgDir().

    Related to BZ#885489.

    Change-Id: I951e582acc86e08d709da4249084015660fc4ea0
    Signed-off-by: Eduardo
    Reviewed-on: http://gerrit.ovirt.org/10153
    Reviewed-by: Yeela Kaplan
    Reviewed-by: Ayal Baron
    Tested-by: Dan Kenigsberg

diff --git a/vdsm/storage/blockSD.py b/vdsm/storage/blockSD.py
index b5447cd..22a434b 100644
--- a/vdsm/storage/blockSD.py
+++ b/vdsm/storage/blockSD.py
@@ -978,7 +978,7 @@ class BlockStorageDomain(sd.StorageDomain):
         try:
             os.rmdir(imgPath)
         except OSError:
-            self.log.warning("Can't rmdir %s. %s", imgPath, exc_info=True)
+            self.log.warning("Can't rmdir %s", imgPath, exc_info=True)
         else:
             self.log.debug("removed image dir: %s", imgPath)
         return imgPath

On 08/22/2013 07:55 AM, Yuriy Demchenko wrote:
I've done some more tests, and it seems the quota error is not related to my issue: I tried to remove another disk and this time there were no quota errors in engine.log. New logs are attached.

What catches my eye in the logs are these errors, but maybe they're not the root cause:

Thread-60725::DEBUG::2013-08-22 10:37:45,549::lvm::485::OperationMutex::(_invalidatevgs) Operation 'lvm invalidate operation' released the operation mutex
Thread-60725::WARNING::2013-08-22 10:37:45,549::blockSD::931::Storage.StorageDomain::(rmDCVolLinks) Can't unlink /rhev/data-center/mnt/blockSD/d786e2d5-05ab-4da6-95fc-1af791a3c113/images/5344ca63-302a-43de-9193-da7937fbdfad/dfefc573-de85-4085-8900-da271affe831. [Errno 2] No such file or directory: '/rhev/data-center/mnt/blockSD/d786e2d5-05ab-4da6-95fc-1af791a3c113/images/5344ca63-302a-43de-9193-da7937fbdfad/dfefc573-de85-4085-8900-da271affe831'
Thread-60725::WARNING::2013-08-22 10:37:45,549::blockSD::931::Storage.StorageDomain::(rmDCVolLinks) Can't unlink /rhev/data-center/mnt/blockSD/d786e2d5-05ab-4da6-95fc-1af791a3c113/images/5344ca63-302a-43de-9193-da7937fbdfad/c6cd6d1d-b70f-435d-bdc7-713b445a2326. [Errno 2] No such file or directory: '/rhev/data-center/mnt/blockSD/d786e2d5-05ab-4da6-95fc-1af791a3c113/images/5344ca63-302a-43de-9193-da7937fbdfad/c6cd6d1d-b70f-435d-bdc7-713b445a2326'
Thread-60725::DEBUG::2013-08-22 10:37:45,549::blockSD::934::Storage.StorageDomain::(rmDCVolLinks) removed: []
Thread-60725::ERROR::2013-08-22 10:37:45,549::task::833::TaskManager.Task::(_setError) Task=`83867bdc-48cd-4ba0-b453-6f8abbace13e`::Unexpected error
Traceback (most recent call last):
  File "/usr/share/vdsm/storage/task.py", line 840, in _run
    return fn(*args, **kargs)
  File "/usr/share/vdsm/logUtils.py", line 42, in wrapper
    res = f(*args, **kwargs)
  File "/usr/share/vdsm/storage/hsm.py", line 1460, in deleteImage
    dom.deleteImage(sdUUID, imgUUID, volsByImg)
  File "/usr/share/vdsm/storage/blockSD.py", line 957, in deleteImage
    self.rmDCImgDir(imgUUID, volsImgs)
  File "/usr/share/vdsm/storage/blockSD.py", line 943, in rmDCImgDir
    self.log.warning("Can't rmdir %s. %s", imgPath, exc_info=True)
  File "/usr/lib64/python2.6/logging/__init__.py", line 1068, in warning
    self._log(WARNING, msg, args, **kwargs)
  File "/usr/lib64/python2.6/logging/__init__.py", line 1173, in _log
    self.handle(record)
  File "/usr/lib64/python2.6/logging/__init__.py", line 1183, in handle
    self.callHandlers(record)
  File "/usr/lib64/python2.6/logging/__init__.py", line 1220, in callHandlers
    hdlr.handle(record)
  File "/usr/lib64/python2.6/logging/__init__.py", line 679, in handle
    self.emit(record)
  File "/usr/lib64/python2.6/logging/handlers.py", line 780, in emit
    msg = self.format(record)
  File "/usr/lib64/python2.6/logging/__init__.py", line 654, in format
    return fmt.format(record)
  File "/usr/lib64/python2.6/logging/__init__.py", line 436, in format
    record.message = record.getMessage()
  File "/usr/lib64/python2.6/logging/__init__.py", line 306, in getMessage
    msg = msg % self.
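
For anyone wondering why a log call can make the whole delete fail: the traceback above ends inside the logging module's %-formatting, and the engine reports "not enough arguments for format string". A minimal sketch of that failure, outside vdsm, might look like the following. The helper name safe_format and the placeholder path are made up for illustration; only the two format strings come from the quoted patch.

```python
def safe_format(msg, args):
    """Apply %-formatting the same way a log record builds its message.

    Returns the formatted text, or a short description of the failure.
    """
    try:
        return msg % args
    except TypeError as exc:
        return "formatting failed: %s" % exc


# Placeholder path for illustration only, not a real image directory.
img_path = "/rhev/data-center/mnt/blockSD/SD_UUID/images/IMG_UUID"

# Old message: two %s placeholders, but only one argument is passed.
broken = safe_format("Can't rmdir %s. %s", (img_path,))

# Patched message: one placeholder, one argument.
fixed = safe_format("Can't rmdir %s", (img_path,))

print(broken)
print(fixed)
```

So the rmdir failure itself was harmless (the directory was already gone); it was the malformed warning about it that raised and aborted the task, which would explain why the LVs were removed while the engine still marked the disk as illegal.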
Re: [Users] cant remove disks from iscsi domain
On 08/22/2013 11:17 AM, Doron Fediuck wrote:
Yuri, just to clarify the quota part: the stack trace you provided was resolved as the bz Greg described - https://bugzilla.redhat.com/show_bug.cgi?id=905891
This is unrelated to the storage issue, as the quota caching is an independent procedure running in parallel. So what we now need to focus on is this part:

[Errno 2] No such file or directory: '/rhev/data-center/mnt/blockSD/d786e2d5-05ab-4da6-95fc-1af791a3c113/images/5344ca63-302a-43de-9193-da7937fbdfad/dfefc573-de85-4085-8900-da271affe831'

What we need to understand is how we got into this state. Did you have network issues or relevant crashes?

Well, I had a few power loss events (it's my test env, so no UPS installed yet) where storage, engine and nodes were powered off uncleanly; my storage is kind of slow - just raid5 on 3 SATA HDDs. But there were no network issues I'm aware of; the iscsi network is in a separate vlan with a 1gbps physical link. Apart from the power loss events I had no crashes of engine or nodes.

Yuriy Demchenko
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
[Users] cant remove disks from iscsi domain
Hi,

I've recently encountered a problem with removing disks from an iscsi domain in my test lab - I just can't remove any. The remove operation fails with the message "User admin@internal failed to initiate removing of disk pg-slave1_opt from domain iscsi-store" and a re-election of the SPM. After that the disk is marked as "illegal" in the ovirt web interface; however, it is _in fact_ removed from storage - lvdisplay doesn't show it and free space is updated correctly. And this happens with just about every disk/vm I try to remove.

ovirt 3.2.2-el6
centos 6.4
vdsm-4.10.3-17.el6
lvm2-2.02.98-9.el6

Any tips? Logs are attached.

--
Yuriy Demchenko

2013-08-21 17:52:21,156 INFO [org.ovirt.engine.core.bll.RemoveDiskCommand] (ajp--127.0.0.1-8702-8) Lock Acquired to object EngineLock [exclusiveLocks= key: 59471028-8157-484b-9b2c-98c0eb431ba7 value: DISK , sharedLocks= key: 2be220e7-6e46-4d12-9ab5-e58416225455 value: VM ]
2013-08-21 17:52:21,273 INFO [org.ovirt.engine.core.bll.RemoveDiskCommand] (pool-3-thread-45) [4b3c8227] Running command: RemoveDiskCommand internal: false. Entities affected : ID: 59471028-8157-484b-9b2c-98c0eb431ba7 Type: Disk
2013-08-21 17:52:21,278 INFO [org.ovirt.engine.core.bll.RemoveImageCommand] (pool-3-thread-45) [fe46de8] Running command: RemoveImageCommand internal: true. Entities affected : ID: d786e2d5-05ab-4da6-95fc-1af791a3c113 Type: Storage
2013-08-21 17:52:21,299 INFO [org.ovirt.engine.core.bll.RemoveImageCommand] (pool-3-thread-45) [fe46de8] Lock freed to object EngineLock [exclusiveLocks= key: 59471028-8157-484b-9b2c-98c0eb431ba7 value: DISK , sharedLocks= key: 2be220e7-6e46-4d12-9ab5-e58416225455 value: VM ]
2013-08-21 17:52:21,306 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.DeleteImageGroupVDSCommand] (pool-3-thread-45) [fe46de8] START, DeleteImageGroupVDSCommand( storagePoolId = 5849b030-626e-47cb-ad90-3ce782d831b3, ignoreFailoverLimit = false, compatabilityVersion = 3.2, storageDomainId = d786e2d5-05ab-4da6-95fc-1af791a3c113, imageGroupId = 59471028-8157-484b-9b2c-98c0eb431ba7, postZeros = false, forceDelete = false), log id: 3e928634
2013-08-21 17:52:22,170 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-3-thread-45) [fe46de8] Failed in DeleteImageGroupVDS method
2013-08-21 17:52:22,171 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-3-thread-45) [fe46de8] Error code GeneralException and error message IRSGenericException: IRSErrorException: Failed to DeleteImageGroupVDS, error = not enough arguments for format string
2013-08-21 17:52:22,172 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (pool-3-thread-45) [fe46de8] IrsBroker::Failed::DeleteImageGroupVDS due to: IRSErrorException: IRSGenericException: IRSErrorException: Failed to DeleteImageGroupVDS, error = not enough arguments for format string
2013-08-21 17:52:22,343 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-45) [fe46de8] START, SpmStopVDSCommand(HostName = node-2, HostId = e499def4-61ad-4801-8fe3-36f6f91c70a8, storagePoolId = 5849b030-626e-47cb-ad90-3ce782d831b3), log id: a2093f6
2013-08-21 17:52:22,378 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-45) [fe46de8] SpmStopVDSCommand::Stopping SPM on vds node-2, pool id 5849b030-626e-47cb-ad90-3ce782d831b3
2013-08-21 17:52:23,009 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-45) [fe46de8] FINISH, SpmStopVDSCommand, log id: a2093f6
2013-08-21 17:52:23,012 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (pool-3-thread-45) [fe46de8] Irs placed on server e499def4-61ad-4801-8fe3-36f6f91c70a8 failed. Proceed Failover
2013-08-21 17:52:23,065 INFO [org.ovirt.engine.core.bll.storage.SetStoragePoolStatusCommand] (pool-3-thread-45) [1377b2ae] Running command: SetStoragePoolStatusCommand internal: true. Entities affected : ID: 5849b030-626e-47cb-ad90-3ce782d831b3 Type: StoragePool
2013-08-21 17:52:23,222 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (pool-3-thread-45) [1377b2ae] hostFromVds::selectedVds - node-1, spmStatus Free, storage pool Default
2013-08-21 17:52:23,225 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (pool-3-thread-45) [1377b2ae] SPM Init: could not find reported vds or not up - pool:Default vds_spm_id: 2
2013-08-21 17:52:23,256 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (pool-3-thread-45) [1377b2ae] SPM selection - vds seems as spm node-2
2013-08-21 17:52:23,287 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (pool-3-thread-45) [1377b2ae] START, SpmStopVDSCommand(HostName = node-2, HostId = e499def4-61ad-4801-8fe3-36f6f91c70a8, storagePoolId = 5849b030-626e-47cb-ad90-3ce782d831b3), log id: 6bfcf09
Re: [Users] Linux sysprep
Yeah, that's right - I use dmidecode to get the VM name in my init-script; I found it more convenient than reading files. Besides, I ran into the same problem as you with the payload - it was added to the domxml via the hook, but somehow wasn't available inside the VM. I never did any more troubleshooting of that problem, though; the bios hack was enough for me.

Yuriy Demchenko

On 08/21/2013 01:32 PM, René Koch (ovido) wrote:
Thanks a lot, Yuriy. That's an interesting solution you came up with, although I'm unsure if I like using bios fields for this. Besides this "little bios hack" you're doing the same as with payloads, except you're probably using dmidecode in your init-script to get the vm name instead of mounting a floppy and reading the content of a file.
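
In case it helps anyone reading the archive, the guest-side half of this could be sketched roughly as below. This is not the actual init-script from the thread; it only assumes that the hook wrote the VM name into the 'SKU Number' field and that the guest can get at the text printed by `dmidecode -t system`. The function name and the sample text are made up for illustration.

```python
import re


def parse_sku_number(dmidecode_output):
    """Return the 'SKU Number' value from dmidecode-style text, or None."""
    for line in dmidecode_output.splitlines():
        match = re.match(r"\s*SKU Number:\s*(.*\S)\s*$", line)
        if match:
            return match.group(1)
    return None


# Illustrative sample only, shaped like the System Information section
# that `dmidecode -t system` prints inside a guest.
SAMPLE = (
    "System Information\n"
    "\tManufacturer: oVirt\n"
    "\tProduct Name: oVirt Node\n"
    "\tSKU Number: virt-gate\n"
)

vm_name = parse_sku_number(SAMPLE)
print(vm_name)
```

In a real first-boot script the text would come from running dmidecode as root (for example via subprocess) instead of SAMPLE, and the result could then be used to set the hostname or register the machine.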
Re: [Users] Linux sysprep
Although you've already got some answers with the payload/cloud-init idea, I can point to another solution I've used myself: http://lists.ovirt.org/pipermail/users/2013-June/014907.html
Use a vdsm hook to write the VM name into some bios field, then a one-time init-script inside the template to configure your VM as you need.

Yuriy Demchenko

On 08/19/2013 06:17 PM, René Koch (ovido) wrote:
Another option would be to write a custom script which is started during boot and disables itself after a successful run (in the same way as firstboot - I already have such a script for RHN Satellite/Spacewalk joins). The problem here is: how do I get the (oVirt) name of this vm (I would need something like virt-whoami :) )? Is the (internal oVirt) ID of this vm stored somewhere in the filesystem of this vm? I don't think so.
Re: [Users] which file system for shared disk?
On 07/11/2013 02:45 PM, Piotr Szubiakowski wrote:
On 07/11/2013 12:52 PM, Piotr Szubiakowski wrote:
The way that oVirt manages storage domains accessed via FC is very smart. There is a separate logical volume for each virtual disk. But I think that a logical volume can be "touched" by only one host at a time. Is it possible for two hosts to read/write the same logical volume without data corruption?

hence a shared disk over block storage using LVM must be pre-allocated, so no LV changes (lv extend) would be needed. (also, it cannot have snapshots, since it would become qcow)

Ok, but this is the hypervisor view. For a guest OS this LV is a normal RAW block device. I wonder if someone has tested this feature and accessed a shared disk from many VMs at the same time?

I did some brief tests - 3 VMs, shared disk, cman/pacemaker + gfs2 - and had no problems using it, no data corruption. Although those were only basic tests, like create/move/delete files, no extensive usage/stress or anything like that.

Thanks Yuri,
Is fencing handled by cman/pacemaker? I read about the fenced daemon and it seems to be difficult to use it together with oVirt.
Piotr

Yes, fencing is handled by pacemaker, with a fence_rhev stonith resource - parameters according to the man page, with one exception: I had to add the additional parameter pcmk_host_list="vmname", so that pacemaker would know that this stonith device belongs to that VM.
Re: [Users] which file system for shared disk?
On 07/11/2013 12:52 PM, Piotr Szubiakowski wrote:
The way that oVirt manages storage domains accessed via FC is very smart. There is a separate logical volume for each virtual disk. But I think that a logical volume can be "touched" by only one host at a time. Is it possible for two hosts to read/write the same logical volume without data corruption?

hence a shared disk over block storage using LVM must be pre-allocated, so no LV changes (lv extend) would be needed. (also, it cannot have snapshots, since it would become qcow)

Ok, but this is the hypervisor view. For a guest OS this LV is a normal RAW block device. I wonder if someone has tested this feature and accessed a shared disk from many VMs at the same time?

I did some brief tests - 3 VMs, shared disk, cman/pacemaker + gfs2 - and had no problems using it, no data corruption. Although those were only basic tests, like create/move/delete files, no extensive usage/stress or anything like that.
Re: [Users] ovirt 3.2.1 custom bonding options
You should put "mode=0 miimon=100" in the "custom mode" field. That would produce the string BONDING_OPTS="mode=0 miimon=100" in the ifcfg file.

Yuriy Demchenko

On 06/20/2013 09:16 PM, Sven Knohsalla wrote:
Update: I meant balance-rr (mode 0), typo.

Sven Knohsalla | System Administration | Netbiscuits GmbH

From: Sven Knohsalla
Sent: Thursday, 20 June 2013 19:14
To: users@ovirt.org
Cc: Sven Knohsalla
Subject: ovirt 3.2.1 custom bonding options

Hi,

I just have a tiny question regarding bonding & custom mode: if I want to run a bond in mode 2 with miimon=100, what does the custom text field need to look like:
BONDING_OPTS="miimon=100 mode=0" ?
(that would be equivalent to the ifcfg config file)

Thanks in advance!

Cheers,
Sven.
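
Since mode numbers and names got mixed up in this thread, here is a small sketch of how the two strings relate. The mode table lists the numeric modes of the Linux bonding driver; the helper names are made up for illustration and are not part of oVirt or vdsm.

```python
# Numeric bonding modes and the names the Linux bonding driver uses for them.
BONDING_MODES = {
    0: "balance-rr",
    1: "active-backup",
    2: "balance-xor",
    3: "broadcast",
    4: "802.3ad",
    5: "balance-tlb",
    6: "balance-alb",
}


def custom_bonding_options(mode, miimon):
    """Build the text for the 'custom mode' field, e.g. 'mode=0 miimon=100'."""
    if mode not in BONDING_MODES:
        raise ValueError("unknown bonding mode: %r" % (mode,))
    return "mode=%d miimon=%d" % (mode, miimon)


def ifcfg_line(options):
    """Show the ifcfg line that the custom options are expected to end up as."""
    return 'BONDING_OPTS="%s"' % options


options = custom_bonding_options(0, 100)
print(options)              # text for the custom mode field
print(ifcfg_line(options))  # resulting line in the ifcfg file
```

Note that only the bare options go into the custom field, without the BONDING_OPTS= prefix or quotes, and that mode 2 would be balance-xor rather than balance-rr.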
Re: [Users] Obtain VM name from within VM?
On 06/19/2013 02:31 PM, Itamar Heim wrote:
On 06/18/2013 10:59 AM, Yuriy Demchenko wrote:
Hi, Is VMpayload supported by oVirt 3.2.2 el6?

it should be. have you tried first passing payload via the engine?

Erm, I don't quite understand what you mean - via the admin web interface? There are no fields/options related to payload in the web interface, neither in the 'edit vm' dialog nor in the 'runonce' dialog (only 'attach cd/floppy', 'custom properties' with a fixed choice-list, and general options).

However, I've achieved my goal another way: I wrote a hook that fills the 'SKU Number' bios field (not used by default) with the VM name. It turned out to be an even simpler and more convenient solution than vmpayload/fileinject. The hook is attached, in case someone is interested.

    #!/usr/bin/python

    import os
    import sys
    import hooking
    import traceback

    '''
    vmname inject vdsm hook

    hook is getting VM name and writes it into bios 'SKU Number' field.
    '''

    if True:
        try:
            domxml = hooking.read_domxml()

            # VM name is the text of the first <name> element
            names = domxml.getElementsByTagName('name')[0]
            vmname = names.childNodes[0].nodeValue

            # add <entry name="sku">vmname</entry> to the <system> element
            system = domxml.getElementsByTagName('system')[0]
            entry = domxml.createElement('entry')
            entry.setAttribute('name', 'sku')
            text = domxml.createTextNode(vmname)
            entry.appendChild(text)
            system.appendChild(entry)

            hooking.write_domxml(domxml)
        except:
            sys.stderr.write('name-inject: [unexpected error]: %s\n' % traceback.format_exc())
            sys.exit(2)
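
The DOM part of the hook above can be tried without a vdsm host. The sketch below assumes that hooking.read_domxml() hands back an xml.dom.minidom document (the calls used in the hook suggest so), and the sample domain XML is a cut-down illustration, not real vdsm output.

```python
from xml.dom import minidom

# Cut-down, illustrative libvirt domain XML; a real one has many more elements.
SAMPLE_DOMXML = """<domain type="kvm">
  <name>virt-gate</name>
  <sysinfo type="smbios">
    <system>
      <entry name="manufacturer">oVirt</entry>
    </system>
  </sysinfo>
</domain>"""


def inject_sku(domxml):
    """Append <entry name="sku">VMNAME</entry> to the <system> element."""
    name_node = domxml.getElementsByTagName('name')[0]
    vmname = name_node.childNodes[0].nodeValue
    system = domxml.getElementsByTagName('system')[0]
    entry = domxml.createElement('entry')
    entry.setAttribute('name', 'sku')
    entry.appendChild(domxml.createTextNode(vmname))
    system.appendChild(entry)
    return domxml


doc = inject_sku(minidom.parseString(SAMPLE_DOMXML))
sku_entries = [e for e in doc.getElementsByTagName('entry')
               if e.getAttribute('name') == 'sku']
print(sku_entries[0].toxml())
```

This only exercises the XML manipulation; whether the guest then sees the value in dmidecode depends on the domain actually using smbios sysinfo, as it did in the setup described in this thread.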
Re: [Users] Obtain VM name from within VM?
Hi,

Is VMpayload supported by oVirt 3.2.2 el6? I'm trying to set up a startup hook to pass the VM name via payload and I'm getting nothing...

My hook code is quite simple (and maybe crude):

    domxml = hooking.read_domxml()
    domain = domxml.getElementsByTagName('domain')[0]
    payloads = domxml.createElement('payloads')
    npayload = domxml.createElement('payload')
    npayload.setAttribute('type','cdrom')
    filename = domxml.createElement('file')
    filename.setAttribute('name','vmname.txt')
    content = domxml.createElement('content')
    names = domxml.getElementsByTagName('name')[0]
    vmname = names.childNodes[0].nodeValue
    text = domxml.createTextNode(vmname)
    content.appendChild(text)
    filename.appendChild(content)
    npayload.appendChild(filename)
    payloads.appendChild(npayload)
    domain.appendChild(payloads)
    hooking.write_domxml(domxml)

and from vdsm.log I can see that the 'payloads' elements are created successfully:

Thread-56974::DEBUG::2013-06-18 11:43:49,612::libvirtvm::1520::vm.Vm::(_run) vmId=`a21a2ec8-c7fd-4a0a-8cbb-5f0ab4faa960`::encoding="utf-8"?> virt-gate a21a2ec8-c7fd-4a0a-8cbb-5f0ab4faa960 524288 524288 1 path="/var/lib/libvirt/qemu/channels/virt-gate.com.redhat.rhevm.vdsm"/> path="/var/lib/libvirt/qemu/channels/virt-gate.org.qemu.guest_agent.0"/> passwdValidTo="1970-01-01T00:00:01" port="-1" tlsPort="-1" type="spice"> type="pci"/> type="pci"/> type="pci"/> type="pci"/> dev="/rhev/data-center/5849b030-626e-47cb-ad90-3ce782d831b3/d786e2d5-05ab-4da6-95fc-1af791a3c113/images/4927f83f-425e-41ef-aa12-3bc249d7064b/caa56376-deab-41cd-a27c-d35fe8f33946"/> 4927f83f-425e-41ef-aa12-3bc249d7064b type="raw"/> hvm oVirt oVirt Node 6-4.el6.centos.10 A6902500-EA00-0706-0025-90A600EA0E0F a21a2ec8-c7fd-4a0a-8cbb-5f0ab4faa960 Nehalem name="vmname.txt">virt-gate

I tried with both 'cdrom' and 'floppy' types. However, inside the guest OS (centos 6.4) I couldn't see any payload: /dev/sr0 contains nothing; modprobe floppy does not find any floppy devices.

Am I missing something, or is it just not implemented in 3.2.2-el6?

Yuriy Demchenko

On 05/25/2013 10:54 PM, Itamar Heim wrote:
http://www.ovirt.org/Features/VMPayload
That suggests I can use the API to set this up, but is this possible via the administrative UI instead? Could I do this in a creation-time VDSM hook? Thanks,

1. yes, you can do it via a hook using custom properties (that's what the floppy/iso inject hooks were for, but they are now deprecated as the vm payload makes them redundant).
2. RHEV has this in 3.1. iirc, it's only available via the API though[1]
cloud init should be available from the UI via the runonce dialog.
Re: [Users] way to edit iSCSI storage domain?
They're used only as an iSCSI bridge; the VMs would be running on other servers (blades) that have no FC connectivity.

Yuriy Demchenko

On 04/03/2013 07:15 PM, Shu Ming wrote:
I am still confused whether SERV1/SERV2 are used only as an iSCSI bridge to the storage, or as VDSM hosts as well. If they are used as VDSM hosts and both of them have an FC channel to the storage, why not create an FC storage domain for them instead of an iSCSI domain?
Re: [Users] way to edit iSCSI storage domain?
Oh, now I see - my mistake was putting the storage domain into maintenance. When it's in maintenance, all options are greyed out, but when it's active, the discover & login options are available.
Thanks for the help!

Yuriy Demchenko

On 04/03/2013 07:05 PM, Alex Leonhardt wrote:
but you can add an iscsi disk to an existing iscsi domain ... see attached screenshot ... if the disk you're adding has the same LUN ID as the already existing one, ovirt will just "add it" as a 2nd / 3rd / 4th and so forth path ...

On 3 April 2013 10:49, Yuriy Demchenko wrote:
You mean add the new path by hand on each node via iscsiadm? And how would those changes survive possible node reboots / reinstalls? I suppose they wouldn't. In the ovirt webadmin I cannot edit an added domain - the connection information is greyed out (even when the storage domain is in maintenance mode).

Yuriy Demchenko

On 04/03/2013 01:00 PM, Alex Leonhardt wrote:
I think i'd just add the 2nd path when the device is available ... i've recently experimented with iscsi / tgtd and multipath on an ovirt hypervisor and it will identify the disk as "the same" (new path to target) as long as the LUN ID is the same (this is taken from experience, not from a spec document) ...

On 3 April 2013 08:14, Yuriy Demchenko wrote:
I guess you misunderstood me. I'm going to try this scheme:

              |STORAGE|
      FC       /     \
    |SERV1/tgtd|     |SERV2/tgtd|
      iSCSI    \     /
         |ethernet switches|
      iSCSI
        |blades|blades|blades|

serv1/serv2 - connectivity isn't a problem, it's a multipathed FC scheme, all good. The same LUN is accessible to both servers and then exported via tgtd over iSCSI: with different target names ("iqn.2013-03.serv1:store", "iqn.2013-03.serv2:store"), but the same vendor_id, product_id, scsi_sn, scsi_id. That way a client can log in to both targets and see the LUN as a multipathed device. The multipath failover scheme (via a custom config with path_grouping_policy=failover for the corresponding vendor_id/product_id) is on the blade clients - so they use only one target at a time (no round-robin or similar stuff), but with the ability to switch to the other target in case one of serv1/serv2 is down.

However, in my case "serv2" would not be available during oVirt setup (I need to set up ovirt and virtual servers to move stuff first), so I can't enter both targets on storage domain initialization - that's why I'm asking if there's any way to edit storage domain details after initialization without destroying it (maybe directly via sql or something).

Yuriy Demchenko

On 04/02/2013 06:26 PM, Shu Ming wrote:
I am not sure if multipathd can recognize the FC path to the storage when the second server is available and regard it as the same as the iSCSI path used before. If not, I think the device under /dev/mapper may change when you cut the iSCSI path off and then enable the FC path. That will definitely corrupt the metadata of the volume group which the storage domain is sitting on, and the storage domain will finally be corrupted.
Re: [Users] way to edit iSCSI storage domain?
You mean add the new path by hand on each node via iscsiadm? And how would those changes survive possible node reboots / reinstalls? I suppose they wouldn't. In the ovirt webadmin I cannot edit an added domain - the connection information is greyed out (even when the storage domain is in maintenance mode).

Yuriy Demchenko

On 04/03/2013 01:00 PM, Alex Leonhardt wrote:
I think i'd just add the 2nd path when the device is available ... i've recently experimented with iscsi / tgtd and multipath on an ovirt hypervisor and it will identify the disk as "the same" (new path to target) as long as the LUN ID is the same (this is taken from experience, not from a spec document) ...

On 3 April 2013 08:14, Yuriy Demchenko wrote:
I guess you misunderstood me. I'm going to try this scheme:

              |STORAGE|
      FC       /     \
    |SERV1/tgtd|     |SERV2/tgtd|
      iSCSI    \     /
         |ethernet switches|
      iSCSI
        |blades|blades|blades|

serv1/serv2 - connectivity isn't a problem, it's a multipathed FC scheme, all good. The same LUN is accessible to both servers and then exported via tgtd over iSCSI: with different target names ("iqn.2013-03.serv1:store", "iqn.2013-03.serv2:store"), but the same vendor_id, product_id, scsi_sn, scsi_id. That way a client can log in to both targets and see the LUN as a multipathed device. The multipath failover scheme (via a custom config with path_grouping_policy=failover for the corresponding vendor_id/product_id) is on the blade clients - so they use only one target at a time (no round-robin or similar stuff), but with the ability to switch to the other target in case one of serv1/serv2 is down.

However, in my case "serv2" would not be available during oVirt setup (I need to set up ovirt and virtual servers to move stuff first), so I can't enter both targets on storage domain initialization - that's why I'm asking if there's any way to edit storage domain details after initialization without destroying it (maybe directly via sql or something).

Yuriy Demchenko

On 04/02/2013 06:26 PM, Shu Ming wrote:
I am not sure if multipathd can recognize the FC path to the storage when the second server is available and regard it as the same as the iSCSI path used before. If not, I think the device under /dev/mapper may change when you cut the iSCSI path off and then enable the FC path. That will definitely corrupt the metadata of the volume group which the storage domain is sitting on, and the storage domain will finally be corrupted.
Re: [Users] way to edit iSCSI storage domain?
I guess you misunderstood me. I'm going to try this scheme:

              |STORAGE|
      FC       /     \
    |SERV1/tgtd|     |SERV2/tgtd|
      iSCSI    \     /
         |ethernet switches|
      iSCSI
        |blades|blades|blades|

serv1/serv2 - connectivity isn't a problem, it's a multipathed FC scheme, all good. The same LUN is accessible to both servers and then exported via tgtd over iSCSI: with different target names ("iqn.2013-03.serv1:store", "iqn.2013-03.serv2:store"), but the same vendor_id, product_id, scsi_sn, scsi_id. That way a client can log in to both targets and see the LUN as a multipathed device. The multipath failover scheme (via a custom config with path_grouping_policy=failover for the corresponding vendor_id/product_id) is on the blade clients - so they use only one target at a time (no round-robin or similar stuff), but with the ability to switch to the other target in case one of serv1/serv2 is down.

However, in my case "serv2" would not be available during oVirt setup (I need to set up ovirt and virtual servers to move stuff first), so I can't enter both targets on storage domain initialization - that's why I'm asking if there's any way to edit storage domain details after initialization without destroying it (maybe directly via sql or something).

Yuriy Demchenko

On 04/02/2013 06:26 PM, Shu Ming wrote:
I am not sure if multipathd can recognize the FC path to the storage when the second server is available and regard it as the same as the iSCSI path used before. If not, I think the device under /dev/mapper may change when you cut the iSCSI path off and then enable the FC path. That will definitely corrupt the metadata of the volume group which the storage domain is sitting on, and the storage domain will finally be corrupted.
[Users] way to edit iSCSI storage domain?
Hi, is there any way to edit an iSCSI storage domain, i.e. to change or add targets? (The LUN, id, sn and actual content wouldn't be affected.) It should work without a VM export/import process, as downtime should be as small as possible. I'm researching the possibility of using oVirt in an environment where I have only FC storage, connected to 2 servers (sadly, the second server will only be available later, hence my question about changing targets in oVirt), and a bunch of blades on which I would like to run virtualization (no FC storage connectivity). So the idea is to share the storage via iSCSI/tgtd and use multipathd in failover mode to protect against storage corruption. -- Yuriy Demchenko
Re: [Users] oVirt 3.2.1 Beta EL6 Content is available
Ah, I see. Thanks for the clarification. I've opened a bug for this: https://bugzilla.redhat.com/show_bug.cgi?id=927874 Yuriy Demchenko On 03/26/2013 12:56 PM, Juan Hernandez wrote: Thanks, that solved my problem. However, shouldn't that be "out-of-box" for el6 packages? (just as a feedback) Unfortunately it is not as simple as it seems. The problem is that if you make that the default for el6 packages it will fail if you try to add an oVirt node as hypervisor, as that is based on Fedora where the rhel6.4.0 machine type is not supported by libvirt. So we need to set this according to the type of hypervisor, not to the distro where the engine is running. You may want to open a bug to request this.
Re: [Users] oVirt 3.2.1 Beta EL6 Content is available
Thanks, that solved my problem. However, shouldn't that be "out-of-box" for el6 packages? (just as a feedback) Yuriy Demchenko On 03/26/2013 12:34 PM, Juan Hernandez wrote: On 03/26/2013 08:57 AM, Yuriy Demchenko wrote: Just installed ovirt 3.2 from this new repo (http://resources.ovirt.org/releases/3.2/rpm/EL/6/) on centos-6.4 New VM's cannot be started, fails with internal error process exited while connecting to monitor: Supported machines are: pc RHEL 6.4.0 PC (alias of rhel6.4.0) rhel6.4.0 RHEL 6.4.0 PC (default) rhel6.3.0 RHEL 6.3.0 PC rhel6.2.0 RHEL 6.2.0 PC rhel6.1.0 RHEL 6.1.0 PC rhel6.0.0 RHEL 6.0.0 PC rhel5.5.0 RHEL 5.5.0 PC rhel5.4.4 RHEL 5.4.4 PC rhel5.4.0 RHEL 5.4.0 PC From vdsm.log i presume problem in this string "hvm", machine=pc-0.14 doesnt supported by libvirt in el6. Logs attached. Looks like same bug, that is discussed here: http://lists.ovirt.org/pipermail/users/2012-January/005930.html Searched in bugzilla and didn't found any opened bugs - should I open one or I mislooked? Try to do this: # engine-config -s EmulatedMachine=rhel6.4.0 --cver=3.2 Then restart the engine: # service ovirt-engine restart Yuriy Demchenko On 03/21/2013 05:27 PM, Mike Burns wrote: The oVirt Team is happy to announce that the EL6 rpms for the 3.2 release are now available in the beta repositories. To access this content, * download and install the release rpm [1] on your EL6 host. * Edit the ovirt.repo file in /etc/yum.repos.d to enable the ovirt-beta repository. * Follow the rest of the instructions on http://www.ovirt.org/Download. This content is in beta, so feedback on issues found is greatly appreciated. Issues can be filed in Bugzilla[2] in the Community/Ovirt project. You can also get help on IRC in #ovirt on OFTC.net or through the users@ovirt.org mailing list. 
Thanks The oVirt Team [1] http://resources.ovirt.org/releases/ovirt-release-el6-5-3.noarch.rpm [2] https://bugzilla.redhat.com/
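A side note on the error in this thread: the VM fails to start because the engine requests a machine type (pc-0.14) that the host's qemu does not list. The check can be sketched in a few lines of Python. This is only an illustration, assuming the "Supported machines are:" output has one machine per line with the name as the first token (the archive has flattened it); `parse_supported_machines` and `pick_machine` are hypothetical helper names, not part of oVirt or vdsm.

```python
# Shortened sample of the qemu message quoted in this thread (layout assumed).
QEMU_OUTPUT = """\
Supported machines are:
pc         RHEL 6.4.0 PC (alias of rhel6.4.0)
rhel6.4.0  RHEL 6.4.0 PC (default)
rhel6.3.0  RHEL 6.3.0 PC
"""


def parse_supported_machines(output):
    """Return (names, default): machine names in listing order and the
    one marked "(default)", or None if no line carries that marker."""
    names = []
    default = None
    for line in output.splitlines():
        line = line.strip()
        if not line or line.endswith(":"):
            continue  # skip blank lines and the "Supported machines are:" header
        name = line.split()[0]
        names.append(name)
        if line.endswith("(default)"):
            default = name
    return names, default


def pick_machine(requested, output):
    """Keep the requested machine type if the host lists it,
    otherwise fall back to the host's default."""
    names, default = parse_supported_machines(output)
    return requested if requested in names else default


if __name__ == "__main__":
    print(pick_machine("pc-0.14", QEMU_OUTPUT))
```

The value it returns for an unsupported request is the kind of name one would then pass to engine-config -s EmulatedMachine=... as suggested above.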
Re: [Users] oVirt 3.2.1 Beta EL6 Content is available
Just installed ovirt 3.2 from this new repo (http://resources.ovirt.org/releases/3.2/rpm/EL/6/) on centos-6.4 New VM's cannot be started, fails with internal error process exited while connecting to monitor: Supported machines are: pc RHEL 6.4.0 PC (alias of rhel6.4.0) rhel6.4.0 RHEL 6.4.0 PC (default) rhel6.3.0 RHEL 6.3.0 PC rhel6.2.0 RHEL 6.2.0 PC rhel6.1.0 RHEL 6.1.0 PC rhel6.0.0 RHEL 6.0.0 PC rhel5.5.0 RHEL 5.5.0 PC rhel5.4.4 RHEL 5.4.4 PC rhel5.4.0 RHEL 5.4.0 PC From vdsm.log i presume problem in this string "machine="pc-0.14">hvm", machine=pc-0.14 doesnt supported by libvirt in el6. Logs attached. Looks like same bug, that is discussed here: http://lists.ovirt.org/pipermail/users/2012-January/005930.html Searched in bugzilla and didn't found any opened bugs - should I open one or I mislooked? Yuriy Demchenko On 03/21/2013 05:27 PM, Mike Burns wrote: The oVirt Team is happy to announce that the EL6 rpms for the 3.2 release are now available in the beta repositories. To access this content, * download and install the release rpm [1] on your EL6 host. * Edit the ovirt.repo file in /etc/yum.repos.d to enable the ovirt-beta repository. * Follow the rest of the instructions on http://www.ovirt.org/Download. This content is in beta, so feedback on issues found is greatly appreciated. Issues can be filed in Bugzilla[2] in the Community/Ovirt project. You can also get help on IRC in #ovirt on OFTC.net or through the users@ovirt.org mailing list. Thanks The oVirt Team [1] http://resources.ovirt.org/releases/ovirt-release-el6-5-3.noarch.rpm [2] https://bugzilla.redhat.com/ ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users 2013-03-26 11:23:49,157 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (DefaultQuartzScheduler_Worker-8) No string for UNASSIGNED type. 
Use default Log 2013-03-26 11:23:50,034 INFO [org.ovirt.engine.core.bll.RunVmCommand] (ajp--127.0.0.1-8702-13) Lock Acquired to object EngineLock [exclusiveLocks= key: 0826eaf9-6116-4e96-856a-fec350c4163d value: VM , sharedLocks= ] 2013-03-26 11:23:50,200 INFO [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (ajp--127.0.0.1-8702-13) START, IsVmDuringInitiatingVDSCommand( vmId = 0826eaf9-6116-4e96-856a-fec350c4163d), log id: 2f82917c 2013-03-26 11:23:50,209 INFO [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (ajp--127.0.0.1-8702-13) FINISH, IsVmDuringInitiatingVDSCommand, return: false, log id: 2f82917c 2013-03-26 11:23:50,899 INFO [org.ovirt.engine.core.bll.RunVmCommand] (pool-3-thread-50) [6f46743f] Running command: RunVmCommand internal: false. Entities affected : ID: 0826eaf9-6116-4e96-856a-fec350c4163d Type: VM 2013-03-26 11:23:52,384 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IsoPrefixVDSCommand] (pool-3-thread-50) [6f46743f] START, IsoPrefixVDSCommand( storagePoolId = 5849b030-626e-47cb-ad90-3ce782d831b3, ignoreFailoverLimit = false, compatabilityVersion = null), log id: 6ea41f3b 2013-03-26 11:23:52,440 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IsoPrefixVDSCommand] (pool-3-thread-50) [6f46743f] FINISH, IsoPrefixVDSCommand, return: /rhev/data-center/5849b030-626e-47cb-ad90-3ce782d831b3/df8c4ca7-410f-4ec4-b383-ef995b002d13/images/----, log id: 6ea41f3b 2013-03-26 11:23:52,668 INFO [org.ovirt.engine.core.vdsbroker.CreateVmVDSCommand] (pool-3-thread-50) [6f46743f] START, CreateVmVDSCommand(HostName = node-2, HostId = 4c9b69c0-068e-4a3d-bf25-ce81946a453a, vmId=0826eaf9-6116-4e96-856a-fec350c4163d, vm=VM [w2k8]), log id: 7fe5a151 2013-03-26 11:23:52,847 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.CreateVDSCommand] (pool-3-thread-50) [6f46743f] START, CreateVDSCommand(HostName = node-2, HostId = 4c9b69c0-068e-4a3d-bf25-ce81946a453a, vmId=0826eaf9-6116-4e96-856a-fec350c4163d, vm=VM [w2k8]), log id: 395d208 2013-03-26 
11:23:53,398 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.CreateVDSCommand] (pool-3-thread-50) [6f46743f] org.ovirt.engine.core.vdsbroker.vdsbroker.CreateVDSCommand spiceSslCipherSuite=DEFAULT,memSize=2048,kvmEnable=true,smp=2,vmType=kvm,emulatedMachine=pc-0.14,keyboardLayout=en-us,nice=0,display=qxl,smartcardEnable=false,smpCoresPerSocket=2,spiceSecureChannels=smain,sinputs,scursor,splayback,srecord,sdisplay,susbredir,ssmartcard,timeOffset=14400,transparentHugePages=true,vmId=0826eaf9-6116-4e96-856a-fec350c4163d,devices=[Ljava.util.Map;@7aa354c9,acpiEnable=true,vmName=w2k8,cpuType=Nehalem,custom={} 2013-03-26 11:23:53,436 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.CreateVDSCommand] (pool-3-thread-50) [6f46743f] FINISH, CreateVDSCommand, log id: 395d208 2013-03-26 11:23:53,520 INFO [org.ovirt.engine.core.vdsbroker.CreateVmVDSCommand] (pool-3-thread-50) [6f46743f] IncreasePendingVms::CreateVmIncreasing vds node-2 pending vcpu count, now 2. Vm:
Re: [Users] power management for drac6
vdsm.log Thread-1978569::DEBUG::2013-03-21 11:30:38,769::API::1024::vds::(fenceNode) fenceNode(addr=vm2-mgt,port=,agent=drac5,user=root,passwd=,action=status,secure=,options=)

BTW, looking at this log line, I presume you hit the bug from this thread (if you are using oVirt 3.1): http://comments.gmane.org/gmane.comp.emulators.ovirt.user/3528
The options string sent by the engine never reaches fenceNode in vdsm. You can try to patch it:

diff --git a/vdsm/BindingXMLRPC.py b/vdsm/BindingXMLRPC.py
index cc5300f..8b548e4 100644
--- a/vdsm/BindingXMLRPC.py
+++ b/vdsm/BindingXMLRPC.py
@@ -357,7 +357,7 @@ class BindingXMLRPC(object):
                   secure=False, options=''):
         api = API.Global()
         return api.fenceNode(addr, port, agent, username, password,
-                             action, secure)
+                             action, secure, options)
 
     def setLogLevel(self, level):
         api = API.Global()

Yuriy Demchenko
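To make the effect of the one-line patch easier to see, here is a stand-alone Python sketch of the same pattern: a wrapper that accepts `options` but does not forward it. The function names are made up for the illustration and the stand-in API only echoes what it received; it is not the real vdsm code.

```python
def api_fence_node(addr, port, agent, username, password, action,
                   secure=False, options=''):
    """Stand-in for the API layer: report which arguments actually arrived."""
    return {'agent': agent, 'action': action,
            'secure': secure, 'options': options}


def fence_node_before_patch(addr, port, agent, username, password, action,
                            secure=False, options=''):
    # 'options' is accepted here but never passed on, so the callee
    # silently falls back to its default of ''.
    return api_fence_node(addr, port, agent, username, password,
                          action, secure)


def fence_node_after_patch(addr, port, agent, username, password, action,
                           secure=False, options=''):
    # Same call with 'options' forwarded, mirroring the patched line.
    return api_fence_node(addr, port, agent, username, password,
                          action, secure, options)


if __name__ == "__main__":
    args = ('vm2-mgt', '', 'drac5', 'root', 'secret', 'status')
    opts = 'cmd_prompt=admin1->,secure=true,slot=0'
    print(fence_node_before_patch(*args, options=opts)['options'])
    print(fence_node_after_patch(*args, options=opts)['options'])
```

The first call prints an empty string, which matches the empty `options=` seen in the vdsm.log line; the second prints the options unchanged.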
Re: [Users] power management for drac6
Hi, I've been using drac5 fence with drac7 equipment for some time now - works well you need to pass option "cmd_prompt=admin1->" (without quotes), so you missing "->" sign secure or not - depends on enabled/disabled ssh in your drac. Hope it helps. Yuriy Demchenko On 03/22/2013 05:07 AM, Thomas Scofield wrote: Has anyone been able to get a drac6 management interface working by using the drac5 fencing agent? From testing on the command line I was able to determine that we need to add the option cmd_prompt=admin1. But for some reason I can't get any options to work. From the logs below you can see that the ovirt-engine is passing the options along, but on the vdsm log those options are no longer there. ovirt_engine/engine.log 2013-03-21 11:30:33,408 INFO [org.ovirt.engine.core.bll.FenceExecutor] (ajp--127.0.0.1-8702-9) Executing Power Management command, Proxy Host:vm1, Agent:drac5, Target Host:, Management IP:vm2, User:root, Options:cmd_prompt=admin1,secure=true,slot=0 2013-03-21 11:30:33,411 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.FenceVdsVDSCommand] (ajp--127.0.0.1-8702-9) START, FenceVdsVDSCommand(HostName = vm1, HostId = 63336472-4bbd-11e2-b2fc-5bc78da11d0c, targetVdsId = f80052ad-993e-4778-9924-d4e603425d32, action = Status, ip = vm2-mgt, port = , type = drac5, user = root, password = **, options = 'cmd_prompt=admin1,secure=true,slot=0'), log id: 3773df59 2013-03-21 11:30:44,123 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.FenceVdsVDSCommand] (ajp--127.0.0.1-8702-9) FINISH, FenceVdsVDSCommand, return: Test Failed, Host Status is: unknown. 
The fence-agent script reported the following error: Unable to connect/login to fencing device , log id: 3773df59 vdsm.log Thread-1978569::DEBUG::2013-03-21 11:30:38,769::API::1024::vds::(fenceNode) fenceNode(addr=vm2-mgt,port=,agent=drac5,user=root,passwd=,action=status,secure=,options=) Thread-1978569::DEBUG::2013-03-21 11:30:44,113::API::1050::vds::(fenceNode) rc 1 in agent=fence_drac5 ipaddr=vm2-mgt login=root option=status passwd= out err Unable to connect/login to fencing device ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
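For reference, the options string in engine.log is a comma-separated list of key=value pairs. How vdsm itself splits it is not shown in this thread, so the following Python snippet is only an assumed illustration (`parse_fence_options` is a hypothetical helper); it shows that "admin1" and "admin1->" end up as different prompt values.

```python
def parse_fence_options(options):
    """Split 'k1=v1,k2=v2' into a dict; a bare flag maps to an empty string."""
    parsed = {}
    for item in options.split(','):
        item = item.strip()
        if not item:
            continue  # tolerate empty strings and trailing commas
        key, _, value = item.partition('=')  # split on the first '=' only
        parsed[key.strip()] = value.strip()
    return parsed


if __name__ == "__main__":
    print(parse_fence_options('cmd_prompt=admin1,secure=true,slot=0'))
    print(parse_fence_options('cmd_prompt=admin1->,secure=true,slot=0'))
```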
Re: [Users] Host does not comply with the cluster networks, the following networks are missing on host
Hi, after adding the host, did you do the "Setup Networks" step? It's in the "Networks" tab of the host details. You should set up the networks there and attach your logical networks vlan12 and vlan14 to the appropriate host NICs. Yuriy Demchenko On 03/19/2013 10:12 AM, w...@ieiot.cn wrote: hi, first of all thanks, yours read my mail and help me. i have a problem. while i actived host node, engine warned me: "Host does not comply with the cluster networks, the following networks are missing on host: 'vlan12,vlan14'." but cluster ->logical network, vlan12 & vlan 14 is green. where is my error? my vlans from switch.and engine.log is: 2013-03-19 14:05:22,823 ERROR [org.ovirt.engine.core.bll.SetNonOperationalVdsCommand] (pool-3-thread-48) [2333e737] Host is set to Non-Operational, it is missing the following networks: vlan12,vlan14. please help me. by soversion.
Re: [Users] ovirt 3.2 networking setup problems
CONTROLLED': 'no', 'MASTER': 'bond0', 'DEVICE': 'em2', 'STP': 'no', 'ONBOOT': 'yes'}, 'mtu': '1500', 'netmask': '', 'hwaddr': '90:b1:1c:ac:1b:04', 'speed': 1}, 'p3p4': {'addr': '', 'cfg': {'DEVICE': 'p3p4', 'HWADDR': '90:B1:1C:AC:1B:08', 'ONBOOT': 'no', 'NM_CONTROLLED': 'yes'}, 'mtu': '1500', 'netmask': '', 'hwaddr': '90:b1:1c:ac:1b:08', 'speed': 0}, 'p3p3': {'permhwaddr': '90:B1:1C:AC:1B:07', 'addr': '', 'cfg': {'SLAVE': 'yes', 'ONBOOT': 'yes', 'NM_CONTROLLED': 'no', 'MASTER': 'bond4', 'HWADDR': '90:b1:1c:ac:1b:07', 'DEVICE': 'p3p3'}, 'mtu': '1500', 'netmask': '', 'hwaddr': '90:b1:1c:ac:1b:07', 'speed': 1000}, 'p3p2': {'addr': '', 'cfg': {'DEVICE': 'p3p2', 'HWADDR': '90:B1:1C:AC:1B:06', 'ONBOOT': 'no', 'NM_CONTROLLED': 'yes'}, 'mtu': '1500', 'netmask': '', 'hwaddr': '90:b1:1c:ac:1b:06', 'speed': 0}, 'p3p1': {'permhwaddr': '90:B1:1C:AC:1B:05', 'addr': '', 'cfg': {'SLAVE': 'yes', 'ONBOOT': 'yes', 'NM_CONTROLLED': 'no', 'MASTER': 'bond4', 'HWADDR': '90:b1:1c:ac:1b:05', 'DEVICE': 'p3p1'}, 'mtu': '1500', 'netmask': '', 'hwaddr': '90:b1:1c:ac:1b:05', 'speed': 1000}} operatingSystem = {'release': '3.el6.centos.9', 'version': '6', 'name': 'RHEL'} packages2 = {'kernel': {'release': '279.22.1.el6.x86_64', 'buildtime': 1360105846.0, 'version': '2.6.32'}, 'spice-server': {'release': '10.el6', 'buildtime': 1340375889, 'version': '0.10.1'}, 'vdsm': {'release': '0.31.20.el6', 'buildtime': 1361952052, 'version': '4.10.3'}, 'qemu-kvm': {'release': '2.295.el6.10', 'buildtime': 1357897281, 'version': '0.12.1.2'}, 'libvirt': {'release': '21.el6_3.8', 'buildtime': 1359401311, 'version': '0.9.10'}, 'qemu-img': {'release': '2.295.el6.10', 'buildtime': 1357897281, 'version': '0.12.1.2'}, 'mom': {'release': '1.el6', 'buildtime': 1349470062, 'version': '0.3.0'}} reservedMem = 321 software_revision = 0.31 software_version = 4.10 supportedENGINEs = ['3.0', '3.1'] supportedProtocols = ['2.2', '2.3'] uuid = 4C4C4544-004E-5910-804D-C6C04F47354A version_name = Snow Man vlans = {} vmTypes = 
['kvm']

Before the echo:
# cat /sys/class/net/bonding_masters
bond0 bond4
After:
# cat /sys/class/net/bonding_masters
bond0 bond4 bond1

Yuriy Demchenko
On 03/06/2013 04:28 PM, Antoni Segura Puimedon wrote: Could you ssh to the host and do: cat /sys/class/net/bonding_masters There is a limitation on ovirt-3.2 that only the default bonds can be used. If you don't have enough of them, you can always create the needed bond doing echo "+bondN" > /sys/class/net/bonding_masters where N is the number of the bond you want to create. It would also be helpful to see the contents of doing vdsClient -s 0 getVdsCaps
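The workaround above can be scripted. A minimal Python sketch, assuming you already have the text of /sys/class/net/bonding_masters and a list of bond names you want to exist; `missing_bonds` and `bond_create_commands` are hypothetical helper names, and the sketch only builds the echo commands rather than writing to sysfs (which needs root).

```python
def missing_bonds(bonding_masters_text, wanted):
    """Return the wanted bond names that are not yet listed in bonding_masters."""
    existing = set(bonding_masters_text.split())
    return [name for name in wanted if name not in existing]


def bond_create_commands(bonding_masters_text, wanted,
                         path="/sys/class/net/bonding_masters"):
    """Build one 'echo "+bondN" > path' command per missing bond."""
    return ['echo "+%s" > %s' % (name, path)
            for name in missing_bonds(bonding_masters_text, wanted)]


if __name__ == "__main__":
    before = "bond0 bond4\n"  # content shown in this mail before the echo
    for cmd in bond_create_commands(before, ["bond0", "bond1", "bond4"]):
        print(cmd)
```

With the "before" content from this mail, the only command produced is the one for bond1, which matches the "after" listing.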
Re: [Users] ovirt 3.2 networking setup problems
Hi, I still can't solve this issue: "There are no available Bonds". I've done a clean install of ovirt-3.2 (from the dreyou repo) and added a freshly kickstarted host. In my kickstart I set up 2 interfaces, em1/em2, as slaves of bond0 (mode=6), which in turn is enslaved to the "ovirtmgmt" bridge (just so I don't need to click more buttons in webadmin to set up a bond for the mgmt network). After that, on the "Setup Host Networks" page I can see the correct scheme for the ovirtmgmt network and 4 other unused NICs. I want to set up another 2 bonds from those unused NICs, and here the problem strikes: when bonding 2 NICs together I have to choose a bond name, and there is only 1 choice, "bond4". I pick that, and when I try to bond the remaining 2 NICs I get the error message "There are no available Bonds". There is nothing in engine.log or vdsm.log. Why is that, and how can I solve it? Yuriy Demchenko On 02/26/2013 12:37 PM, Yuriy Demchenko wrote: 2. when i'm trying to team NIC's in bonds, i've got choice from only 3 bonds: "bond0", "bond1", "bond4" - why is that list is fixed? If I've allready defined bond0+bond1 prior adding host to ovirt (via kickstart), even if they aren't used - choice list in ovirt host networks is short to "bond4" and trying to bond more interfaces ends up with "there no bonds available"
Re: [Users] ovirt 3.2 networking setup problems
I tried reverting that patch, and it helped. So it seems the problem is caused by that patch... Thanks for the help! One less mystery to solve :)
Yuriy Demchenko
On 02/26/2013 04:16 PM, Dan Kenigsberg wrote: On Tue, Feb 26, 2013 at 03:25:13PM +0400, Yuriy Demchenko wrote: i've made some additional research on problem #1: it seems that /usr/share/vdsm/configNetwork.py treats "gateway=null" value as true, so setting the error flag in "_addNetworkValidation" method there's a check at line 813: if netmask or gateway: raise ConfigNetworkError(ne.ERR_BAD_ADDR, "Specified netmask or gateway but not ip") as "ipaddr", sent by engine = "", its check evaluated as false, then "gateway" evaluated, but it contains value (word "null"), so i presume evaluated as true and exception is raised.
I believe that this issue is due to code introduced by dreyou. http://www.dreyou.org/ovirt/vdsm32.patch

diff --git a/vdsm/configNetwork.py b/vdsm/configNetwork.py
index 53debfa..c8a80f1 100755
--- a/vdsm/configNetwork.py
+++ b/vdsm/configNetwork.py
@@ -873,6 +873,11 @@ def addNetwork(network, vlan=None, bonding=None, nics=None, ipaddr=None,
     _netinfo = netinfo.NetInfo()
     bridged = utils.tobool(bridged)
 
+    # Hack here, netmask may be not defined, if this happen,
+    # set netmask to 255.255.255.0
+    if not netmask:
+        netmask = "255.255.255.0"
+
     if mtu:
         mtu = int(mtu)

I do not know the motivation for it - I hope that Dreyou himself can help. Dreyou, others - I would very much appreciate it if you send such patches to upstream gerrit.ovirt.org. They deserve discussion and hopefully - incorporation into mainline vdsm.
I've added my notes to https://bugzilla.redhat.com/show_bug.cgi?id=911957 Yuriy Demchenko On 02/26/2013 12:37 PM, Yuriy Demchenko wrote: Hi, I'm testing ovirt 3.2 on centos 6.3 (from dreyou repo) and have a problems setting up networks: 1.
- clean install of ovirt-engine, added a host (6 NICs, only 1 configured with static ip before adding it to ovirt)
- set up bonding for ovirtmgmt from 2 NICs - em1+em2
- set up 2 additional networks ("trunk" with no vlanid, "vlan50" with vlanid=50), attached it to cluster
- trying to set up host networks - assigned "trunk" to one of unused NIC, "vlan50" to another, boot protocol in both - "none"
and when i'm trying to apply changes - i get "Specified netmask or gateway but not ip" What is wrong? why the hell it thinks i pass some netmask/gw in first place? I dont need any ip configuration on this networks, i'll be using them only for VM's
2. when i'm trying to team NIC's in bonds, i've got choice from only 3 bonds: "bond0", "bond1", "bond4" - why is that list is fixed? If I've allready defined bond0+bond1 prior adding host to ovirt (via kickstart), even if they aren't used - choice list in ovirt host networks is short to "bond4" and trying to bond more interfaces ends up with "there no bonds available"
Genady has reported this issue to me yesterday :-(. It is fixed by http://gerrit.ovirt.org/12447 and deserves a rebuild of vdsm. Please confirm that my patch solves this issue.
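The interaction discussed in this thread can be reproduced in isolation. The Python sketch below is a simplified stand-in, not the real configNetwork.py: the exception class, `validate_addressing` and `add_network_with_hack` are made-up names, and only the "netmask or gateway without ip" rule, the error code 22 from the traceback and the netmask default from the dreyou patch are taken from the thread.

```python
ERR_BAD_ADDR = 22  # error code seen in the vdsm.log traceback


class ConfigNetworkError(Exception):
    """Stand-in for vdsm's exception: carries an error code and a message."""

    def __init__(self, code, message):
        super().__init__(code, message)
        self.code = code
        self.message = message


def validate_addressing(ipaddr=None, netmask=None, gateway=None):
    """Simplified form of the check: netmask/gateway are only allowed with an ip."""
    if not ipaddr and (netmask or gateway):
        raise ConfigNetworkError(ERR_BAD_ADDR,
                                 "Specified netmask or gateway but not ip")


def add_network_with_hack(ipaddr=None, netmask=None, gateway=None):
    """Mimics the patched flow: default the netmask first, validate afterwards."""
    if not netmask:
        netmask = "255.255.255.0"  # the default the patch introduces
    validate_addressing(ipaddr, netmask, gateway)
    return netmask


if __name__ == "__main__":
    validate_addressing()  # no ip, no netmask, no gateway: accepted
    try:
        add_network_with_hack()  # same input, but the default netmask trips the check
    except ConfigNetworkError as err:
        print(err.code, err.message)
```

Note that a non-empty string such as "null" is also truthy in Python, so a literal "null" gateway would trigger the same error; in the vdsm.log above, however, no gateway is passed at all, which fits the netmask default being the trigger.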
Re: [Users] ovirt 3.2 networking setup problems
i've made some additional research on problem #1: it seems that /usr/share/vdsm/configNetwork.py treats "gateway=null" value as true, so setting the error flag in "_addNetworkValidation" method there's a check at line 813: if netmask or gateway: raise ConfigNetworkError(ne.ERR_BAD_ADDR, "Specified netmask or gateway but not ip") as "ipaddr", sent by engine = "", its check evaluated as false, then "gateway" evaluated, but it contains value (word "null"), so i presume evaluated as true and exception is raised. I've added my notes to https://bugzilla.redhat.com/show_bug.cgi?id=911957 Yuriy Demchenko On 02/26/2013 12:37 PM, Yuriy Demchenko wrote: Hi, I'm testing ovirt 3.2 on centos 6.3 (from dreyou repo) and have a problems setting up networks: 1. - clean install of ovirt-engine, added a host (6 NICs, only 1 configured with static ip before adding it to ovirt) - set up bonding for ovirtmgmt from 2 NICs - em1+em2 - set up 2 additional networks ("trunk" with no vlanid, "vlan50" with vlanid=50), attached it to cluster - trying to set up host networks - assigned "trunk" to one of unused NIC, "vlan50" to another, boot protocol in both - "none" and when i'm trying to apply changes - i get "Specified netmask or gateway but not ip" What is wrong? why the hell it thinks i pass some netmask/gw in first place? I dont need any ip configuration on this networks, i'll be using them only for VM's 2. when i'm trying to team NIC's in bonds, i've got choice from only 3 bonds: "bond0", "bond1", "bond4" - why is that list is fixed? 
If I've allready defined bond0+bond1 prior adding host to ovirt (via kickstart), even if they aren't used - choice list in ovirt host networks is short to "bond4" and trying to bond more interfaces ends up with "there no bonds available" Please help me figure this issues out ovirt-engine.log: 2013-02-26 11:42:45,772 INFO [org.ovirt.engine.core.bll.network.host.SetupNetworksCommand] (ajp--127.0.0.1-8702-2) [800e96b] Running command: SetupNetworksCommand internal: false. Entities affected : ID: d1181ea2-936b-498d-be3d-9ce16aacfb19 Type: VDS 2013-02-26 11:42:45,781 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SetupNetworksVDSCommand] (ajp--127.0.0.1-8702-2) [800e96b] START, SetupNetworksVDSCommand(HostName = its-host2, HostId = d1181ea2-936b-498d-be3d-9ce16aacfb19, force=false, checkConnectivity=true, conectivityTimeout=120, networks=[trunk {id=6a48a2c6-a028-4825-8fcf-cbc869dc5c76, description=null, subnet=null, gateway=null, type=null, vlanId=null, stp=false, dataCenterId=5849b030-626e-47cb-ad90-3ce782d831b3, mtu=0, vmNetwork=true, cluster=NetworkCluster {id={clusterId=null, networkId=null}, status=NON_OPERATIONAL, display=false, required=true}}, vlan50 {id=4f1564e4-0ba2-454c-a511-9566a1bf50a9, description=vm vlan50, subnet=null, gateway=null, type=null, vlanId=50, stp=false, dataCenterId=5849b030-626e-47cb-ad90-3ce782d831b3, mtu=0, vmNetwork=true, cluster=NetworkCluster {id={clusterId=null, networkId=null}, status=NON_OPERATIONAL, display=false, required=true}}], bonds=[], interfaces=[bond0 {id=61d4d61f-45a2-4ac4-84e8-f60077831603, vdsId=d1181ea2-936b-498d-be3d-9ce16aacfb19, name=bond0, macAddress=90:b1:1c:ac:1b:0e, networkName=ovirtmgmt, bondOptions=mode=6, bootProtocol=STATIC_IP, address=192.168.10.132, subnet=255.255.255.0, gateway=192.168.10.1, mtu=1500, bridged=true, type=2, networkImplementationDetails={inSync=true, managed=true}}, p3p4 {id=9f584135-7c20-459b-8582-4c4c2568d6ea, vdsId=d1181ea2-936b-498d-be3d-9ce16aacfb19, name=p3p4, 
macAddress=90:b1:1c:ac:1b:15, networkName=null, bondName=null, bootProtocol=NONE, address=, subnet=, gateway=null, mtu=1500, bridged=false, speed=0, type=0, networkImplementationDetails=null}, em2 {id=40d2f7e3-8b34-4211-9009-101653947ef3, vdsId=d1181ea2-936b-498d-be3d-9ce16aacfb19, name=em2, macAddress=90:B1:1C:AC:1B:11, networkName=null, bondName=bond0, bootProtocol=NONE, address=, subnet=, gateway=null, mtu=1500, bridged=false, speed=1, type=0, networkImplementationDetails=null}, p3p3 {id=c7e50eca-c14e-4e9d-8edc-f4ab0fbfa4e1, vdsId=d1181ea2-936b-498d-be3d-9ce16aacfb19, name=p3p3, macAddress=90:b1:1c:ac:1b:14, networkName=null, bondName=null, bootProtocol=NONE, address=, subnet=, gateway=null, mtu=1500, bridged=false, speed=0, type=0, networkImplementationDetails=null}, em1 {id=8481bb51-9020-463f-b6d6-af0770d3f3b0, vdsId=d1181ea2-936b-498d-be3d-9ce16aacfb19, name=em1, macAddress=90:B1:1C:AC:1B:0E, networkName=null, bondName=bond0, bootProtocol=NONE, address=, subnet=, gateway=null, mtu=1500, bridged=false, speed=1, type=0, networkImplementationDetails=nul
[Users] ovirt 3.2 networking setup problems
rotocol=NONE, address=null, subnet=null, gateway=null, mtu=0, bridged=true, speed=null, type=0, networkImplementationDetails=null}], removedNetworks=[], removedBonds=[]), log id: 34ba6cbd 2013-02-26 11:42:45,858 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SetupNetworksVDSCommand] (ajp--127.0.0.1-8702-2) [800e96b] FINISH, SetupNetworksVDSCommand, log id: 34ba6cbd 2013-02-26 11:42:46,375 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (ajp--127.0.0.1-8702-2) [800e96b] org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException: VDSGenericException: VDSNetworkException: Specified netmask or gateway but not ip 2013-02-26 11:42:46,384 ERROR [org.ovirt.engine.core.vdsbroker.VDSCommandBase] (ajp--127.0.0.1-8702-2) [800e96b] Command SetupNetworksVDS execution failed. Exception: VDSNetworkException: VDSGenericException: VDSNetworkException: Specified netmask or gateway but not ip 2013-02-26 11:42:46,391 ERROR [org.ovirt.engine.core.bll.network.host.SetupNetworksCommand] (ajp--127.0.0.1-8702-2) [800e96b] Command org.ovirt.engine.core.bll.network.host.SetupNetworksCommand throw Vdc Bll exception. 
With error message VdcBLLException: org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException: VDSGenericException: VDSNetworkException: Specified netmask or gateway but not ip vdsm.log on host: Thread-403::DEBUG::2013-02-26 11:42:45,863::BindingXMLRPC::913::vds::(wrapper) client [192.168.10.147]::call setupNetworks with ({'vlan50': {'nic': 'p3p2', 'vlan': '50', 'STP': 'no', 'bridged': 'true'}, 'trunk': {'nic': 'p3p1', 'STP': 'no', 'bridged': 'true'}}, {}, {'connectivityCheck': 'true', 'connectivityTimeout': 120}) {} Thread-404::DEBUG::2013-02-26 11:42:45,872::BindingXMLRPC::913::vds::(wrapper) client [192.168.10.147]::call ping with () {} Thread-404::DEBUG::2013-02-26 11:42:45,872::BindingXMLRPC::920::vds::(wrapper) return ping with {'status': {'message': 'Done', 'code': 0}} MainProcess|Thread-403::DEBUG::2013-02-26 11:42:45,923::configNetwork::1310::setupNetworks::(setupNetworks) Setting up network according to configuration: networks:{'vlan50': {'nic': 'p3p2', 'vlan': '50', 'STP': 'no', 'bridged': 'true'}, 'trunk': {'nic': 'p3p1', 'STP': 'no', 'bridged': 'true'}}, bondings:{}, options:{'connectivityCheck': 'true', 'connectivityTimeout': 120} MainProcess|Thread-403::DEBUG::2013-02-26 11:42:45,923::configNetwork::1314::root::(setupNetworks) Validating configuration MainProcess|Thread-403::DEBUG::2013-02-26 11:42:45,972::configNetwork::1317::setupNetworks::(setupNetworks) Applying... MainProcess|Thread-403::DEBUG::2013-02-26 11:42:46,130::configNetwork::1362::setupNetworks::(setupNetworks) Adding network 'vlan50' MainProcess|Thread-403::DEBUG::2013-02-26 11:42:46,183::configNetwork::887::root::(addNetwork) validating network... 
MainProcess|Thread-403::ERROR::2013-02-26 11:42:46,183::configNetwork::1382::setupNetworks::(setupNetworks) (22, 'Specified netmask or gateway but not ip') Traceback (most recent call last): File "/usr/share/vdsm/configNetwork.py", line 1364, in setupNetworks implicitBonding=True, **d) File "/usr/share/vdsm/configNetwork.py", line 891, in addNetwork bridged=bridged, **options) File "/usr/share/vdsm/configNetwork.py", line 815, in _addNetworkValidation "Specified netmask or gateway but not ip") ConfigNetworkError: (22, 'Specified netmask or gateway but not ip') MainProcess|Thread-403::ERROR::2013-02-26 11:42:46,191::supervdsmServer::80::SuperVdsm.ServerCallback::(wrapper) Error in setupNetworks Traceback (most recent call last): File "/usr/share/vdsm/supervdsmServer.py", line 78, in wrapper return func(*args, **kwargs) File "/usr/share/vdsm/supervdsmServer.py", line 177, in setupNetworks return configNetwork.setupNetworks(networks, bondings, **options) File "/usr/share/vdsm/configNetwork.py", line 1364, in setupNetworks implicitBonding=True, **d) File "/usr/share/vdsm/configNetwork.py", line 891, in addNetwork bridged=bridged, **options) File "/usr/share/vdsm/configNetwork.py", line 815, in _addNetworkValidation "Specified netmask or gateway but not ip") ConfigNetworkError: (22, 'Specified netmask or gateway but not ip') Thread-403::ERROR::2013-02-26 11:42:46,192::API::1217::vds::(setupNetworks) Specified netmask or gateway but not ip Traceback (most recent call last):
Re: [Users] ovirt node - cant start spice
Thanks! That did the trick: with the full console I've managed to set "spice_tls=1" in qemu.conf. However, now I have another problem: the remote/spice viewer closes shortly after "setting up spice session". I've done everything as described in http://wiki.ovirt.org/wiki/How_to_Connect_to_SPICE_Console_With_Portal and tried on 2 different PCs (win7 x64 / ie9, virt-viewer-0.5.3). There is nothing in spicex.log, only "spicex_init_logger: started" and "spicex_log_cleanup: done". Any tips? Yuriy Demchenko On 10/15/2012 12:39 PM, Alexandre Santos wrote: 2012/10/15 Yuriy Demchenko <demchenko...@gmail.com> Hi, I've recently installed ovirt-node-2.5.2 and now have trouble starting VM with spice as console: unsupported configuration: spice secure channels set in XML configuration, but TLS is disabled in qemu.conf How can i fix this? Also, what I didn't get is how to log in to node via ssh to get normal console, not to "admin interface", is it even possible? Have you tried pressing F2? On the console it works, haven't tried through ssh... Alex
[Users] ovirt node - cant start spice
Hi, I've recently installed ovirt-node-2.5.2 and now have trouble starting a VM with SPICE as the console: unsupported configuration: spice secure channels set in XML configuration, but TLS is disabled in qemu.conf How can I fix this? Also, what I didn't get is how to log in to the node via ssh and get a normal console instead of the "admin interface". Is that even possible? -- Yuriy Demchenko
Re: [Users] ovirt 3.1 guest pxe boot fails
Hi,

I've checked this issue with ipxe-roms-qemu from the default Fedora repo (ipxe-roms-qemu-20120328-1.gitaac9718.fc17) and see the same behaviour: the system PXE-boots fine the first time after domain start, but after a reboot from inside the guest PXE boot fails, the guest can't get an IP from DHCP, and malformed packets appear on the bridge interface.

The situation is the same on a Fedora host not connected to oVirt (VM created with virt-manager).

Yuriy Demchenko
Re: [Users] ovirt 3.1 guest pxe boot fails
Hi,

I can't use ipxe-roms-qemu-20120328-1.gitaac9718.fc17 from the default repo (I can't boot from SAN with it: http://ipxe.org/err/1d704439), so I took the SRPM and rebuilt it with the latest git sources. I'm now using ipxe-roms-qemu-20120926-1.git09cc63f.fc17.

I still can't find a logical connection between the iPXE version and my problem: the first boot never fails, only reboots initiated from the guest system do. On real hardware there is no such problem: I chainload the same iPXE ROM on a real machine and it boots and reboots fine.

I don't know yet about qemu/kvm not connected to oVirt; I'll check that over the weekend.

Yuriy Demchenko

On 10/05/2012 01:16 AM, Rami Vaknin wrote:
> On 10/04/2012 12:49 PM, Yuriy Demchenko wrote:
> > Hi,
> >
> > I've started testing oVirt 3.1 in my lab environment and found some strange behaviour (possibly a bug). When I create a VM and set it to boot from the network, it boots fine the first time:
> >
> > 14:23:43.718613 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 52:54:00:c7:76:50, length 395
>
> Although you managed to boot from the network the first time, I'm not sure that a DHCP packet with a length of 395 is OK - I think it should be ~60. I would start by checking the gpxe-roms-qemu package version.
>
> > PXE works, an IP address is received and the machine boots. But if I restart it with the guest command 'reboot' (i.e. not through stop/start from the oVirt admin interface), PXE boot fails: iPXE just reports a timeout on DHCP requests, and with tcpdump I cannot see any DHCP requests at all. Instead I get some weird malformed packets:
> >
> > 14:21:38.529244 00:00:00:00:ff:ff > 00:00:00:00:00:00, ethertype Unknown (0x), length 447:
> > 0x: 5254 00c7 7650 0800 4500 01a7 0322 ..RT..vP..E"
> > 0x0010: 4011 7625 0044 ..@.v%.D
> > 0x0020: 0043 0193 02f7 0101 0600 2c78 5053 000c .C,xPS..
> > 0x0030:
> > 0x0040: 5254 00c7 7650 ..RT..vP
> > 0x0050:
> > 0x0060:
> > 0x0070:
> > 0x0080:
> > 0x0090:
> > 0x00a0:
> > 0x00b0:
> > 0x00c0:
> > 0x00d0:
> > 0x00e0:
> > 0x00f0:
> > 0x0100:
> > 0x0110: 6382 5363 3501 0139 0205 c05d 0200 ..c.Sc5..9...]..
> > 0x0120: 005e 0301 0201 3c20 5058 4543 6c69 656e .^<.PXEClien
> > 0x0130: 743a 4172 6368 3a30 3030 3030 3a55 4e44 t:Arch:0:UND
> > 0x0140: 493a 3030 3230 3031 4d04 6950 5845 3715 I:002001M.iPXE7.
> > 0x0150: 0103 0607 0c0f 112b 3c42 4380 8182 8384 ...+
> >
> > As I'm planning to rely heavily on network booting, this issue is critical for me. Any tips on how to solve it?
> >
> > oVirt 3.1, engine and node on separate hosts, installed on clean Fedora 17 (minimal); node with kernel-3.3.4-5.fc17.x86_64. The node has 1 NIC and 1 network defined in oVirt: ovirtmgmt.
[Users] ovirt 3.1 guest pxe boot fails
Hi,

I've started testing oVirt 3.1 in my lab environment and found some strange behaviour (possibly a bug). When I create a VM and set it to boot from the network, it boots fine the first time:

14:23:43.718613 IP 0.0.0.0.68 > 255.255.255.255.67: BOOTP/DHCP, Request from 52:54:00:c7:76:50, length 395

PXE works, an IP address is received and the machine boots. But if I restart it with the guest command 'reboot' (i.e. not through stop/start from the oVirt admin interface), PXE boot fails: iPXE just reports a timeout on DHCP requests, and with tcpdump I cannot see any DHCP requests at all. Instead I get some weird malformed packets:

14:21:38.529244 00:00:00:00:ff:ff > 00:00:00:00:00:00, ethertype Unknown (0x), length 447:
0x: 5254 00c7 7650 0800 4500 01a7 0322 ..RT..vP..E"
0x0010: 4011 7625 0044 ..@.v%.D
0x0020: 0043 0193 02f7 0101 0600 2c78 5053 000c .C,xPS..
0x0030:
0x0040: 5254 00c7 7650 ..RT..vP
0x0050:
0x0060:
0x0070:
0x0080:
0x0090:
0x00a0:
0x00b0:
0x00c0:
0x00d0:
0x00e0:
0x00f0:
0x0100:
0x0110: 6382 5363 3501 0139 0205 c05d 0200 ..c.Sc5..9...]..
0x0120: 005e 0301 0201 3c20 5058 4543 6c69 656e .^<.PXEClien
0x0130: 743a 4172 6368 3a30 3030 3030 3a55 4e44 t:Arch:0:UND
0x0140: 493a 3030 3230 3031 4d04 6950 5845 3715 I:002001M.iPXE7.
0x0150: 0103 0607 0c0f 112b 3c42 4380 8182 8384 ...+

As I'm planning to rely heavily on network booting, this issue is critical for me. Any tips on how to solve it?

oVirt 3.1, engine and node on separate hosts, installed on clean Fedora 17 (minimal); node with kernel-3.3.4-5.fc17.x86_64. The node has 1 NIC and 1 network defined in oVirt: ovirtmgmt.

-- 
Yuriy Demchenko
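The two tcpdump lengths in the message above can be cross-checked with a little arithmetic. This is a sketch only, not a diagnosis. It assumes that the bytes "0800 4500 01a7" visible in the first dump row are an IPv4 ethertype followed by the start of an IPv4 header (version/IHL, TOS, total length), that the header carries no IP options, and that there is no VLAN tag. The constant and function names are made up for illustration.

```python
# Figures copied from the tcpdump output in the message above.
CAPTURED_FRAME_LEN = 447    # "length 447" reported for the malformed frame
IP_TOTAL_LEN = 0x01A7       # the two bytes following "4500" in the dump
DHCP_LEN_FIRST_BOOT = 395   # "length 395" reported for the working request

# Standard header sizes (assumption: no IP options, no VLAN tag).
ETH_HEADER = 14
IPV4_HEADER = 20            # 0x45 means IPv4 with IHL 5, i.e. 5 * 4 bytes
UDP_HEADER = 8


def udp_payload_len(ip_total_len: int) -> int:
    """Size of the UDP payload carried inside an IPv4 packet."""
    return ip_total_len - IPV4_HEADER - UDP_HEADER


def unexplained_bytes(captured_len: int, ip_total_len: int) -> int:
    """Captured bytes not accounted for by Ethernet header + IP packet."""
    return captured_len - (ETH_HEADER + ip_total_len)


payload = udp_payload_len(IP_TOTAL_LEN)
extra = unexplained_bytes(CAPTURED_FRAME_LEN, IP_TOTAL_LEN)

print(f"UDP payload inside malformed frame: {payload} bytes")
print(f"Same size as the working DHCP request: {payload == DHCP_LEN_FIRST_BOOT}")
print(f"Bytes beyond a normal Ethernet frame: {extra}")
```

Under those assumptions the malformed frame carries a UDP payload of the same size as the DHCP request seen on the first boot, and the capture is 10 bytes longer than an ordinary Ethernet frame carrying it. That would be consistent with the guest still sending a normal DHCP request that arrives on the bridge shifted by a 10-byte prefix; what that prefix is cannot be determined from the dump alone.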