Re: [Users] installation failed on cluster host
It's obvious that something is wrong with your repo. Can you please run:

yum repolist

and:

for f in /etc/yum.repos.d/*; do cat "$f"; done

I'd check the CentOS repo. I use:

[BASE]
name=BASE
baseurl=http://mirror.centos.org/centos/6/os/x86_64/
gpgcheck=0
enabled=1

On Wed, Jan 15, 2014 at 1:43 AM, Dafna Ron d...@redhat.com wrote:

Hi William,

The issue you are having is a yum issue with CentOS and not an oVirt issue. I found several posts on it by searching for 'Cannot find a valid baseurl for repo: base' on Google.

https://www.google.co.uk/?gws_rd=cr&ei=GsnVUu-YCo7T7AbG6YCICw#q=Cannot+find+a+valid+baseurl+for+repo:+base+CentOS+6.5&spell=1

Can you please look at the link and see if any of the forums report anything like what you describe with yum? If so, can you also post it here so that others encountering this yum issue on CentOS can use your workaround?

Just so you know, oVirt keeps a deployment log under /var/log/ovirt-engine/host-deploy/, so if a host installation fails, it should be reported there in a way that is easy to debug.

Thanks,
Dafna

On 01/14/2014 11:29 PM, William Kwan wrote:

Hi,

I am running through the whole installation exercise on two hosts again with the latest release on CentOS 6.5. I added a host to a new cluster. There are action items listed under the new host. One of them is:

Host installation failed. Fix installation issues and try to Re-Install

I clicked Re-install, monitored engine.log, and saw the following:

Installation 10.10.25.10: Yum [u'Errors were encountered while downloading packages.', u'libselinux-python-2.0.94-5.3.el6_4.1.x86_64: failure: Packages/libselinux-python-2.0.94-5.3.el6_4.1.x86_64.rpm from base: [Errno 256] No more mirrors to try.']

I'm sure I ran yum update already, and I cleaned the db and reran the update, etc. Eventually I manually ran `yum install vdsm.x86_64` on the host. I tried to run re-install again and I still get a yum error:

Yum Cannot queue package iproute: Cannot find a valid baseurl for repo: base
ERROR [org.ovirt.engine.core.bll.InstallerMessages] (VdsDeploy) Installation 10.10.25.10: Failed to execute stage 'Environment packages setup': Cannot find a valid baseurl for repo: base

iproute is definitely installed:

# rpm -qa|grep iproute
iproute-2.6.32-31.el6.x86_64

Something in ssh? The password is definitely correct. What could be wrong?

Will

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
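Since the error in this thread is 'Cannot find a valid baseurl for repo: base', one quick check is whether any enabled repo section lacks both a baseurl and a mirrorlist line. The following is only a rough sketch of that check: the function name repos_without_source is made up for illustration, and it assumes the usual INI-style layout of the files under /etc/yum.repos.d, treating a section with no enabled= line as enabled. It does not tell you whether a configured URL is actually reachable.

```shell
# Hypothetical helper (not part of yum): print "file: [section]" for every
# enabled repo section that defines neither baseurl nor mirrorlist.
repos_without_source() {
    local dir="${1:-/etc/yum.repos.d}"
    local f
    for f in "$dir"/*.repo; do
        [ -e "$f" ] || continue
        awk -v file="$f" '
            # Report the section collected so far if it has no package source.
            function report() {
                if (section != "" && enabled && !has_source)
                    print file ": " section
            }
            # A new [section] header closes the previous section.
            /^[ \t]*\[/ { report(); section = $0; enabled = 1; has_source = 0; next }
            /^[ \t]*enabled[ \t]*=[ \t]*0/ { enabled = 0 }
            /^[ \t]*(baseurl|mirrorlist)[ \t]*=/ { has_source = 1 }
            END { report() }
        ' "$f"
    done
}
```

Running repos_without_source with no argument scans /etc/yum.repos.d; empty output means every enabled section names at least one source.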
Re: [Users] Can I use IP address for FQDN?
It's a bit of a problem if encryption is involved and deployed. You can start changing DB entries, but it can cause more harm than good. If we are not talking about a large-scale setup, I would suggest configuring /etc/hosts on each node manually or adding the name to the DNS server.

Haim

On Jan 10, 2014, at 19:48, David Li david...@sbcglobal.net wrote:

Hi,

I set up a default FQDN (for the engine machine) at engine installation, but the node cannot resolve it when it tries to mount the NFS ISO share from the engine. Is there any way that I can update this name on the engine to use an IP address?
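For the /etc/hosts route suggested above, here is a small sketch of adding the engine name on a node without duplicating the entry on repeated runs. The function name add_host_entry, the address 192.0.2.10, and the name engine.example.com are placeholders, not values from this thread.

```shell
# Hypothetical helper: append "IP<TAB>NAME" to a hosts file unless some
# non-comment line already lists NAME as a host name.
# Usage: add_host_entry FILE IP NAME
add_host_entry() {
    local file="$1" ip="$2" name="$3"
    if awk -v n="$name" '
            /^[ \t]*#/ { next }                       # skip comment lines
            { for (i = 2; i <= NF; i++) if ($i == n) found = 1 }
            END { exit (found ? 0 : 1) }
        ' "$file" 2>/dev/null; then
        return 0                                      # already present
    fi
    printf '%s\t%s\n' "$ip" "$name" >> "$file"
}
```

On a node this would be run as root, for example: add_host_entry /etc/hosts 192.0.2.10 engine.example.com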
Re: [Users] Documentation: Storage Domain conversion from Data Domain to Export Domain
----- Original Message -----
From: Bob Doolittle b...@doolittle.us.com
To: d...@redhat.com
Cc: users@ovirt.org
Sent: Wednesday, November 27, 2013 12:16:30 AM
Subject: Re: [Users] Documentation: Storage Domain conversion from Data Domain to Export Domain

On 11/26/2013 03:27 PM, Dafna Ron wrote:

psql -U postgres engine -c 'select connection from storage_server_connections;'

So now that we have determined that I have stale connection state in my DB, any suggestions as to how I might clear it out safely? I tried rebooting my Engine, but the connection is still in the DB and it still doesn't show in the Admin Portal.

Hi Bob,

You can try the following command:

engine=# delete FROM storage_server_connections where connection = '172.16.0.58:/export/VM_EXPORTDOMAIN';

This should allow you to re-create your storage domain. I would appreciate it if you could file a bug against it.

Thanks,
Bob

On 11/26/2013 08:21 PM, Bob Doolittle wrote:

On 11/26/2013 03:11 PM, Dafna Ron wrote:

Can you check the engine database for the connection?

Sorry, will need more guidance for that :(

-Bob

On 11/26/2013 07:12 PM, Bob Doolittle wrote:

On 11/26/2013 02:00 PM, Dafna Ron wrote:

Can you run mount and see if you have stale connections to the domain?

The only existing mounts on the Host are the loopback mount for the Data Domain, and the mount from Engine for the ISO Domain. I can import another Export domain on the same host which was created in the normal manner (and then 'destroy' it), but I can't import the converted one.

-Bob

On 11/26/2013 06:22 PM, Bob Doolittle wrote:

Hi,

I am having to recreate my engine, and want to recover my old VMs from the previous Storage Master Data Domain. A while back, Michael Ayers posted this recipe (the gmane reply feature won't allow lines longer than 80 chars, even if they were from the original post :(, so I apologize for the link):

http://comments.gmane.org/gmane.comp.emulators.ovirt.user/4428

Should this method still work? I have tried to recreate this with 3.3.1 and can't. I copied over the old Data Domain to a new directory, and edited the dom_md/metadata file as follows:

http://pastebin.com/U4xJawR8

But when I try to import it, I get:

Error while executing action: Cannot add Storage Connection. Storage connection already exists.

and

Failed to retrieve existing storage domain information.

Here's the output from engine.log:

http://pastebin.com/NVdzLDXb

Any clues?

Thanks,
Bob
Re: [Users] KVM version not showing in Ovirt Manager
Please issue the following commands on both HVs:

vdsClient -s 0 getVdsCaps
vdsClient -s 0 getVdsStats

I would like to make sure vdsm is indeed reporting them to the engine.

Haim

----- Original Message -----
From: Tom Brown t...@ng23.net
To: users@ovirt.org
Sent: Tuesday, January 22, 2013 8:00:54 AM
Subject: [Users] KVM version not showing in Ovirt Manager

Hi

I have just added another HV to a cluster and it's up and running fine. I can run VMs on it and migrate from other HVs onto it. I do note, however, that in the manager there is no KVM version listed as installed, whereas on other HVs in the cluster there is a version present. I see that the KVM version is slightly different on this new host, but as I said, apart from this visual issue everything appears to be running fine.

These HVs are CentOS 6.3 using dreyou 3.1.

Node where KVM version is not showing in the manager:

node003 ~]# rpm -qa | grep kvm
qemu-kvm-rhev-0.12.1.2-2.295.el6.10.x86_64
qemu-kvm-rhev-tools-0.12.1.2-2.295.el6.10.x86_64

Node where KVM version is showing in the manager:

node002 ~]# rpm -qa | grep kvm
qemu-kvm-tools-0.12.1.2-2.295.el6_3.8.x86_64
qemu-kvm-0.12.1.2-2.295.el6_3.8.x86_64

thanks
Re: [Users] Create ISO domain from scratch? Not attached, create!
Hi,

Generally speaking, you should be able to create an EXPORT\ISO domain and attach it to the current data center. If you can't, it just means that there is no working data domain and the pool is not initialised. Please make sure the pool (data center) status is up and running and that one of the hosts is functioning as SPM.

Haim

----- Original Message -----
From: No Reply no-re...@dc.rr.com
To: users@ovirt.org
Sent: Monday, January 21, 2013 8:50:43 PM
Subject: [Users] Create ISO domain from scratch? Not attached, create!

Ok, this one has me scratching my head...

I have an existing oVirt 3.1 environment, based on Fedora 17. The storage resource is NFS, also based on Fedora 17. I can create an NFS storage domain, no problem. But when the environment was set up initially, no (local) ISO domain was created.

Now, under oVirt engine 3.1, the option to create a storage type under the domain function only allows for creation of a data type domain; the only type in the drop-down list is DATA/NFS. What gives? In 3.0 and older, you could select the storage type as Data or ISO!

Anyone have this issue as well? Any help appreciated.
Re: [Users] Power mgmt and engine relationship
----- Original Message -----
From: Gianluca Cecchi gianluca.cec...@gmail.com
To: users users@ovirt.org
Sent: Tuesday, January 22, 2013 9:35:31 AM
Subject: [Users] Power mgmt and engine relationship

Hello,

Is there any requirement in connectivity between engine and hosts for fencing? Or is it only from any host to any host?

Host to host. And IIRC, it's cluster-wide.

Is it a cluster-wide concept, correct? Not a DC one?

Thanks
Gianluca
Re: [Users] custom nfs mount options
You can set it manually on each hypervisor by using vdsm.conf. Add the following to /etc/vdsm/vdsm.conf:

[irs]
nfs_mount_options = soft,nosharecache

Restart the vdsmd service at the end.

----- Original Message -----
From: Alex Leonhardt alex.t...@gmail.com
To: oVirt Mailing List users@ovirt.org
Sent: Tuesday, January 22, 2013 1:46:56 AM
Subject: [Users] custom nfs mount options

Hi,

Is it possible to set custom NFS mount options, specifically noatime, wsize and rsize? I couldn't see anything when adding an NFS domain, only timeout and retry.

Thanks!
Alex

--
| RHCE | Senior Systems Engineer | www.vcore.co | | www.vsearchcloud.com |
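After editing vdsm.conf as described in the reply above, it can be handy to confirm what value a hypervisor actually has before restarting vdsmd. The sketch below reads one key from one section of an INI-style file. The helper name get_ini_value is hypothetical; it does only simple key = value parsing and is not how vdsm itself reads its configuration.

```shell
# Hypothetical helper: print the value of KEY inside [SECTION] of an
# INI-style file. Usage: get_ini_value FILE SECTION KEY
get_ini_value() {
    awk -v want="[$2]" -v key="$3" '
        # Track whether the current section is the one asked for.
        /^[ \t]*\[/ { in_section = ($1 == want); next }
        in_section {
            eq = index($0, "=")
            if (eq == 0) next
            k = substr($0, 1, eq - 1)
            v = substr($0, eq + 1)
            gsub(/^[ \t]+|[ \t]+$/, "", k)   # trim the key
            gsub(/^[ \t]+|[ \t]+$/, "", v)   # trim the value
            if (k == key) print v
        }
    ' "$1"
}
```

For example: get_ini_value /etc/vdsm/vdsm.conf irs nfs_mount_options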
Re: [Users] Error run once VM
Please make sure the sanlock service is running properly.

----- Original Message -----
From: Juan Jose jj197...@gmail.com
To: users@ovirt.org
Sent: Monday, January 21, 2013 7:18:51 AM
Subject: [Users] Error run once VM

Hello everybody,

I'm following http://www.ovirt.org/Quick_Start_Guide#Create_a_Fedora_Virtual_Machine and when I click the OK button after entering all the parameters in Run Virtual Machine, I receive the error below in the events and in the vdsm.log file on my host:

Thread-352921::DEBUG::2013-01-21 15:55:40,709::task::978::TaskManager.Task::(_decref) Task=`8bb281a1-434b-4506-b4a8-2d6665bb382f`::ref 0 aborting False
Thread-352921::INFO::2013-01-21 15:55:40,709::clientIF::274::vds::(prepareVolumePath) prepared volume path: /rhev/data-center/d6e7e8b8-49c7-11e2-a261-000a5e429f63/57d184a0-908b-49b5-926f-cd413b9e6526/images/c77ff9d7-6280-4454-b342-faa206989d2a/bf973de9-d344-455d-a628-3dbfbf2693d9
Thread-352921::DEBUG::2013-01-21 15:55:40,717::libvirtvm::1338::vm.Vm::(_run) vmId=`51738dae-c758-4e77-bad7-281f56c4d61d`::?xml version=1.0 encoding=utf-8?
domain type=kvm nameFedora17/name uuid51738dae-c758-4e77-bad7-281f56c4d61d/uuid memory1048576/memory currentMemory1048576/currentMemory vcpu1/vcpu devices channel type=unix target name=com.redhat.rhevm.vdsm type=virtio/ source mode=bind path=/var/lib/libvirt/qemu/channels/Fedora17.com.redhat.rhevm.vdsm/ /channel input bus=ps2 type=mouse/ channel type=spicevmc target name=com.redhat.spice.0 type=virtio/ /channel graphics autoport=yes keymap=en-us listen=0 passwd=* passwdValidTo=1970-01-01T00:00:01 port=-1 tlsPort=-1 type=spice channel mode=secure name=main/ channel mode=secure name=inputs/ channel mode=secure name=cursor/ channel mode=secure name=playback/ channel mode=secure name=record/ channel mode=secure name=display/ /graphics console type=pty target port=0 type=virtio/ /console video model heads=1 type=qxl vram=65536/ /video interface type=bridge mac address=00:1a:4a:6d:ca:00/ model type=virtio/ source bridge=ovirtmgmt/ boot order=3/ /interface memballoon model=virtio/ disk device=cdrom snapshot=no type=file source file=/rhev/data-center/d6e7e8b8-49c7-11e2-a261-000a5e429f63/cd9b45e6-2150-44d9-af1a-a557840fde9e/images/----/Fedora-17-x86_64-Live-XFCE.iso startupPolicy=optional/ target bus=ide dev=hdc/ readonly/ serial/serial boot order=1/ /disk disk device=disk snapshot=no type=file source file=/rhev/data-center/d6e7e8b8-49c7-11e2-a261-000a5e429f63/57d184a0-908b-49b5-926f-cd413b9e6526/images/c77ff9d7-6280-4454-b342-faa206989d2a/bf973de9-d344-455d-a628-3dbfbf2693d9/ target bus=virtio dev=vda/ serialc77ff9d7-6280-4454-b342-faa206989d2a/serial boot order=2/ driver cache=none error_policy=stop io=threads name=qemu type=raw/ /disk /devices os type arch=x86_64 machine=pc-0.14hvm/type smbios mode=sysinfo/ /os sysinfo type=smbios system entry name=manufacturerRed Hat/entry entry name=productRHEV Hypervisor/entry entry name=version17-1/entry entry name=serial36303030-3139-3236-3800-00199935CC54_00:19:99:35:cc:54/entry entry 
name=uuid51738dae-c758-4e77-bad7-281f56c4d61d/entry /system /sysinfo clock adjustment=0 offset=variable timer name=rtc tickpolicy=catchup/ /clock features acpi/ /features cpu match=exact modelConroe/model topology cores=1 sockets=1 threads=1/ /cpu /domain Thread-352921::DEBUG::2013-01-21 15:55:41,258::vm::580::vm.Vm::(_startUnderlyingVm) vmId=`51738dae-c758-4e77-bad7-281f56c4d61d`::_ongoingCreations released Thread-352921::ERROR::2013-01-21 15:55:41,259::vm::604::vm.Vm::(_startUnderlyingVm) vmId=`51738dae-c758-4e77-bad7-281f56c4d61d`::The vm start process failed Traceback (most recent call last): File /usr/share/vdsm/vm.py, line 570, in _startUnderlyingVm self._run() File /usr/share/vdsm/libvirtvm.py, line 1364, in _run self._connection.createXML(domxml, flags), File /usr/lib/python2.7/site-packages/vdsm/libvirtconnection.py, line 82, in wrapper ret = f(*args, **kwargs) File /usr/lib64/python2.7/site-packages/libvirt.py, line 2420, in createXML if ret is None:raise libvirtError('virDomainCreateXML() failed', conn=self) libvirtError: internal error Failed to open socket to sanlock daemon: No such file or directory Thread-352921::DEBUG::2013-01-21 15:55:41,262::vm::920::vm.Vm::(setDownStatus) vmId=`51738dae-c758-4e77-bad7-281f56c4d61d`::Changed state to Down: internal error Failed to open socket to sanlock daemon: No such file or directory In Tree VMs, click in my Fedora17 VM, and in detail windows events: Failed to run VM Fedora17 (User: admin@internal). Failed to run VM Fedora17 on Host host1. VM Fedora17 is down. Exit message: internal error Failed to open socket to sanlock daemon: No such file or directory. In /var/log/vdsm/libvirt.log: 2013-01-21 14:55:41.258+: 10619: error : virNetClientProgramDispatchError:174 : internal error Failed to open socket to sanlock daemon: No such file or directory
Re: [Users] spm keeps on shifting between nodes continously
What type of storage are you using? Is it posix, btw? Can you please issue the following commands on both hosts:

vdsClient -s 0 getConnectedStoragePoolsList
vdsClient -s 0 getStoragePoolInfo `vdsClient -s 0 getConnectedStoragePoolsList`

----- Original Message -----
From: Jithin Raju rajuj...@gmail.com
To: users@ovirt.org
Sent: Tuesday, January 15, 2013 12:22:50 PM
Subject: [Users] spm keeps on shifting between nodes continously

Hi,

I have 2 nodes of oVirt 3.1 + gluster. When I try to activate the data center, its status changes to up, then contend, then back to up, continuously. Along with that, the SPM status keeps shifting between the two nodes. With one node it works fine. I remember somebody reporting this before, but I do not remember the fix.

engine log:

2013-01-15 15:50:41,762 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.HSMGetAllTasksInfoVDSCommand] (QuartzScheduler_Worker-66) [16c01e11] START, HSMGetAllTasksInfoVDSCommand(vdsId = 7caf739e-5ef7-11e2-aa89-525400927148), log id: 59dae374
2013-01-15 15:50:41,791 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.HSMGetAllTasksInfoVDSCommand] (QuartzScheduler_Worker-66) [16c01e11] FINISH, HSMGetAllTasksInfoVDSCommand, return: [], log id: 59dae374
2013-01-15 15:50:41,793 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.SPMGetAllTasksInfoVDSCommand] (QuartzScheduler_Worker-66) [16c01e11] FINISH, SPMGetAllTasksInfoVDSCommand, return: [], log id: 77055e85
2013-01-15 15:50:41,795 INFO [org.ovirt.engine.core.bll.AsyncTaskManager] (QuartzScheduler_Worker-66) [16c01e11] AsyncTaskManager::AddStoragePoolExistingTasks: Discovered no tasks on Storage Pool DC
2013-01-15 15:50:41,796 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.SPMGetAllTasksInfoVDSCommand] (QuartzScheduler_Worker-66) [16c01e11] START, SPMGetAllTasksInfoVDSCommand(storagePoolId = 1a995d7c-5ef3-11e2-a8c4-525400927148, ignoreFailoverLimit = false, compatabilityVersion = null), log id: 318b02c2
2013-01-15 15:50:41,798 INFO
[org.ovirt.engine.core.vdsbroker.irsbroker.SPMGetAllTasksInfoVDSCommand] (QuartzScheduler_Worker-66) [16c01e11] -- SPMGetAllTasksInfoVDSCommand::ExecuteIrsBrokerCommand: Attempting on storage pool 1a995d7c-5ef3-11e2-a8c4-525400927148 2013-01-15 15:50:41,800 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.HSMGetAllTasksInfoVDSCommand] (QuartzScheduler_Worker-66) [16c01e11] START, HSMGetAllTasksInfoVDSCommand(vdsId = 7caf739e-5ef7-11e2-aa89-525400927148), log id: 22d29c5b 2013-01-15 15:50:41,832 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.HSMGetAllTasksInfoVDSCommand] (QuartzScheduler_Worker-66) [16c01e11] FINISH, HSMGetAllTasksInfoVDSCommand, return: [], log id: 22d29c5b 2013-01-15 15:50:41,836 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.SPMGetAllTasksInfoVDSCommand] (QuartzScheduler_Worker-66) [16c01e11] FINISH, SPMGetAllTasksInfoVDSCommand, return: [], log id: 318b02c2 2013-01-15 15:50:41,841 INFO [org.ovirt.engine.core.bll.AsyncTaskManager] (QuartzScheduler_Worker-66) [16c01e11] AsyncTaskManager::AddStoragePoolExistingTasks: Discovered no tasks on Storage Pool DC 2013-01-15 15:50:51,830 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.GetStoragePoolInfoVDSCommand] (QuartzScheduler_Worker-44) irsBroker::BuildStorageDynamicFromXmlRpcStruct::Failed building Storage dynamic, xmlRpcStruct = org.ovirt.engine.core.vdsbroker.xmlrpc.XmlRpcStruct@7fdd2faf 2013-01-15 15:50:51,832 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.GetStoragePoolInfoVDSCommand] (QuartzScheduler_Worker-44) org.ovirt.engine.core.vdsbroker.irsbroker.IRSErrorException: IRSErrorException: 2013-01-15 15:50:51,833 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (QuartzScheduler_Worker-44) IrsBroker::Failed::GetStoragePoolInfoVDS due to: IRSErrorException: IRSErrorException: 2013-01-15 15:50:51,865 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (QuartzScheduler_Worker-44) START, SpmStopVDSCommand(vdsId = 7caf739e-5ef7-11e2-aa89-525400927148, storagePoolId 
= 1a995d7c-5ef3-11e2-a8c4-525400927148), log id: 6c7ade5e 2013-01-15 15:50:51,899 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (QuartzScheduler_Worker-44) SpmStopVDSCommand::Stopping SPM on vds blueberry, pool id 1a995d7c-5ef3-11e2-a8c4-525400927148 2013-01-15 15:50:53,032 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (QuartzScheduler_Worker-44) FINISH, SpmStopVDSCommand, log id: 6c7ade5e 2013-01-15 15:50:53,036 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (QuartzScheduler_Worker-44) Irs placed on server null failed. Proceed Failover 2013-01-15 15:50:53,046 INFO [org.ovirt.engine.core.bll.storage.SetStoragePoolStatusCommand] (QuartzScheduler_Worker-44) [3f11e766] Running command: SetStoragePoolStatusCommand internal: true. Entities affected : ID: 1a995d7c-5ef3-11e2-a8c4-525400927148 Type: StoragePool 2013-01-15
Re: [Users] hypervisor install fails to detect proper CPU type
Interesting. Any chance of reproducing this and fetching the following output on the hypervisor:

vdsClient -s 0 getVdsCaps
virsh -r capabilities
cat /proc/cpuinfo

Can you also paste the above from the CentOS installation?

Haim

----- Original Message -----
From: Jim Kinney jim.kin...@gmail.com
To: users@ovirt.org
Sent: Monday, January 14, 2013 8:06:55 PM
Subject: [Users] hypervisor install fails to detect proper CPU type

I installed the F17 version of the hypervisor and was unable to join the new node to the cluster. The failure was wrong CPU type for cluster. The system has an Intel Xeon X5660 CPU (Westmere family). There is another system of the same class in the same cluster NOT using the hypervisor (using a CentOS 6.3 install with dre-repo). I reinstalled the failing system with CentOS, and it now joins the cluster without problems.

--
James P. Kinney III

Every time you stop a school, you will have to build a jail. What you gain at one end you lose at the other. It's like feeding a dog on his own tail. It won't fatten the dog.
- Speech 11/23/1900 Mark Twain

http://electjimkinney.org
http://heretothereideas.blogspot.com/
Re: [Users] centos issues..
Hi Peter,

It sounds like everything is working fine. As for the warnings, the first and second ones should be changed to INFO. The rest shouldn't worry you too much: we try to get domain information by running vgs, and since it's NFS, vgs fails with RC 5. Maybe one day we will change the implementation to query domain stats according to storage type, rather than having one class for all storage domains.

Haim

----- Original Message -----
From: peter houseman peterhouseman...@gmail.com
To: users@ovirt.org
Sent: Sunday, January 13, 2013 11:01:57 PM
Subject: [Users] centos issues..

Hi,

I am currently trying to get the oVirt engine and nodes up and running on CentOS 6u3. Unfortunately my lab does not have direct internet access, so the oVirt repo has been copied over from the people.centos.org/hughesjr/ovirt31 repo, as recommended by the oVirt howto on the CentOS Wiki.

Everything installs fine with no dependency errors, but as soon as I create storage domains in the engine, warning messages appear in the vdsm.logs on the hosts for the ISO and Data domains (retyped below):

Warning... 390::3d::363:: Storage.StorageDomain::(_registerResourceNamespaces) Resource namespace 72xx_volumeNS already registered

plus a similar warning message as above, but imageNS already registered

plus

Warning Storage.LVM::(reloadvgs) lvm vgs failed:5 Volume group 5b not found

I have tried rebuilding the whole system and using NFS, iSCSI and Gluster data domains, but I still get the same warning messages.

Also, and I'm not sure if it's associated, I have noticed info messages in the oVirt engine log:

Autorecovering Storage domains is disabled, skipping

Even though I am getting storage warning messages, the VMs are up and I can log into them and run applications.

Any help appreciated.

Pete
Re: [Users] libvirt implimentation in oVirt
You can find the vdsm code under /usr/share/vdsm, then try your grep:

grep libvirt * | grep import

----- Original Message -----
From: Arindam Choudhury arindamchoudhu...@gmail.com
To: users@ovirt.org
Sent: Wednesday, January 9, 2013 1:51:18 PM
Subject: [Users] libvirt implimentation in oVirt

Hi,

I am a new user. I have downloaded the source code and, out of curiosity, I ran grep to find the code related to libvirt, but grepping for both import libvirt and import org.libvirt returned nothing. Where is the libvirt-related code, then?

Sincerely,
Arindam Choudhury
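The grep suggested above only looks at files in the current directory. A recursive variant is sketched below; the function name find_libvirt_imports is made up, and the pattern is an assumption about how the imports are written (plain import or from lines). Because it matches on the prefix libvirt, it also reports imports of modules such as libvirtvm or libvirtconnection.

```shell
# Hypothetical helper: list Python import lines that mention libvirt,
# searching recursively under a directory (default: /usr/share/vdsm).
find_libvirt_imports() {
    grep -rnE --include='*.py' \
        '^[[:space:]]*(import|from)[[:space:]]+libvirt' \
        "${1:-/usr/share/vdsm}"
}
```

Each hit is printed as path:line-number:matching-line, which makes it easy to jump to the code in question.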
Re: [Users] VM migration policies - looking for the code handling it
Hi Yuval,

I would start with:

backend/manager/modules/bll/src/main/java/org/ovirt/engine/core/bll/VdsLoadBalancer.java

Haim

----- Original Message -----
From: Yuval M yuva...@gmail.com
To: users@ovirt.org
Cc: Limor Gavish lgav...@gmail.com
Sent: Tuesday, January 8, 2013 11:23:36 PM
Subject: [Users] VM migration policies - looking for the code handling it

Hello,

Can anyone point me to the place in the sources that deals with automatic VM migration policies, i.e. migrations that take place when a host is over 75% CPU or under other conditions?

Thanks!
Yuval
Re: [Users] ovirt 3.2 migrations failing
----- Original Message -----
From: Jonathan Horne jho...@skopos.us
To: Jonathan Horne jho...@skopos.us, users@ovirt.org
Sent: Tuesday, January 8, 2013 10:26:52 PM
Subject: Re: [Users] ovirt 3.2 migrations failing

so far i see this, and it looks related:

This only means libvirt can't find the guest on the host. Was it on the source or the destination? Please run the following.

On the source server:
- egrep 'vmMigrate|_setupVdsConnection' /var/log/vdsm/vdsm.log
- then, from the output, get the thread numbers of both commands and run grep again ($x and $y are placeholders for those numbers):
- egrep 'Thread-$x|Thread-$y' /var/log/vdsm/vdsm.log

On the destination server:
- egrep 'vmMigrationCreate|prepareImage' /var/log/vdsm/vdsm.log
- then, from the output, get the thread numbers of both commands and run grep again:
- egrep 'Thread-$x|Thread-$y' /var/log/vdsm/vdsm.log

Please paste it here.

Haim

Traceback (most recent call last):
  File /usr/share/vdsm/vm.py, line 676, in _startUnderlyingVm
    self._waitForIncomingMigrationFinish()
  File /usr/share/vdsm/libvirtvm.py, line 1757, in _waitForIncomingMigrationFinish
    self._connection.lookupByUUIDString(self.id),
  File /usr/lib64/python2.6/site-packages/vdsm/libvirtconnection.py, line 111, in wrapper
    ret = f(*args, **kwargs)
  File /usr/lib64/python2.6/site-packages/libvirt.py, line 2682, in lookupByUUIDString
    if ret is None:raise libvirtError('virDomainLookupByUUIDString() failed', conn=self)
libvirtError: Domain not found: no domain with matching uuid '063c7cbe-c569-4df3-b9a7-6474c41d797e'

From: Jonathan Horne jho...@skopos.us
Date: Tuesday, January 8, 2013 2:15 PM
To: users@ovirt.org users@ovirt.org
Subject: [Users] ovirt 3.2 migrations failing

i just built up 2 nodes and a manager on 3.2 dreyou packages, and now that i have a VM up and installed with rhev agent, the VM is unable to migrate. the failure is pretty much immediate. i don't know where to begin troubleshooting this, can someone help me get going in the right direction? just let me know what logs are appropriate and i will post them up.

thanks,
jonathan

This is a PRIVATE message. If you are not the intended recipient, please delete without copying and kindly advise us by e-mail of the mistake in delivery. NOTE: Regardless of content, this e-mail shall not operate to bind SKOPOS to any order or other contract unless pursuant to explicit written agreement or government initiative expressly permitting the use of e-mail for such purpose.
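The two-step grep described in this thread (find the lines that mention a command, then pull everything logged by the same threads) can be sketched as a single function. This is only an illustration: the name lines_for_matching_threads is made up, and it assumes that vdsm.log lines begin with a Thread-<number>:: prefix, as in the excerpts quoted on this list. Traceback continuation lines, which carry no prefix, are not included.

```shell
# Hypothetical helper: print every log line written by any thread that has
# at least one line matching PATTERN (an extended regular expression).
# Usage: lines_for_matching_threads LOGFILE PATTERN
lines_for_matching_threads() {
    local log="$1" pattern="$2" threads
    # Collect the distinct "Thread-N::" prefixes of the matching lines.
    threads=$(grep -E "$pattern" "$log" | grep -oE '^Thread-[0-9]+' | sort -u | sed 's/$/::/')
    [ -n "$threads" ] || return 1
    # Second pass: fixed-string match on those prefixes.
    printf '%s\n' "$threads" | grep -F -f - "$log"
}
```

On the source host this would be called as lines_for_matching_threads /var/log/vdsm/vdsm.log 'vmMigrate|_setupVdsConnection', and on the destination with 'vmMigrationCreate|prepareImage'.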
Re: [Users] Storage domain mount options
----- Original Message -----
From: Shu Ming shum...@linux.vnet.ibm.com
To: Alexandru Vladulescu avladule...@bfproject.ro
Cc: users users@ovirt.org
Sent: Tuesday, January 8, 2013 3:48:06 PM
Subject: Re: [Users] Storage domain mount options

Alexandru Vladulescu:

Dear Shu,

Exactly what I was saying: in that section, under Advanced Parameters, we can only change the NFS version, the Retransmissions and the Timeout.

Sorry, I was not seeing these. I am afraid you cannot append other options, since neither the oVirt engine nor VDSM accepts options other than Retransmissions and Timeout.

Hi,

First, please open an RFE for it:

https://bugzilla.redhat.com/enter_bug.cgi?product=oVirt

Second, you can set it in the vdsm.conf configuration file (/etc/vdsm/vdsm.conf). As for how to do it, check out the sample config file distributed with the rpm:

/usr/share/doc/vdsm-4.9.6/vdsm.conf.sample

Make sure to restart the vdsmd service on each hypervisor afterwards.

Haim

Therefore I see no field at all for additional mount options. As far as I know, there is a thread on the mailing list about the new features that should be included in the 3.2 release. I think it would be useful to include this one in the GUI too, but again my question: how could this be changed in the stable 3.1 release?

Alex

On 01/08/2013 03:31 PM, Shu Ming wrote:

Alexandru Vladulescu:

Hello Guys,

I would like to ask for your help with the following problem I am facing right now. As we know, on the Storage tab we can configure storage domains of many types. I am using NFS, and when configuring such an export, in the Advanced Parameters section we can only adjust the NFS version, Retrans and Timeout.

After a successful configuration, the share is mounted on all hypervisors. The result of mount for the datastore share looks like this:

nas01.net:/datastore01/nas01.ISO on /rhev/data-center/mnt/nas01.net:_datastore01_nas01.ISO type nfs (rw,soft,nosharecache,timeo=10,retrans=6,vers=4,addr=10.20.30.10,clientaddr=10.20.30.102)

All good and sound, but how can I tune the NFS mount command, for example by adding noatime to the mount option list?

New domain --- Advanced Parameters.

Many thanks,
Alex.

--
舒明 Shu Ming
Open Virtualization Engineering; CSTL, IBM Corp.
Tel: 86-10-82451626 Tieline: 9051626
E-mail: shum...@cn.ibm.com or shum...@linux.vnet.ibm.com
Address: 3/F Ring Building, ZhongGuanCun Software Park, Haidian District, Beijing 100193, PRC
Re: [Users] oVirt 3.1 - VM Migration Issue
, line 474, in _startUnderlyingMigration
    None, maxBandwidth)
  File /usr/share/vdsm/libvirtvm.py, line 510, in f
    ret = attr(*args, **kwargs)
  File /usr/lib64/python2.6/site-packages/vdsm/libvirtconnection.py, line 83, in wrapper
    ret = f(*args, **kwargs)
  File /usr/lib64/python2.6/site-packages/libvirt.py, line 1103, in migrateToURI2
    if ret == -1: raise libvirtError ('virDomainMigrateToURI2() failed', dom=self)
libvirtError: internal error Process exited while reading console log output:

Any chance you could attach libvirtd.log and the qemu log (/var/log/libvirt/qemu/{}.log)?

Danken - any insights?

----- Original Message -----
From: Tom Brown t...@ng23.net
To: Roy Golan rgo...@redhat.com
Cc: Haim Ateya hat...@redhat.com, users@ovirt.org
Sent: Tuesday, January 8, 2013 11:50:26 AM
Subject: Re: [Users] oVirt 3.1 - VM Migration Issue

Can you attach the same snip from the src VDSM 10.192.42.196 as well?

The log is pretty chatty, so I did another migration attempt and snipped the new log from both sides. See attached.
Re: [Users] Configure NFS resource from Host
Hi,

Can you please run the following and tell me if it works for you:

/usr/bin/sudo -n /usr/bin/mount -t nfs -o soft,nosharecache,timeo=600,retrans=6 ovirt-host.uab.es:/home/storage1 /mnt

Are you using NFSv3 or NFSv4? If it's not working, can you try:

/usr/bin/sudo -n /usr/bin/mount -t nfs -o soft,vers=3,nosharecache,timeo=600,retrans=6 ovirt-host.uab.es:/home/storage1 /mnt

----- Original Message -----
From: jj197005 jj197...@gmail.com
To: users@ovirt.org
Sent: Monday, January 7, 2013 6:04:37 PM
Subject: [Users] Configure NFS resource from Host

Hello everybody,

I'm trying to configure and install oVirt for testing in our university faculty. I have installed an engine and a host, both of them with Fedora 17. The engine is working and I can log in to the Data Center with the admin user. I tried to add my host in the data center portal and did so successfully. After that I tried to add an NFS resource which is on the host that I added.

If I open a console on the engine and log in as the vdsm user, I can mount the NFS resource without problems. The problem occurs when I try to add this NFS resource to the Data Center. I followed the tutorial at http://www.ovirt.org/Quick_Start_Guide#Configure_Storage, but when I complete the form and press OK, after one or two minutes I receive a screen with the error below:

Error: A Request to the Server failed with the following Status Code: 500

I'm attaching the vdsm.log file with the lines that are created by this operation. I hope that someone can show me what the problem is. If you need more information about the installation I can provide it.

Many thanks in advance,
Juanjo.
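The two mount attempts suggested in the reply above (default NFS version first, then vers=3) can be sketched as one function. The name mount_nfs_with_fallback and the MOUNT_CMD override are hypothetical and exist only to make the sketch easy to try without touching real mounts. The option strings are the ones quoted in the reply; sudo is left out, so run it as root or wrap the call yourself.

```shell
# Hypothetical helper: try an NFS mount with the default protocol version,
# then retry with vers=3 if the first attempt fails.
# Usage: mount_nfs_with_fallback SERVER:/EXPORT MOUNTPOINT
mount_nfs_with_fallback() {
    local export_path="$1" target="$2"
    local mount_cmd="${MOUNT_CMD:-mount}"   # assumption: override for dry runs
    if "$mount_cmd" -t nfs -o "soft,nosharecache,timeo=600,retrans=6" "$export_path" "$target"; then
        return 0
    fi
    # Fallback: force NFSv3.
    "$mount_cmd" -t nfs -o "soft,vers=3,nosharecache,timeo=600,retrans=6" "$export_path" "$target"
}
```

If only the second attempt succeeds, that points at an NFSv4 problem between host and server rather than at oVirt itself.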
Re: [Users] ISO path is not empty
I don't see any particular reason for blocking this in general, since we create our domain structure under /ISO/sdUUID/.., so it doesn't really matter. Please open a bug for it.

Haim

----- Original Message -----
From: Mohsen Saeedi mohsen.sae...@gmail.com
To: users@ovirt.org
Sent: Friday, January 4, 2013 10:49:25 AM
Subject: [Users] ISO path is not empty

Hi,

I have a problem with oVirt engine-setup. When I run it, the setup process checks the local ISO path and prints an error:

directory /ISO is not empty

I made a new partition, created an ext4 filesystem on it and mounted it under /ISO. As we know, it has a lost+found directory. I think this should be fixed in a newer version.

Thanks.
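To illustrate the kind of check the report asks for, here is a minimal sketch of an emptiness test that tolerates the lost+found directory of a freshly created ext4 filesystem. This is an assumption about how such a check could look, not the actual engine-setup code; the function name and the ignore list are hypothetical.

```python
import os

# Entries that a freshly formatted filesystem creates on its own.
# Hypothetical ignore list, chosen for this illustration.
IGNORED_ENTRIES = frozenset(["lost+found"])


def is_effectively_empty(path, ignored=IGNORED_ENTRIES):
    """Return True if path contains nothing except ignored entries."""
    return all(entry in ignored for entry in os.listdir(path))
```

With such a check, a mount point like /ISO that holds only lost+found would be accepted, while a directory with any other content would still be rejected.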
Re: [Users] Failed to import Vm from export to storagedomain
Hi Ricky,

This is really interesting: the VM process failed to start because libvirt detected double use of the same PCI address:

Thread-3111::ERROR::2013-01-03 16:30:27,373::vm::617::vm.Vm::(_startUnderlyingVm) vmId=`9741c58b-e7b2-41d8-9f35-8ea79ca81528`::The vm start process failed
Traceback (most recent call last):
  File "/usr/share/vdsm/vm.py", line 579, in _startUnderlyingVm
    self._run()
  File "/usr/share/vdsm/libvirtvm.py", line 1421, in _run
    self._connection.createXML(domxml, flags),
  File "/usr/lib64/python2.6/site-packages/vdsm/libvirtconnection.py", line 83, in wrapper
    ret = f(*args, **kwargs)
  File "/usr/lib64/python2.6/site-packages/libvirt.py", line 2489, in createXML
    if ret is None:raise libvirtError('virDomainCreateXML() failed', conn=self)
libvirtError: XML error: Attempted double use of PCI Address '0:0:1.2' (may need multifunction='on' for device on function 0
Thread-3111::DEBUG::2013-01-03 16:30:27,377::vm::933::vm.Vm::(setDownStatus) vmId=`9741c58b-e7b2-41d8-9f35-8ea79ca81528`::Changed state to Down: XML error: Attempted double use of PCI Address '0:0:1.2' (may need multifunction='on' for device on function 0

From the VM XML, I see that you are trying to use six USB controller devices with the same PCI address:

<controller type="usb"><address bus="0x00" domain="0x" function="0x2" slot="0x01" type="pci"/></controller>
<controller type="usb"><address bus="0x00" domain="0x" function="0x2" slot="0x01" type="pci"/></controller>
<controller type="usb"><address bus="0x00" domain="0x" function="0x2" slot="0x01" type="pci"/></controller>
<controller type="usb"><address bus="0x00" domain="0x" function="0x2" slot="0x01" type="pci"/></controller>
<controller type="usb"><address bus="0x00" domain="0x" function="0x2" slot="0x01" type="pci"/></controller>
<controller type="usb"><address bus="0x00" domain="0x" function="0x2" slot="0x01" type="pci"/></controller>

We need to understand what went wrong there: was it the export attempt that created this problematic entry in the OVF file, or was it the import? Anyway, please open a bug for it.

Haim

----- Original Message -----
From: Ricky rockyba...@gmail.com
To: Users@ovirt.org
Sent: Thursday, January 3, 2013 7:00:21 PM
Subject: [Users] Failed to import Vm from export to storagedomain

Hi,

So, I finally reinstalled the whole cluster after exporting every VM. I followed this wiki: http://wiki.dreyou.org/dokuwiki/doku.php?id=ovirt_rpm_start31. Everything came up, except that I have only one host installed in the cluster; the other host is still serving the old cluster.

When trying to import my old VMs, I had problems with some of them not being imported. Three of my VMs were imported, and one of them was my mail server, but when I try to start that VM I hit the wall again.

I have attached the vdsm.log; right now I can't see the forest for the trees.

Regards
//Ricky
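The duplicate-address condition described in this thread can be checked before a domain XML is handed to libvirt. The following is a rough sketch using only the Python standard library; it assumes a well-formed libvirt domain XML in which every address attribute carries a full hexadecimal value, and the function name is hypothetical (it is not a vdsm or libvirt API).

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def find_duplicate_pci_addresses(domain_xml):
    """Map each PCI address used by more than one device to the device tags.

    Keys are (domain, bus, slot, function) tuples of integers.
    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(domain_xml)
    seen = defaultdict(list)
    for device in root.iter():
        # Only look at the <address> element that is a direct child.
        address = device.find("address")
        if address is None or address.get("type") != "pci":
            continue
        key = tuple(
            int(address.get(name, "0"), 16)
            for name in ("domain", "bus", "slot", "function")
        )
        seen[key].append(device.tag)
    return dict((key, tags) for key, tags in seen.items() if len(tags) > 1)
```

Fed an XML with several USB controllers at bus 0x00, slot 0x01, function 0x2, the function would report the key (0, 0, 1, 2), which corresponds to the address '0:0:1.2' in the libvirt error.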
Re: [Users] oVirt 3.1 - VM Migration Issue
interesting, please search for migrationCreate command on desination host and search for ERROR afterwords, what do you see? - Original Message - From: Tom Brown t...@ng23.net To: users@ovirt.org Sent: Thursday, January 3, 2013 4:12:05 PM Subject: [Users] oVirt 3.1 - VM Migration Issue Hi I seem to have an issue with a single VM and migration, other VM's can migrate OK - When migrating from the GUI it appears to just hang but in the engine.log i see the following 2013-01-03 14:03:10,359 INFO [org.ovirt.engine.core.bll.VdsSelector] (ajp--0.0.0.0-8009-59) Checking for a specific VDS only - id:a2d84a1e-3e18-11e2-8851-3cd92b4c8e89, name:ovirt-node.domain-name, host_name(ip):10.192.42.165 2013-01-03 14:03:10,411 INFO [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (pool-3-thread-48) [4d32917d] Running command: MigrateVmToServerCommand internal: false. Entities affected : ID: 9dc63ce4-0f76-4963-adfe-6f8eb1a44806 Type: VM 2013-01-03 14:03:10,413 INFO [org.ovirt.engine.core.bll.VdsSelector] (pool-3-thread-48) [4d32917d] Checking for a specific VDS only - id:a2d84a1e-3e18-11e2-8851-3cd92b4c8e89, name:ovirt-node.domain-name, host_name(ip):10.192.42.165 2013-01-03 14:03:11,028 INFO [org.ovirt.engine.core.vdsbroker.MigrateVDSCommand] (pool-3-thread-48) [4d32917d] START, MigrateVDSCommand(vdsId = 1a52b722-43a1-11e2-af96-3cd92b4c8e89, vmId=9dc63ce4-0f76-4963-adfe-6f8eb1a44806, srcHost=10.192.42.196, dstVdsId=a2d84a1e-3e18-11e2-8851-3cd92b4c8e89, dstHost=10.192.42.165:54321, migrationMethod=ONLINE), log id: 5011789b 2013-01-03 14:03:11,030 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateBrokerVDSCommand] (pool-3-thread-48) [4d32917d] VdsBroker::migrate::Entered (vm_guid=9dc63ce4-0f76-4963-adfe-6f8eb1a44806, srcHost=10.192.42.196, dstHost=10.192.42.165:54321, method=online 2013-01-03 14:03:11,031 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateBrokerVDSCommand] (pool-3-thread-48) [4d32917d] START, MigrateBrokerVDSCommand(vdsId = 
1a52b722-43a1-11e2-af96-3cd92b4c8e89, vmId=9dc63ce4-0f76-4963-adfe-6f8eb1a44806, srcHost=10.192.42.196, dstVdsId=a2d84a1e-3e18-11e2-8851-3cd92b4c8e89, dstHost=10.192.42.165:54321, migrationMethod=ONLINE), log id: 7cd53864 2013-01-03 14:03:11,041 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateBrokerVDSCommand] (pool-3-thread-48) [4d32917d] FINISH, MigrateBrokerVDSCommand, log id: 7cd53864 2013-01-03 14:03:11,086 INFO [org.ovirt.engine.core.vdsbroker.MigrateVDSCommand] (pool-3-thread-48) [4d32917d] FINISH, MigrateVDSCommand, return: MigratingFrom, log id: 5011789b 2013-01-03 14:03:11,606 INFO [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (QuartzScheduler_Worker-29) vds::refreshVmList vm id 9dc63ce4-0f76-4963-adfe-6f8eb1a44806 is migrating to vds ovirt-node.domain-name ignoring it in the refresh till migration is done 2013-01-03 14:03:12,836 INFO [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (QuartzScheduler_Worker-36) VM test002.domain-name 9dc63ce4-0f76-4963-adfe-6f8eb1a44806 moved from MigratingFrom -- Up 2013-01-03 14:03:12,837 INFO [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (QuartzScheduler_Worker-36) adding VM 9dc63ce4-0f76-4963-adfe-6f8eb1a44806 to re-run list 2013-01-03 14:03:12,852 ERROR [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] (QuartzScheduler_Worker-36) Rerun vm 9dc63ce4-0f76-4963-adfe-6f8eb1a44806. 
Called from vds ovirt-node002.domain-name 2013-01-03 14:03:12,855 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateStatusVDSCommand] (pool-3-thread-48) START, MigrateStatusVDSCommand(vdsId = 1a52b722-43a1-11e2-af96-3cd92b4c8e89, vmId=9dc63ce4-0f76-4963-adfe-6f8eb1a44806), log id: 4721a1f3 2013-01-03 14:03:12,864 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-3-thread-48) Failed in MigrateStatusVDS method 2013-01-03 14:03:12,865 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-3-thread-48) Error code migrateErr and error message VDSGenericException: VDSErrorException: Failed to MigrateStatusVDS, error = Fatal error during migration 2013-01-03 14:03:12,865 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-3-thread-48) Command org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateStatusVDSCommand return value Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.StatusOnlyReturnForXmlRpc mStatus Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.StatusForXmlRpc mCode 12 mMessage Fatal error during migration 2013-01-03 14:03:12,866 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-3-thread-48) Vds: ovirt-node002.itvonline.ads 2013-01-03 14:03:12,867 ERROR [org.ovirt.engine.core.vdsbroker.VDSCommandBase] (pool-3-thread-48) Command MigrateStatusVDS execution failed. Exception: VDSErrorException: VDSGenericException:
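The advice at the top of this thread (find the migrationCreate call in the destination host's vdsm.log, then look for ERROR entries after it) can be sketched in a few lines of Python. This assumes the vdsm log format seen elsewhere in this archive, where the level appears as a "::ERROR::" field; the function name is hypothetical.

```python
def errors_after(lines, marker="migrationCreate", level="::ERROR::"):
    """Yield log lines containing `level` that follow the first `marker` line.

    Lines before the marker, and the marker line itself, are skipped.
    Hypothetical helper for illustration only.
    """
    seen_marker = False
    for line in lines:
        if not seen_marker:
            seen_marker = marker in line
            continue
        if level in line:
            yield line.rstrip("\n")


if __name__ == "__main__":
    import sys

    # Usage: python errors_after.py /var/log/vdsm/vdsm.log
    with open(sys.argv[1]) as log:
        for entry in errors_after(log):
            print(entry)
```

Run on the destination host, this narrows a chatty log down to the errors that follow the incoming migration request.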
Re: [Users] ovirt 3.1 fails to attach gluster storage volume to data center
happened to debug this issue with Jithin on IRC, it looks like a regression in vdsm as connectStorageServer with posixFS (type=6) doesn't return. the local mount command for the gluster volume is working fine, which indicates its a fault in vdsm. can you please open a bug on this matter so we could track it better? currently, I see no way to workaround this issue. Haim - Original Message - From: Balamurugan Arumugam barum...@redhat.com To: Jithin Raju rajuj...@gmail.com Cc: users@ovirt.org Sent: Monday, December 31, 2012 5:28:50 PM Subject: Re: [Users] ovirt 3.1 fails to attach gluster storage volume to data center Hi Jithin, - Original Message - From: Jithin Raju rajuj...@gmail.com To: users@ovirt.org Sent: Monday, December 31, 2012 3:04:54 PM Subject: [Users] ovirt 3.1 fails to attach gluster storage volume to data center Hi, I have new setup for ovirt 3.1 installed on centos 6.3. Node centos 6.3,gluster 3.3.1. I was able to create gluster volume on node and activate but attaching it to data center fails. My understanding is that you are trying to use gluster volume as storage domain (Correct me if I am wrong). Could you give following details here? 1. Describe the flow you followed in the UI. Adding screenshot of error would help more. 2. Attach vdsm log from all nodes and engine log from ovirt engine. 
As per my chat with hat...@redhat.com I am submitting some of the traces below: [root@hedge /]# grep mount /var/log/vdsm/vdsm.log Thread-671::DEBUG::2012-08-23 16:44:32,344::__init__::1164::Storage.Misc.excCmd::(_log) '/usr/bin/sudo -n /bin/mount -t glusterfs -o vers=3 hedge:/vol2 /rhev/data-center/mnt/hedge:_vol2' (cwd None) [root@hedge /]# grep connectStorageServer /var/log/vdsm/vdsm.log Thread-671::INFO::2012-08-23 16:44:32,340::logUtils::37::dispatcher::(wrapper) Run and protect: connectStorageServer(domType=6, spUUID='----', conList=[{'port': '', 'connection': 'hedge:/vol2', 'mnt_options': 'vers=3', 'portal': '', 'user': '', 'iqn': '', 'vfs_type': 'glusterfs', 'password': '**', 'id': '----'}], options=None) Thread-671::INFO::2012-08-23 16:44:36,567::logUtils::39::dispatcher::(wrapper) Run and protect: connectStorageServer, Return response: {'statuslist': [{'status': 0, 'id': '----'}]} I am new to gluster so mostly I would have made some mistake.Any help is appreciated. Sure. Welcome to gluster! Regards, Bala ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] attempted live snapshot, machine paused, wouldn't restart
logs are not accesible. - Original Message - From: Erik Jacobs erikmjac...@gmail.com To: d...@redhat.com Cc: Haim Ateya hat...@redhat.com, mlipc...@redhat.com, users@ovirt.org Sent: Wednesday, December 26, 2012 9:05:20 PM Subject: Re: [Users] attempted live snapshot, machine paused, wouldn't restart OK... tried this again today after making some changes. Logs are here: http://50.73.95.6/logs/ Did a snapshot, looks like the VM got paused due to unknown storage error and then the VM could not be resumed. I've since manually stopped the VM, deleted the live snapshot, created a new snapshot from a stopped VM, and restarted the VM. Happy to try other stuff. On Sun, Dec 23, 2012 at 4:09 AM, Dafna Ron d...@redhat.com wrote: I do see that the vm was move to pause at some point but its hard to debug when we only have some of the log. I need the vdsm log from the spm and the host that the vm ran on + engine log (full logs please). Thanks, Dafna On 12/23/2012 05:54 AM, Erik Jacobs wrote: Hi all, Any thoughts here? On Wed, Dec 19, 2012 at 10:19 AM, Erik Jacobs erikmjac...@gmail.com mailto: erikmjac...@gmail.com wrote: Hi Haim, In-line responses follow. On Wed, Dec 19, 2012 at 4:02 AM, Haim Ateya hat...@redhat.com mailto: hat...@redhat.com wrote: Hi Erik, in order to debug it, we need some more details, such as engine and vdsm logs, can you please attach those logs from the moment you took the snapshot? http://pastie.org/5553030 (engine.log) http://pastebin.ca/2294934 (vdsm.log) from the log below, it seems as if vm resume operation succeeded and vm state changed to powering up. btw, did you have an operating system installed on this vm ? what was the status of the vm prior to the live snapshot attempt? Centos 5.6 guest that was running at the time I hit the create snapshot button. 
Status of my 3 storage domains: /dev/mapper/mediavg-121028_rhev 148G 21G 121G 15% /mnt/rhev /dev/mapper/mediavg-121028_rheviso 20G 4.2G 15G 23% /mnt/rheviso /dev/mapper/mediavg-121028_rhevimport 9.9G 5.1G 4.3G 55% /mnt/rhevimport Thanks! -E Thanks. Haim - Original Message - From: Erik Jacobs erikmjac...@gmail.com mailto: erikmjac...@gmail.com To: users@ovirt.org mailto: users@ovirt.org Sent: Wednesday, December 19, 2012 5:58:18 AM Subject: [Users] attempted live snapshot, machine paused, wouldn't restart I attempted to take a snapshot of a machine while it was running. I noticed that the machine was paused, and then I attempted to resume it. The machine looked like it was going to launch, but the events tab simply indicated that execution of launching the vm failed. Here's the engine.log at the time: 2012-12-18 22:53:55,838 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IsValidVDSCommand] (ajp--0.0.0.0-8009-5) START, IsValidVDSCommand(storagePoolId = 2ccd03b1-fd0e-4578-88ce-e5065a9742d7, ignoreFailoverLimit = false, compatabilityVersion = null), log id: 19399f32 2012-12-18 22:53:55,848 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IsValidVDSCommand] (ajp--0.0.0.0-8009-5) FINISH, IsValidVDSCommand, return: true, log id: 19399f32 2012-12-18 22:53:55,894 INFO [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (ajp--0.0.0.0-8009-5) START, IsVmDuringInitiatingVDSCommand(vmId = 35d58779-ee24-4926-8612-e053ff48881b), log id: 76d4a2b2 2012-12-18 22:53:55,895 INFO [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (ajp--0.0.0.0-8009-5) FINISH, IsVmDuringInitiatingVDSCommand, return: false, log id: 76d4a2b2 2012-12-18 22:53:55,935 INFO [org.ovirt.engine.core.bll.RunVmCommand] (pool-3-thread-49) [17252a1c] Lock Acquired to object EngineLock [exclusiveLocks= key: 35d58779-ee24-4926-8612-e053ff48881b value: VM , sharedLocks= ] 2012-12-18 22:53:55,952 INFO [org.ovirt.engine.core.bll.RunVmCommand] (pool-3-thread-49) [17252a1c] Running command: 
RunVmCommand internal: false. Entities affected : ID: 35d58779-ee24-4926-8612-e053ff48881b Type: VM 2012-12-18 22:53:55,966 INFO [org.ovirt.engine.core.vdsbroker.ResumeVDSCommand] (pool-3-thread-49) [17252a1c] START, ResumeVDSCommand(vdsId = 0a7046ea-216d-11e2-8fe8-001372eb596b, vmId=35d58779-ee24-4926-8612-e053ff48881b), log id: 607d6dc2 2012-12-18 22:53:55,976 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.ResumeBrokerVDSCommand] (pool-3-thread-49) [17252a1c] START, ResumeBrokerVDSCommand(vdsId = 0a7046ea-216d-11e2-8fe8-001372eb596b, vmId=35d58779-ee24-4926-8612-e053ff48881b), log id: 66b1872c 2012-12-18 22:53:56,292 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.ResumeBrokerVDSCommand] (pool-3-thread-49) [17252a1c] FINISH, ResumeBrokerVDSCommand, log id: 66b1872c
Re: [Users] Problems when trying to delete a snapshot
Hi Ricky, from going over your logs, it seems like create snapshot failed, its logged clearly in both engine and vdsm logs [1]. did you try to delete this snapshot or was it a different one? if so, not sure its worth debugging. bee7-78e7d1cbc201, vmId=d41b4ebe-3631-4bc1-805c-d762c636ca5a), log id: 46d21393 2012-12-13 10:40:24,372 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-5-thread-50) [12561529] Failed in SnapshotVDS method 2012-12-13 10:40:24,372 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-5-thread-50) [12561529] Error code SNAPSHOT_FAILED and error message VDSGenericException: VDSErrorException: Failed to SnapshotVDS, error = Snapshot failed 2012-12-13 10:40:24,372 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-5-thread-50) [12561529] Command org.ovirt.engine.core.vdsbroker.vdsbroker.SnapshotVDSCommand return value Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.StatusOnlyReturnForXmlRpc mStatus Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.StatusForXmlRpc mCode 48 mMessage Snapshot failed enter/6d91788c-99d9-11e1-b913-78e7d1cbc201/mastersd/master/tasks/21cbcc25-7672-4704-a414-a44f5e9944ed temp /rhev/data-center/6d91788c-99d9-11e1-b913-78e7d1cbc201/mastersd/maste r/tasks/21cbcc25-7672-4704-a414-a44f5e9944ed.temp 21cbcc25-7672-4704-a414-a44f5e9944ed::ERROR::2012-12-14 10:48:41,189::volume::492::Storage.Volume::(create) Unexpected error Traceback (most recent call last): File /usr/share/vdsm/storage/volume.py, line 475, in create srcVolUUID, imgPath, volPath) File /usr/share/vdsm/storage/fileVolume.py, line 138, in _create oop.getProcessPool(dom.sdUUID).createSparseFile(volPath, sizeBytes) File /usr/share/vdsm/storage/remoteFileHandler.py, line 277, in callCrabRPCFunction *args, **kwargs) File /usr/share/vdsm/storage/remoteFileHandler.py, line 195, in callCrabRPCFunction raise err IOError: [Errno 27] File too large 2012-12-13 10:40:24,372 INFO 
[org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (pool-5-thread-50) [12561529] Vds: virthost01 2012-12-13 10:40:24,372 ERROR [org.ovirt.engine.core.vdsbroker.VDSCommandBase] (pool-5-thread-50) [12561529] Command SnapshotVDS execution failed. Exception: VDSErrorException: VDSGenericException: VDSErrorException: Failed to SnapshotVDS, error = Snapshot failed 2012-12-13 10:40:24,373 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.SnapshotVDSCommand] (pool-5-thread-50) [12561529] FINISH, SnapshotVDSCommand, log id: 46d21393 2012-12-13 10:40:24,373 ERROR [org.ovirt.engine.core.bll.CreateAllSnapshotsFromVmCommand] (pool-5-thread-50) [12561529] Wasnt able to live snpashot due to error: VdcBLLException: VdcBLLException: org.ovirt.engine.core.vdsbroker.vdsbroker.VDSErrorException: VDSGenericException: VDSErrorException: Failed to SnapshotVDS, error = Snapshot failed, rolling back. 2012-12-13 10:40:24,376 ERROR [org.ovirt.engine.core.bll.CreateSnapshotCommand] (pool-5-thread-50) [4fd6c4e4] Ending command with failure: org.ovirt.engine.core.bll.CreateSnapshotCommand 2012-12-13 1 21cbcc25-7672-4704-a414-a44f5e9944ed::ERROR::2012-12-14 10:48:41,196::task::833::TaskManager.Task::(_setError) Task=`21cbcc25-7672-4704-a414-a44f5e9944ed`::Unexpected error Traceback (most recent call last): File /usr/share/vdsm/storage/task.py, line 840, in _run return fn(*args, **kargs) File /usr/share/vdsm/storage/task.py, line 307, in run return self.cmd(*self.argslist, **self.argsdict) File /usr/share/vdsm/storage/securable.py, line 68, in wrapper return f(self, *args, **kwargs) File /usr/share/vdsm/storage/sp.py, line 1903, in createVolume srcImgUUID=srcImgUUID, srcVolUUID=srcVolUUID) File /usr/share/vdsm/storage/fileSD.py, line 258, in createVolume volUUID, desc, srcImgUUID, srcVolUUID) File /usr/share/vdsm/storage/volume.py, line 494, in create (volUUID, e)) VolumeCreationError: Error creating a new volume: ('Volume creation 6da02c1e-5ef5-4fab-9ab2-bb081b35e7b3 failed: [Errno 27] 
File too large',) - Original Message - From: Ricky Schneberger ri...@schneberger.se To: Haim Ateya hat...@redhat.com Cc: users@ovirt.org Sent: Thursday, December 20, 2012 5:52:10 PM Subject: Re: [Users] Problems when trying to delete a snapshot -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Hi, The task did not finished but it broked my VM. What I have right now is a VM with a base-image and a snapshot that I need to merge together so I can import the disk in a new VM. I have attached the logs and even the output from the tree-command. Regards // Ricky On 2012-12-16 08:35, Haim Ateya wrote: please attach full engine and vdsm log from SPM machine. also, did the task finished ? please run tree command for /rhev/data-center/. - Original Message - From: Ricky Schneberger ri
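Both vdsm tracebacks above end in "[Errno 27] File too large". A small sketch for turning such a number into its symbolic name is shown below; it relies only on the Python standard library, the helper name is hypothetical, and the mapping of 27 to EFBIG is stated for Linux.

```python
import errno
import os


def describe_errno(code):
    """Return (symbolic name, system message) for an errno value.

    Hypothetical helper for illustration only.
    """
    return errno.errorcode.get(code, "UNKNOWN"), os.strerror(code)


if __name__ == "__main__":
    name, message = describe_errno(27)
    print("%s: %s" % (name, message))
```

EFBIG is generally returned when a write or truncate would exceed the maximum file size the filesystem supports or the file size limit of the process, so the maximum file size of the storage behind this domain may be worth checking against the size of the volume being created.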
Re: [Users] Fwd: Power outage = storage outage = ovirt crash with mismatched master storage domain?
Hi Ian,

I would really like to understand how this happened. There are several ways to fix this issue; for a start, please send us the engine and vdsm logs.

As for fixing the issue, we will probably have to perform some manual intervention on either the engine database (preferable) or the storage metadata. It's a bit complex and might cause other things to fail if not done carefully, so it's your call.

----- Original Message -----
From: Ian Forde ianfo...@gmail.com
To: users@ovirt.org
Sent: Sunday, December 30, 2012 12:31:31 PM
Subject: [Users] Fwd: Power outage = storage outage = ovirt crash with mismatched master storage domain?

(resend, as it might have tried to go through before my subscription to the list was active...)

---------- Forwarded message ----------
From: Ian Forde ianfo...@gmail.com
Date: Sun, Dec 30, 2012 at 1:57 AM
Subject: Power outage = storage outage = ovirt crash with mismatched master storage domain?
To: users@ovirt.org

Hi all -

I'm running oVirt from the dreyou packages (ovirt-engine 3.1.0-3.26) on CentOS 6.3, with 3 hypervisors all running CentOS 6.3 with dreyou-packaged vdsm 4.10.1-0.77.20. Storage is a Synology 1812+. Tonight I had a series of power events that affected the NAS. (So much for the small UPS. Storage has now been moved to a bigger UPS.)

Anyway, it turns out that my storage went down 3 times while I was out. The NAS recovered. oVirt didn't. I've tried putting boxes into maintenance mode. I've tried reboots and "confirm reboot". I've even shut everything down and brought it back up from scratch. What I'm facing now is a message in the GUI that states:

Sync Error on Master Domain between Host bv-hv01 and oVirt Engine. Domain BV4Pool1 is marked as Master in oVirt Engine database but not on the Storage side. Please consult with Support on how to fix this issue.

Now, what was *supposed* to be my master storage domain is named BV4Data-Master, and BV4Pool1 is where the VM disks are stored. (I'm not crazy about putting VM disks into the master pool.) So why does the database have the wrong entry, and how do I fix it?

Currently everything is *down*, given that the last time I tried to mess with oVirt innards I had to recreate my DC and restore all of the VMs by hand. Not something I'd like to do again... (This, of course, also means that I need to get proper oVirt database backups going...)

Any ideas?

-I
Re: [Users] Report size of thin provisioned disk
----- Original Message -----
> From: Jeff Bailey bai...@cs.kent.edu
> To: users@ovirt.org
> Sent: Thursday, December 20, 2012 12:09:35 AM
> Subject: [Users] Report size of thin provisioned disk
>
> With 3.2 alpha, on the disks subtab of the virtual machines tab I'm seeing the correct virtual size for the disk, but the actual size remains 1GB even though the drive has grown to 11GB (according to lvs output). It doesn't seem to be causing any problems. The auto-growth (from my perspective) is working much smoother than it did a release ago :) I just wondered if anyone else had noticed this.

I haven't seen it. Dafna, have you ever encountered such a case?

Anyway, can you attach a dump of your database along with the output of the following:

tree /rhev/data-center/
for i in `vdsClient -s 0 getStorageDomainsList`; do vgs $i -o+lv_name,lv_tags >> out.txt ; done

Also, please attach the engine and vdsm logs.

> Thanks,
> Jeff
Re: [Users] attempted live snapshot, machine paused, wouldn't restart
Hi Erik, in order to debug it, we need some more details, such as engine and vdsm logs, can you please attach those logs from the moment you took the snapshot? from the log below, it seems as if vm resume operation succeeded and vm state changed to powering up. btw, did you have an operating system installed on this vm ? what was the status of the vm prior to the live snapshot attempt? Thanks. Haim - Original Message - From: Erik Jacobs erikmjac...@gmail.com To: users@ovirt.org Sent: Wednesday, December 19, 2012 5:58:18 AM Subject: [Users] attempted live snapshot, machine paused, wouldn't restart I attempted to take a snapshot of a machine while it was running. I noticed that the machine was paused, and then I attempted to resume it. The machine looked like it was going to launch, but the events tab simply indicated that execution of launching the vm failed. Here's the engine.log at the time: 2012-12-18 22:53:55,838 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IsValidVDSCommand] (ajp--0.0.0.0-8009-5) START, IsValidVDSCommand(storagePoolId = 2ccd03b1-fd0e-4578-88ce-e5065a9742d7, ignoreFailoverLimit = false, compatabilityVersion = null), log id: 19399f32 2012-12-18 22:53:55,848 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IsValidVDSCommand] (ajp--0.0.0.0-8009-5) FINISH, IsValidVDSCommand, return: true, log id: 19399f32 2012-12-18 22:53:55,894 INFO [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (ajp--0.0.0.0-8009-5) START, IsVmDuringInitiatingVDSCommand(vmId = 35d58779-ee24-4926-8612-e053ff48881b), log id: 76d4a2b2 2012-12-18 22:53:55,895 INFO [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (ajp--0.0.0.0-8009-5) FINISH, IsVmDuringInitiatingVDSCommand, return: false, log id: 76d4a2b2 2012-12-18 22:53:55,935 INFO [org.ovirt.engine.core.bll.RunVmCommand] (pool-3-thread-49) [17252a1c] Lock Acquired to object EngineLock [exclusiveLocks= key: 35d58779-ee24-4926-8612-e053ff48881b value: VM , sharedLocks= ] 2012-12-18 22:53:55,952 INFO 
[org.ovirt.engine.core.bll.RunVmCommand] (pool-3-thread-49) [17252a1c] Running command: RunVmCommand internal: false. Entities affected : ID: 35d58779-ee24-4926-8612-e053ff48881b Type: VM 2012-12-18 22:53:55,966 INFO [org.ovirt.engine.core.vdsbroker.ResumeVDSCommand] (pool-3-thread-49) [17252a1c] START, ResumeVDSCommand(vdsId = 0a7046ea-216d-11e2-8fe8-001372eb596b, vmId=35d58779-ee24-4926-8612-e053ff48881b), log id: 607d6dc2 2012-12-18 22:53:55,976 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.ResumeBrokerVDSCommand] (pool-3-thread-49) [17252a1c] START, ResumeBrokerVDSCommand(vdsId = 0a7046ea-216d-11e2-8fe8-001372eb596b, vmId=35d58779-ee24-4926-8612-e053ff48881b), log id: 66b1872c 2012-12-18 22:53:56,292 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.ResumeBrokerVDSCommand] (pool-3-thread-49) [17252a1c] FINISH, ResumeBrokerVDSCommand, log id: 66b1872c 2012-12-18 22:53:56,298 INFO [org.ovirt.engine.core.vdsbroker.ResumeVDSCommand] (pool-3-thread-49) [17252a1c] FINISH, ResumeVDSCommand, return: PoweringUp, log id: 607d6dc2 2012-12-18 22:53:56,302 INFO [org.ovirt.engine.core.bll.RunVmCommand] (pool-3-thread-49) [17252a1c] Lock freed to object EngineLock [exclusiveLocks= key: 35d58779-ee24-4926-8612-e053ff48881b value: VM , sharedLocks= ] I'm not sure if I was missing something, or if I needed to wait for something to complete, or what. I ended up just stopping the machine and then starting it from cold. Any thoughts here? I'm happy to attempt this again and to see if it breaks the same way and to capture more data. Cheers, -- Erik Jacobs ATLElite / www.atlelite.com DGTrials / www.DGTrials.com FestiveGarage / www.FestiveGarage.com Riding Resource / www.RidingResource.com www.erikjacobs.com (C) 646-284-3482 (F) 404-585-4409 ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Problem with VDSM becoming unresponsive
Please attach the vdsm log, libvirtd.log, the sanlock log, and messages. What is the status of the following services: vdsmd, libvirtd, sanlock?

----- Original Message -----
From: Daniel Rowe daniel.fatho...@gmail.com
To: users@ovirt.org
Sent: Monday, December 17, 2012 2:43:34 AM
Subject: [Users] Problem with VDSM becoming unresponsive

Hi,

I have an issue with a one-node oVirt setup with gluster storage. Every so often (twice a week or so) the management console shows the node as unresponsive.

I can ssh to the node fine and it is indeed responsive. I can see the VM processes taking CPU with top. I can see the VDSM process as well.

Restarting vdsmd brings the node back to the Up state, but I have to restart all the VMs that were running.

In the engine log file (attached) there is a "VDS::handleNetworkException Server failed to respond" error.

I am not sure how I can fix this, so any help is appreciated.

Regards
Daniel
Re: [Users] Problems when trying to delete a snapshot
Please attach the full engine and vdsm logs from the SPM machine. Also, did the task finish? Please run the tree command on /rhev/data-center/.

----- Original Message -----
From: Ricky Schneberger ri...@schneberger.se
To: users@ovirt.org
Sent: Friday, December 14, 2012 3:16:58 PM
Subject: [Users] Problems when trying to delete a snapshot

I was trying to delete a snapshot from one of my VMs, and everything started fine.

The disk image is a thin-provisioned 100GB disk with 8GB of data. I had just one snapshot, and that was the one I started to delete. After more than two hours I looked in the folder with that VM's disk images and found a newly created file with a size of around 650GB, and it was still growing.

-rw-rw----. 1 vdsm kvm 8789950464 14 dec 12.23 8ede8e53-1323-442b-84f2-3c94114c64cf
-rw-r--r--. 1 vdsm kvm 681499951104 14 dec 14.10 8ede8e53-1323-442b-84f2-3c94114c64cf_MERGE
-rw-r--r--. 1 vdsm kvm 272 14 dec 12.24 8ede8e53-1323-442b-84f2-3c94114c64cf.meta
-rw-rw----. 1 vdsm kvm 107382439936 6 jun 2012 b4a43421-728b-4204-a389-607221d945b7
-rw-r--r--. 1 vdsm kvm 282 14 dec 12.24 b4a43421-728b-4204-a389-607221d945b7.meta

Any idea what is happening?

Regards
--
Ricky
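To put the byte counts from the directory listing above into perspective, here is a small sketch that converts them to GiB. The sizes are copied from the listing; the helper name to_gib is an arbitrary choice for this illustration.

```python
def to_gib(size_bytes):
    """Convert a size in bytes to GiB (2**30 bytes)."""
    return size_bytes / float(1024 ** 3)


# File sizes in bytes, as reported by ls in the message above.
IMAGE_SIZES = {
    "8ede8e53-1323-442b-84f2-3c94114c64cf": 8789950464,
    "8ede8e53-1323-442b-84f2-3c94114c64cf_MERGE": 681499951104,
    "b4a43421-728b-4204-a389-607221d945b7": 107382439936,
}

if __name__ == "__main__":
    for name, size in sorted(IMAGE_SIZES.items()):
        print("%-45s %8.2f GiB" % (name, to_gib(size)))
```

The _MERGE file comes out at roughly 635 GiB, more than six times the size of the 100 GiB image it sits next to, which is consistent with the reporter's impression that the merge was growing without bound.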
Re: [Users] No Master Storage Domain added
Hi Matt,

Any chance you can attach both the engine and vdsm logs? Also, what type of storage are you using: block or NFS? If it's block, please provide the output of lsblk, multipath -ll, vgs -o +pv_name, and dmsetup ls.

----- Original Message -----
From: Matt . yamakasi@gmail.com
To: users@ovirt.org
Sent: Sunday, November 18, 2012 10:08:51 PM
Subject: [Users] No Master Storage Domain added

Hi All,

I have a strange issue at the moment, after clearing my installation and running a new engine-setup.

When I want to attach a data domain to a datacenter, I always get the message that I need to activate my Master Storage Domain, but it isn't there.

As far as I know, the Master Storage Domain is the first Storage Domain added, which becomes Master automatically, but after a lot of tries and retries I need some help.

Of course I have checked my logs, but I haven't found anything there yet.

Any idea what I am doing wrong?

Cheers,
Re: [Users] SELinux policy issue with oVirt/sanlock
- Original Message - From: Brian Vetter bjvet...@gmail.com To: Haim Ateya hat...@redhat.com Cc: users@ovirt.org, seli...@lists.fedoraproject.org Sent: Wednesday, October 24, 2012 6:24:31 PM Subject: Re: [Users] SELinux policy issue with oVirt/sanlock I removed lock_manager=sanlock from the settings file, restarted the daemons, and all works fine right now. I'm guessing that means there is no locking of the VMs (the default?). that's right, i'm glad it works for you, but it just a workaround since we expect this configuration to work, it would be much appreciated if you could open a bug on that issue so we can track and resolve when possible. please attach all required logs such as: vdsm.log, libvirtd.log, qemu.log (under /var/log/libvirt/qemu/), audit.log, sanlock.log and /var/log/messages. thanks, Haim In any case, the setting of the lock_manager to sanlock was not done by myself but presumably via the host/vdsm installation on my fc17 host. So if that is the desired setting, then there appears to be an issue with selinux policies, nfs storage for VMs, and sanlock that still needs to be resolved in the nightly builds. Brian On Oct 24, 2012, at 9:51 AM, Haim Ateya wrote: - Original Message - From: Brian Vetter bjvet...@gmail.com To: Haim Ateya hat...@redhat.com Cc: users@ovirt.org, seli...@lists.fedoraproject.org Sent: Wednesday, October 24, 2012 4:11:17 PM Subject: Re: [Users] SELinux policy issue with oVirt/sanlock Here you go # getsebool -a | grep sanlock sanlock_use_fusefs -- off sanlock_use_nfs -- on sanlock_use_samba -- off virt_use_sanlock -- on # grep -v -e ^# -e ^$ /etc/libvirt/qemu.conf dynamic_ownership=0 spice_tls=1 spice_tls_x509_cert_dir=/etc/pki/vdsm/libvirt-spice lock_manager=sanlock this entry looks problematic to me (use sanlock as lock manager of the vms), please comment this entry, restart libvirt and vdsm, and try again. 
On Oct 24, 2012, at 1:07 AM, Haim Ateya wrote:

Hi Brian, please run the following commands and paste your output:

getsebool -a | grep sanlock
cat /etc/libvirt/qemu.conf

- Original Message - From: Brian Vetter bjvet...@gmail.com To: seli...@lists.fedoraproject.org Cc: users@ovirt.org Sent: Wednesday, October 24, 2012 6:34:07 AM Subject: [Users] SELinux policy issue with oVirt/sanlock

I get the following AVC msg when trying to run a VM from the ovirt admin tool:

type=AVC msg=audit(1351051834.851:720): avc: denied { read } for pid=979 comm=sanlock name=8798edc0-dbd2-466d-8be9-1997f63e196f dev=dm-4 ino=3145737 scontext=system_u:system_r:sanlock_t:s0-s0:c0.c1023 tcontext=system_u:object_r:mnt_t:s0 tclass=lnk_file

The file it is attempting to read is, I believe (from the sanlock.log file), the following:

# ls -lZ /rhev/data-center/a8ea368c-bc08-4e10-81e7-c8439bf7bd35/8798edc0-dbd2-466d-8be9-1997f63e196f/images/b029b5a6-9eb3-4a34-ad03-1ac4386e8c7c/71252c8f-68a9-495f-b5a6-4e8e035b56ea.lease
-rw-rw. vdsm kvm system_u:object_r:nfs_t:s0 /rhev/data-center/a8ea368c-bc08-4e10-81e7-c8439bf7bd35/8798edc0-dbd2-466d-8be9-1997f63e196f/images/b029b5a6-9eb3-4a34-ad03-1ac4386e8c7c/71252c8f-68a9-495f-b5a6-4e8e035b56ea.lease

I'm no SELinux policy expert, so I'm not sure what exactly is wrong. The situation is that the VM image file is stored on an NFS file server (in this case, configured using NFSv3). Both the client and the server are fc17. The error occurs when trying to start the VM. The version of oVirt I am using is a recent nightly build (ovirt-engine - 3.1.0-3.1345126685.git7649eed.fc17). I'd be making a wild guess that the sanlock process doesn't have rights to open some nfs resources but I'm way over the end of my skis.

Brian

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
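The workaround in this thread hinges on whether lock_manager is active in /etc/libvirt/qemu.conf. A minimal sketch for checking that before and after editing the file follows; `active_lock_manager` is a hypothetical helper name, and the sketch assumes the setting is written as key=value with optional double quotes around the value.

```shell
# Print the active (uncommented) lock_manager value from a qemu.conf-style
# file. Prints nothing when the setting is absent or commented out.
active_lock_manager() {
    sed -n 's/^[[:space:]]*lock_manager[[:space:]]*=[[:space:]]*"\{0,1\}\([^"[:space:]]*\)"\{0,1\}.*$/\1/p' "$1" | tail -n 1
}
```

Running active_lock_manager /etc/libvirt/qemu.conf on the host would print sanlock while the entry is active and nothing once it has been commented out. libvirt and vdsm still need to be restarted for a change to take effect, as noted in the thread.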
Re: [Users] local storage problems
Hi Christian, any chance you can provide full vdsm log as well as the following output: getenforce ls -lZ /media/ceva2/Ovirt/Storage/ - Original Message - From: Alexandre Santos santosa...@gmail.com To: Cristian Falcas cristi.fal...@gmail.com Cc: Vered Volansky ve...@redhat.com, users@ovirt.org Sent: Tuesday, October 23, 2012 11:54:48 AM Subject: Re: [Users] local storage problems 2012/10/23 Cristian Falcas cristi.fal...@gmail.com Hi, I didn't do any sanlock configuration or touch it in any way. Is this required for local storage? Bellow you can find the info you asked for: Package versions: sanlock-lib-2.4-2.fc17.x86_64 sanlock-python-2.4-2.fc17.x86_64 sanlock-2.4-2.fc17.x86_64 libvirt-lock-sanlock-0.10.2-1.fc17.x86_64 vdsm-4.10.1-0.52.git1c10d06.fc17.x86_64 Dir permissions: ls -la /media/ceva2/Ovirt/Storage/ total 12 drwxrwxr-x. 3 vdsm kvm 4096 Oct 23 09:37 . drwxr-xr-x. 4 vdsm kvm 4096 Oct 23 09:00 .. drwxr-xr-x. 4 vdsm kvm 4096 Oct 23 09:37 a6d7a944-4807-474d-89b0-df9e74dd2cd0 Sanlock errors: 2012-10-23 10:31:49+0300 891 [2813]: s20 lockspace a6d7a944-4807-474d-89b0-df9e74dd2cd0:1:/rhev/data-center/mnt/_media_ceva2_Ovirt_Storage/a6d7a944-4807-474d-89b0-df9e74dd2cd0/dom_md/ids:0 2012-10-23 10:31:49+0300 891 [16423]: open error -13 /rhev/data-center/mnt/_media_ceva2_Ovirt_Storage/a6d7a944-4807-474d-89b0-df9e74dd2cd0/dom_md/ids 2012-10-23 10:31:49+0300 891 [16423]: s20 open_disk /rhev/data-center/mnt/_media_ceva2_Ovirt_Storage/a6d7a944-4807-474d-89b0-df9e74dd2cd0/dom_md/ids error -13 Best regards, On Tue, Oct 23, 2012 at 12:01 PM, Vered Volansky ve...@redhat.com wrote: Hi Cristian, Can you please share your SanLock and VDSM versions and the permissions of the directory you're attempting to use for the storage domain. It doesn't happen in my environment, and I'm trying to understand if it's something in your environment or an issue in a specific build. 
Regards, Vered - Original Message - From: Cristian Falcas cristi.fal...@gmail.com To: users@ovirt.org Sent: Tuesday, October 23, 2012 9:26:27 AM Subject: [Users] local storage problems I'm trying to play with engine and vdsm on the same machine, but I can't add a local storage. This is the last error from vdsm: 7e6c31db-abc0-4d6c-baff-d78e9c47d7a2::ERROR::2012-10-23 10:22:02,454::task::833::TaskManager.Task::(_setError) Task=`7e6c31db-abc0-4d6c-baff-d78e9c47d7a2`::Unexpected error Traceback (most recent call last): File /usr/share/vdsm/storage/task.py, line 840, in _run return fn(*args, **kargs) File /usr/share/vdsm/storage/task.py, line 307, in run return self.cmd(*self.argslist, **self.argsdict) File /usr/share/vdsm/storage/sp.py, line 269, in startSpm self._upgradePool(expectedDomVersion, __securityOverride=True) File /usr/share/vdsm/storage/securable.py, line 68, in wrapper return f(self, *args, **kwargs) File /usr/share/vdsm/storage/sp.py, line 409, in _upgradePool self._convertDomain(self.masterDomain, str(targetDomVersion)) File /usr/share/vdsm/storage/sp.py, line 1027, in _convertDomain domain.getRealDomain(), isMsd, targetFormat) File /usr/share/vdsm/storage/imageRepository/formatConverter.py, line 252, in convert converter(repoPath, hostId, imageRepo, isMsd) File /usr/share/vdsm/storage/imageRepository/formatConverter.py, line 99, in v3DomainConverter newClusterLock.acquireHostId(hostId, async=False) File /usr/share/vdsm/storage/safelease.py, line 175, in acquireHostId raise se.AcquireHostIdFailure(self._sdUUID, e) AcquireHostIdFailure: Cannot acquire host id: ('a6d7a944-4807-474d-89b0-df9e74dd2cd0', SanlockException(19, 'Sanlock lockspace add failure', 'No such device')) If I make the DC a NFS one, I can add nfs paths. Best regards, ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users Is SELinux sanlock flag set to 1 (on)? 
Alex ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
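The sanlock log quoted in this thread reports "open error -13"; on Linux errno 13 is EACCES (permission denied), which is why the replies ask about directory permissions and SELinux. A minimal sketch for pulling such lines out of a sanlock log is below. `sanlock_open_errors` is a hypothetical helper name, and the sketch assumes the log line layout shown in the thread.

```shell
# Print "<errno> <path>" for every "open error" line in a sanlock log.
sanlock_open_errors() {
    awk '/ open error -[0-9]+ / {
        for (i = 1; i < NF; i++) {
            if ($i == "error") { print $(i + 1), $(i + 2); break }
        }
    }' "$1"
}
```

Usage would look like: sanlock_open_errors /var/log/sanlock.log. Each reported path can then be checked with ls -lZ, together with getenforce, as requested earlier in the thread.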
Re: [Users] (no subject)
- Original Message - From: Neil nwilson...@gmail.com To: Haim Ateya hat...@redhat.com Cc: users@ovirt.org Sent: Monday, October 22, 2012 4:33:11 PM Subject: Re: [Users] (no subject) Hi Haim, On Mon, Oct 22, 2012 at 3:46 PM, Haim Ateya hat...@redhat.com wrote: please attach installation logs from node under /tmp/{vdsm-bootstrap*). I've checked all of the other logs and they are all the same, so I've only included the latest one... [root@node03 tmp]# cat vds_installer.100915.log Mon, 22 Oct 2012 12:12:04 DEBUG Start VDS Installation Mon, 22 Oct 2012 12:12:04 DEBUGget_id_line: read line CentOS release 6.3 (Final). Mon, 22 Oct 2012 12:12:04 DEBUGlsb_release: input line CentOS release 6.3 (Final). Mon, 22 Oct 2012 12:12:04 DEBUGlsb_release: return: CentOS. Mon, 22 Oct 2012 12:12:04 DEBUGBSTRAP component='INSTALLER' status='OK' message='Test platform succeeded'/ Mon, 22 Oct 2012 12:12:04 DEBUGBSTRAP component='INSTALLER LIB' status='OK' message='Install library already exists'/ Mon, 22 Oct 2012 12:12:04 DEBUGtrying to fetch vds_bootstrap.py script cmd = '/usr/bin/curl -s -k -w %{http_code} -o /tmp/vds_bootstrap_06a0a4ee-8935-4743-8c25-acc13971047b.py http://backup.blabla.com/vds_bootstrap.py' Mon, 22 Oct 2012 12:12:04 DEBUGBSTRAP component='INSTALLER' status='FAIL' message='vds_bootstrap.py download failed. Pathname could not be resolved (verify computer/domain name).'/ [root@node03 tmp]# also run the following command from your engine machine: psql -U postgres engine -c 'select * from vds_static;' Output is attached as output.txt, shows all three hosts along with the correct ip's, node03 has no power management as of yet. please make sure 'node02.blabla.com' is accessible from host side (first try ping), later try with http request (simply issue wget http://node02.blabla.com:80/Components/vds/vds_bootstrap.py) Both tests worked fine... node02 is one of my hosts, it used to be my old engine but was changed, backup.blabla.com is my new working engine. 
So the problem is that in your engine, VdcBootstrapUrl points to your old engine address (node02.blabla.com:80) instead of your new one (backup.blabla.com). This confuses the host: it tries to download the script from the old engine and fails, since the files are not there. I don't really know how you got to this state, but in order to resolve it, please apply the following query:

psql -U postgres engine -c "update vdc_options set option_value = 'backup.blabla.com:80/Components/vds/' where option_name = 'VdcBootStrapUrl'"

[root@node03 ~]# wget http://backup.blabla.com:80/Components/vds/vds_bootstrap.py
Connecting to backup.blabla.com|10.0.2.31|:80... connected.
HTTP request sent, awaiting response... 200 OK
Saving to: 'vds_bootstrap.py'
2012-10-22 16:08:52 (72.6 MB/s) - 'vds_bootstrap.py' .

Please shout if you need any further info. Thank you!

Neil Wilson.

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
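psql's -c option takes the whole SQL statement as a single argument, so the update from this message has to reach it as one quoted string. A minimal sketch for building that string is below; `bootstrap_url_sql` is a hypothetical helper name, and the table, column and option names are copied from the query in this thread rather than checked against a particular engine schema.

```shell
# Build the UPDATE statement for the bootstrap URL option.
# Refuses values containing a single quote, which would break the SQL literal.
bootstrap_url_sql() {
    case $1 in
        *"'"*)
            echo "value must not contain a single quote" >&2
            return 1
            ;;
    esac
    printf "update vdc_options set option_value = '%s' where option_name = 'VdcBootStrapUrl';\n" "$1"
}
```

Usage would look like: psql -U postgres engine -c "$(bootstrap_url_sql 'backup.blabla.com:80/Components/vds/')". It may be worth running a select on the same option_name first to see the current value before changing it.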
Re: [Users] (no subject)
please attach installation logs from node under /tmp/{vdsm-bootstrap*). also run the following command from your engine machine: psql -U postgres engine -c 'select * from vds_static;' please make sure 'node02.blabla.com' is accessible from host side (first try ping), later try with http request (simply issue wget http://node02.blabla.com:80/Components/vds/vds_bootstrap.py). - Original Message - From: Neil nwilson...@gmail.com To: users@ovirt.org Sent: Friday, October 19, 2012 2:03:35 PM Subject: [Users] (no subject) Hi guys, Shew! Been a busy oVirt week for me :) I'm trying to add a 3rd host into my datacentre but each time the install is failing. In the front end I'm seeing could not resolve host verify computer/domain name. Below is the log file, with the domain name changed to blabla for privacy reasons 2012-10-19 13:32:11,275 INFO [org.ovirt.engine.core.utils.hostinstall.MinaInstallWrapper] (pool-3-thread-50) return true 2012-10-19 13:32:11,275 INFO [org.ovirt.engine.core.bll.VdsInstaller] (pool-3-thread-50) [14c49c46] Installation of 10.0.2.23. Executing installation stage. (Stage: Running first installation script on Host) 2012-10-19 13:32:11,275 INFO [org.ovirt.engine.core.bll.VdsInstaller] (pool-3-thread-50) [14c49c46] Installation of 10.0.2.23. Sending SSH Command chmod +x /tmp/vds_installer_52c69f58-90f3-4878-9350-55fa23d5ae53.py; /tmp/vds_installer_52c69f58-90f3-4878-9350-55fa23d5ae53.py -c 'ssl=true;management_port=54321' -O 'blabla ' -t 2012-10-19T11:32:10 -p 80 -b http://node02.blabla.com:80/Components/vds/ http://node02.blabla.com:80/Components/vds/ 10.0.2.23 52c69f58-90f3-4878-9350-55fa23d5ae53 False. 
(Stage: Running first installation script on Host) 2012-10-19 13:32:11,276 INFO [org.ovirt.engine.core.utils.hostinstall.MinaInstallWrapper] (pool-3-thread-50) Invoking chmod +x /tmp/vds_installer_52c69f58-90f3-4878-9350-55fa23d5ae53.py; /tmp/vds_installer_52c69f58-90f3-4878-9350-55fa23d5ae53.py -c 'ssl=true;management_port=54321' -O 'blabla' -t 2012-10-19T11:32:10 -p 80 -b http://node02.blabla.com:80/Components/vds/ http://node02.blabla.com:80/Components/vds/ 10.0.2.23 52c69f58-90f3-4878-9350-55fa23d5ae53 False on 10.0.2.23 2012-10-19 13:32:11,328 INFO [org.ovirt.engine.core.bll.VdsInstaller] (pool-3-thread-50) [14c49c46] Installation of 10.0.2.23. Received message: BSTRAP component='INSTALLER' status='OK' message='Test platform succeeded'/ BSTRAP component='INSTALLER LIB' status='OK' message='Install library already exists'/ BSTRAP component='INSTALLER' status='FAIL' message='vds_bootstrap.py download failed. Pathname could not be resolved (verify computer/domain name).'/ . FYI. (Stage: Running first installation script on Host) 2012-10-19 13:32:11,360 INFO [org.ovirt.engine.core.utils.hostinstall.MinaInstallWrapper] (pool-3-thread-50) RunSSHCommand returns true 2012-10-19 13:32:11,361 INFO [org.ovirt.engine.core.bll.VdsInstaller] (pool-3-thread-50) [14c49c46] RunScript ended:true 2012-10-19 13:32:11,361 ERROR [org.ovirt.engine.core.bll.VdsInstaller] (pool-3-thread-50) [14c49c46] Installation of 10.0.2.23. No meaningful response received from Host. 
(Stage: Running first installation script on Host) 2012-10-19 13:32:11,362 INFO [org.ovirt.engine.core.bll.InstallVdsCommand] (pool-3-thread-50) [14c49c46] After Installation pool-3-thread-50 2012-10-19 13:32:11,362 INFO [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] (pool-3-thread-50) [14c49c46] START, SetVdsStatusVDSCommand(vdsId = 67d9a20c-f811-11e1-9004-bb531c164d07, status=InstallFailed, nonOperationalReason=NONE), log id: 50fafc10 2012-10-19 13:32:11,378 INFO [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] (pool-3-thread-50) [14c49c46] FINISH, SetVdsStatusVDSCommand, log id: 50fafc10 My engine is engine01.blabla.com My current workings hosts are node01.blabla.com and node02.blabla.com and I'm trying to add node03.blabla.com I can ping node03 as well as the full FQDN node03.blabla.com from the engine as well as the other hosts, there iptables is completely disabled on node03 as well. Node03 has full internet access, it's DNS is working and I can ping all the hosts and engine01.blabla.com etc so I'm fairly certain it's no a networking/DNS issue. A bit of background, I've recently upgraded from an early ovirt 3.1 release, changed the hostname, as well as recreate the SSL certificate on my engine(using Juan's steps). What I do find odd is in the engine log I'm seeing the mention of node02(which is 10.0.2.22), even though I'm specifically trying to install node03(10.0.2.23). I have tried removing the host and re-adding it without any success. On node03 I'm seeing the following error in my /var/log/messages... Oct 19 13:31:32 node03 vdsm root ERROR client ('10.0.2.31', 57654)#012Traceback (most recent call last):#012 File /usr/lib64/python2.6/SocketServer.py, line 560, in
Re: [Users] ITA-2967 URGENT: ovirt Node turns status to non operational STORAGE_DOMAIN_UNREACHABLE
- Original Message - From: Itamar Heim ih...@redhat.com To: Sven Knohsalla s.knohsa...@netbiscuits.com Cc: Haim Ateya hat...@redhat.com, users@ovirt.org, Omer Frenkel ofren...@redhat.com Sent: Sunday, October 21, 2012 11:05:56 AM Subject: Re: AW: [Users] ITA-2967 URGENT: ovirt Node turns status to non operational STORAGE_DOMAIN_UNREACHABLE On 10/19/2012 06:43 PM, Sven Knohsalla wrote: Hi Haim, I wanted to wait to send this mail, until the problem occurs again. Disabled live-migration for the cluster first, to make sure the second node wouldn't have the same problem, when migration is started. It seems the problem isn't caused by migration, as I did run in the same error again today. Log snippet Webgui: 2012-Oct-19,04:28:13 Host deovn-a01 cannot access one of the Storage Domains attached to it, or the Data Center object. Setting Host state to Non-Operational. -- all VMs are running properly, although the engine tells something different. Even the VM status in engine gui is wrong, as it's showing vmname reboot in progress, but there is no reboot initiated (ssh/rdp connections, file operations are working fine) Engine log says for this period: cat /var/log/ovirt-engine/engine.log | grep 04:2* 2012-10-19 04:23:13,773 WARN [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (QuartzScheduler_Worker-94) domain ccaa4e7a-fa89-46a6-a6e0-07dfe78d1bd5 in problem. 
vds: deovn-a01 2012-10-19 04:28:13,775 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (QuartzScheduler_Worker-1) starting ProcessDomainRecovery for domain ccaa4e7a-fa89-46a6-a6e0-07dfe78d1bd5 2012-10-19 04:28:13,799 WARN [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (QuartzScheduler_Worker-1) vds deovn-a01 reported domain ccaa4e7a-fa89-46a6-a6e0-07dfe78d1bd5:DE-VM-SYSTEM as in problem, moving the vds to status NonOperational 2012-10-19 04:28:13,882 INFO [org.ovirt.engine.core.bll.SetNonOperationalVdsCommand] (QuartzScheduler_Worker-1) Running command: SetNonOperationalVdsCommand internal: true. Entities affected : ID: 66b546c2-ae62-11e1-b734-5254005cbe44 Type: VDS 2012-10-19 04:28:13,884 INFO [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] (QuartzScheduler_Worker-1) START, SetVdsStatusVDSCommand(vdsId = 66b546c2-ae62-11e1-b734-5254005cbe44, status=NonOperational, nonOperationalReason=STORAGE_DOMAIN_UNREACHABLE), log id: daad8bd 2012-10-19 04:28:13,888 INFO [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] (QuartzScheduler_Worker-1) FINISH, SetVdsStatusVDSCommand, log id: daad8bd 2012-10-19 04:28:19,690 WARN [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (QuartzScheduler_Worker-38) domain ccaa4e7a-fa89-46a6-a6e0-07dfe78d1bd5 in problem. vds: deovn-a01 I think the first output is important: 2012-10-19 04:23:13,773 WARN [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (QuartzScheduler_Worker-94) domain ccaa4e7a-fa89-46a6-a6e0-07dfe78d1bd5 in problem. vds: deovn-a01 -- which problem? 
There's no debug info during that time period to consider where tha problem could come from :/ look to the lines above: 2012-10-19 04:28:13,799 WARN [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (QuartzScheduler_Worker-1) vds deovn-a01 reported domain ccaa4e7a-fa89-46a6-a6e0-07dfe78d1bd5:DE-VM-SYSTEM as in problem, moving the vds to status NonOperational 2012-10-19 04:28:13,882 INFO [org.ovirt.engine.core.bll.SetNonOperationalVdsCommand] (QuartzScheduler_Worker-1) Running command: SetNonOperationalVdsCommand internal: true. Entities affected : ID: 66b546c2-ae62-11e1-b734-5254005cbe44 Type: VDS the problem was with the storage domain. On affected node side I did grep /var/log/vdsm for ERROR: Thread-254302::ERROR::2012-10-12 16:01:11,359::vm::950::vm.Vm::(getStats) vmId=`537eea7c-d12c-461f-adfb-6a1f2ebff4fb`::Error fetching vm stats And 20 more of the same type with same vmId, I'm sure this is an aftereffect s the engine can't tell the status of the VMs. Can you give me an advice where I can find more information to solve this issue? Or perhaps a scenario I can try? what's the status of the VMs right now ? can you please provide the output of the following commands: virsh -r list vdsClient -s 0 list table please attach full engine, vdsm and libvirt logs (and if possible, qemu log file under /var/log/libvirt/qemu/). I have another curiosity I wanted to ask for in a new mail, but perhaps this has something to do with my issue: The elected SPM is not part of this cluster, just has 2 storage paths (multipath) to the SAN. The problematic cluster has 4 storage paths(bigger hypervisors), and all storage paths are connected successfully . I would like to see repoStats reports within the node logs (vdsm.log). Does the SPM detects
Re: [Users] Images not appearing in ISO domain
- Original Message - From: Neil nwilson...@gmail.com To: users@ovirt.org Sent: Tuesday, October 16, 2012 12:03:21 PM Subject: [Users] Images not appearing in ISO domain

Hi guys, I've uploaded images to my ISO domain and I can see the .iso files under the full path /iso-domain/a89acef7-5766-4a61-9176-f7870e4f5989/images/----/ with vdsm:vdsm as the owner, but inside my ovirt-engine the images don't show up. Under Storages I can see my ISO domain is listed as active (green) and it shows my free space correctly, but if I go to 'Images' nothing shows up. Under Data Center, Default shows as active as well. I've tried putting it into maintenance and then activating it, which succeeds, and yet my images still don't show. I uploaded the images as follows:

cd /root/ISOs/
engine-iso-uploader -i ISO upload *

Hi Neil, please print the output of the following commands:

on NFS server:
- ls -la /iso-domain/a89acef7-5766-4a61-9176-f7870e4f5989/images/----/

on host (vds):
- vdsClient -s 0 getIsoList `vdsClient -s 0 getConnectedStoragePoolsList`

...which completed successfully with no errors. Any ideas? Will gladly send through whichever logs are required. Thanks!

Regards.
Neil Wilson.

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] ITA-2967 URGENT: ovirt Node turns status to non operational STORAGE_DOMAIN_UNREACHABLE
Hi Sven, can you attach full logs from the second host (problematic one)? i guess its deovn-a01. 2012-10-15 11:13:38,197 WARN [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] (QuartzScheduler_Worker-33) domain ccaa4e7a-fa89-46a6-a6e0-07dfe78d1bd5 in problem. vds: deovn-a01 - Original Message - From: Omer Frenkel ofren...@redhat.com To: Itamar Heim ih...@redhat.com, Sven Knohsalla s.knohsa...@netbiscuits.com Cc: users@ovirt.org Sent: Tuesday, October 16, 2012 2:02:50 PM Subject: Re: [Users] ITA-2967 URGENT: ovirt Node turns status to non operational STORAGE_DOMAIN_UNREACHABLE - Original Message - From: Itamar Heim ih...@redhat.com To: Sven Knohsalla s.knohsa...@netbiscuits.com Cc: users@ovirt.org Sent: Monday, October 15, 2012 8:36:07 PM Subject: Re: [Users] ITA-2967 URGENT: ovirt Node turns status to non operational STORAGE_DOMAIN_UNREACHABLE On 10/15/2012 03:56 PM, Sven Knohsalla wrote: Hi, sometimes one hypervisors status turns to „Non-operational“ with error “STORAGE_DOMAIN_UNREACHABLE” and the live-migration (activated for all VMs) is starting. I don’t currently know why the ovirt-node turns to this status, because the connected iSCSI SAN is available all the time(checked via iscsi session and lsblk), I’m also able to r/w on the SAN during that time. We can simply activate this ovirt-node and it turns up again. The migration process is running from scratch and hitting the some error àReboot of ovirt-node necessary! When a hypervisor turns to “non-operational” status, the live migration is starting and tries to migrate ~25 VMs (~ 100 GB RAM to migrate). During that process the network workload goes 100%, some VMs will be migrated, then the destination host also turns to “non-operational” status with error “STORAGE_DOMAIN_UNREACHABLE”. Many VMs are still running on their origin host, some are paused, some are showing “migration from” status. After a reboot of the origin host, the VMs turns of course into unknown state. 
So the whole cluster is down :/

For this problem I have some questions:

- Does ovirt engine just use the ovirt-mgmt network for migration/HA?
yes.

- If so, is there any possibility to *add*/switch a network for migration/HA?
you can bond, not yet add another one.

- Is the kind of way we are using the live-migration not recommended?

- Which engine module checks the availability of the storage domain for the ovirt-nodes?
the engine.

- Is there any timeout/cache option we can set/increase to avoid this problem?
well, not clear what the problem is. also, vdsm is supposed to throttle live migration to 3 vm's in parallel iirc. also, you can at cluster level configure to not live migrate VMs on non-operational status.

- Is there any known problem with the versions we are using? (Migration to ovirt-engine 3.1 is not possible atm)
oh, the cluster level migration policy on non operational may be a 3.1 feature, not sure.
AFAIR, it's in 3.0

- Is it possible to modify the migration queue to just migrate a max. of 4 VMs at the same time for example?
yes, there is a vdsm config for that. i am pretty sure 3 is the default though?

ovirt-engine:
FC 16: 3.3.6-3.fc16.x86_64
Engine: 3.0.0_0001-1.6.fc16
KVM based VM: 2 vCPU, 4 GB RAM
1 NIC for ssh/https access
1 NIC for ovirtmgmt network access
engine source: dreyou repo

ovirt-node:
Node: 2.3.0
2 bonded NICs - Frontend Network
4 Multipath NICs - SAN connection

Attached some relevant logfiles. Thanks in advance, I really appreciate your help!
Best, Sven Knohsalla | System Administration
Office +49 631 68036 433 | Fax +49 631 68036 111 | e-mail s.knohsa...@netbiscuits.com | Skype: Netbiscuits.admin
Netbiscuits GmbH | Europaallee 10 | 67657 | GERMANY

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
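To see how often each host reported a storage domain "in problem" before it went Non-Operational, the engine.log warnings quoted in this thread can be tallied. This is a minimal sketch: `domain_problem_counts` is a hypothetical helper name, and it assumes the "domain <uuid> in problem. vds: <host>" wording shown in the log excerpts here.

```shell
# Count "domain <uuid> in problem. vds: <host>" warnings per host and domain.
# Output lines have the form: <count> <host> <domain-uuid>
domain_problem_counts() {
    awk '
        / in problem\. vds: / {
            dom = ""; host = ""
            for (i = 1; i < NF; i++) {
                if ($i == "domain") dom = $(i + 1)
                if ($i == "vds:")   host = $(i + 1)
            }
            if (dom != "" && host != "") count[host " " dom]++
        }
        END { for (k in count) print count[k], k }
    ' "$1" | sort -k2
}
```

Usage would look like: domain_problem_counts /var/log/ovirt-engine/engine.log. Comparing the timestamps of those warnings with the repoStats entries in vdsm.log on the affected node may help narrow down when the domain check started failing.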
Re: [Users] Can't start a VM - sanlock permission denied
Hi Alex, can you please provide some logs (engine + vdsm)? Also, can you verify that the sanlock service is running on the node?

- Original Message - From: Alexandre Santos santosa...@gmail.com To: users@ovirt.org Sent: Thursday, October 11, 2012 12:06:19 PM Subject: Re: [Users] Can't start a VM - sanlock permission denied

It's me again, now less sleepy :-) What I would like to know is whether this sanlock permission issue lies within the oVirt engine, on the NFS server, or on the oVirt node (it's a 2.5.3 from nightly). On the latter, I can't access the console to issue those commands - I don't even know if it's running SELinux at all :-)

Alex

2012/10/11 Alexandre Santos santosa...@gmail.com

Hi, I managed to install the new oVirt Node on an 8 GB USB pen. I created the NFS share, and when I try to run the VM I get: VM ubuntu is down. Exit message: internal error Failed to open socket to sanlock daemon: Permission denied. I did the setsebool mentioned on the wiki on the engine but still get the error. Is it necessary to configure it anywhere else?

Alex

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Error importing export storage
On 08/28/2012 04:34 PM, зоррыч wrote: /usr/bin/sudo -n /bin/mount -t nfs -o soft,nosharecache,timeo=600,retrans=6 10.1.20.2:/home/nfs4 /rhev/data-center/mnt/10.1.20.2:_home_nfs4 please attach both engine and vdsm logs (full, compressed). also, please execute the following commands from host (vds): 1) /usr/bin/sudo -n /bin/mount -t nfs -o soft,nosharecache,timeo=600,retrans=6 10.1.20.2:/home/nfs4 /rhev/data-center/mnt/10.1.20.2:_home_nfs4 2) vdsClient -s 0 getStorageDomainsList 3) mount * if you are working in a non-secure mode, try vdsClient 0 (without the -s). ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
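The mount target in the command above appears to be derived from the export by replacing each "/" with "_" under /rhev/data-center/mnt/. A minimal sketch of that mapping is below, which can help when checking the output of mount for the expected entry. `vdsm_mount_point` is a hypothetical helper name, and the rule is inferred only from the paths quoted on this list, so vdsm may treat other characters differently.

```shell
# Derive the mount point vdsm appears to use for a storage path,
# e.g. 10.1.20.2:/home/nfs4 -> /rhev/data-center/mnt/10.1.20.2:_home_nfs4
vdsm_mount_point() {
    printf '/rhev/data-center/mnt/%s\n' "$(printf '%s' "$1" | tr '/' '_')"
}
```

Usage would look like: mount | grep "$(vdsm_mount_point 10.1.20.2:/home/nfs4)"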
Re: [Users] Error importing export storage
On 08/28/2012 04:34 PM, зоррыч wrote:

Hi

Trying to import an export storage domain created earlier, but I get this error:

hi,

please provide full vdsm and engine logs (compressed). also, run the following commands on the host (vds):

- /usr/bin/sudo -n /bin/mount -t nfs -o soft,nosharecache,timeo=600,retrans=6 10.1.20.2:/home/nfs4 /rhev/data-center/mnt/10.1.20.2:_home_nfs4
- mount
- vdsClient -s 0 getStorageDomainsList (if you are working in non-secure mode, don't use the -s).
- vdsClient -s 0 getStorageDomainInfo b23c7ab6-b1d4-4888-8d4a-adc78e61db38
- check permissions on the storage server (NFS): ls -l /home/nfs4 (should be vdsm:kvm).

another thing - it appears that the domain is already attached to a storage pool (as I understand from the domain metadata). is it possible that you haven't detached this domain from its previous domain?

There is no storage domain under the specified path. Please check path.

Vdsm.logs:

Thread-99790::DEBUG::2012-08-28 09:17:26,010::task::568::TaskManager.Task::(_updateState) Task=`ac99ba99-f55d-4562-822e-0286ab30566e`::moving from state init -> state preparing
Thread-99790::INFO::2012-08-28 09:17:26,010::logUtils::37::dispatcher::(wrapper) Run and protect: repoStats(options=None)
Thread-99790::INFO::2012-08-28 09:17:26,010::logUtils::39::dispatcher::(wrapper) Run and protect: repoStats, Return response: {'b0a0e76b-f983-405b-a0af-d0314a1c381a': {'delay': '0.00292301177979', 'lastCheck': 1346159839.788852, 'code': 0, 'valid': True}}
Thread-99790::DEBUG::2012-08-28 09:17:26,011::task::1151::TaskManager.Task::(prepare) Task=`ac99ba99-f55d-4562-822e-0286ab30566e`::finished: {'b0a0e76b-f983-405b-a0af-d0314a1c381a': {'delay': '0.00292301177979', 'lastCheck': 1346159839.788852, 'code': 0, 'valid': True}}
Thread-99790::DEBUG::2012-08-28 09:17:26,011::task::568::TaskManager.Task::(_updateState) Task=`ac99ba99-f55d-4562-822e-0286ab30566e`::moving from state preparing -> state finished
Thread-99790::DEBUG::2012-08-28 09:17:26,011::resourceManager::809::ResourceManager.Owner::(releaseAll) Owner.releaseAll requests {} resources {}
Thread-99790::DEBUG::2012-08-28 09:17:26,011::resourceManager::844::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {}
Thread-99790::DEBUG::2012-08-28 09:17:26,011::task::957::TaskManager.Task::(_decref) Task=`ac99ba99-f55d-4562-822e-0286ab30566e`::ref 0 aborting False
Thread-99792::DEBUG::2012-08-28 09:17:26,473::BindingXMLRPC::160::vds::(wrapper) [10.1.20.2]
Thread-99792::DEBUG::2012-08-28 09:17:26,474::task::568::TaskManager.Task::(_updateState) Task=`e55145ac-1052-454b-92ec-a9eb981c1b04`::moving from state init -> state preparing
Thread-99792::INFO::2012-08-28 09:17:26,474::logUtils::37::dispatcher::(wrapper) Run and protect: validateStorageServerConnection(domType=1, spUUID='----', conList=[{'connection': '10.1.20.2:/home/nfs4', 'iqn': '', 'portal': '', 'user': '', 'password': '**', 'id': '----', 'port': ''}], options=None)
Thread-99792::INFO::2012-08-28 09:17:26,474::logUtils::39::dispatcher::(wrapper) Run and protect: validateStorageServerConnection, Return response: {'statuslist': [{'status': 0, 'id': '----'}]}
Thread-99792::DEBUG::2012-08-28 09:17:26,474::task::1151::TaskManager.Task::(prepare) Task=`e55145ac-1052-454b-92ec-a9eb981c1b04`::finished: {'statuslist': [{'status': 0, 'id': '----'}]}
Thread-99792::DEBUG::2012-08-28 09:17:26,474::task::568::TaskManager.Task::(_updateState) Task=`e55145ac-1052-454b-92ec-a9eb981c1b04`::moving from state preparing -> state finished
Thread-99792::DEBUG::2012-08-28 09:17:26,475::resourceManager::809::ResourceManager.Owner::(releaseAll) Owner.releaseAll requests {} resources {}
Thread-99792::DEBUG::2012-08-28 09:17:26,475::resourceManager::844::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {}
Thread-99792::DEBUG::2012-08-28 09:17:26,475::task::957::TaskManager.Task::(_decref) Task=`e55145ac-1052-454b-92ec-a9eb981c1b04`::ref 0 aborting False
Thread-99793::DEBUG::2012-08-28 09:17:26,494::BindingXMLRPC::160::vds::(wrapper) [10.1.20.2]
Thread-99793::DEBUG::2012-08-28 09:17:26,495::task::568::TaskManager.Task::(_updateState) Task=`700181ad-b9d4-411b-bfbc-25a28aa288e2`::moving from state init -> state preparing
Thread-99793::INFO::2012-08-28 09:17:26,503::logUtils::37::dispatcher::(wrapper) Run and protect: connectStorageServer(domType=1, spUUID='----', conList=[{'connection': '10.1.20.2:/home/nfs4', 'iqn': '', 'portal': '', 'user': '', 'password': '**', 'id': '----', 'port': ''}], options=None)
Thread-99793::DEBUG::2012-08-28 09:17:26,505::__init__::1164::Storage.Misc.excCmd::(_log) '/usr/bin/sudo -n /bin/mount -t nfs -o soft,nosharecache,timeo=600,retrans=6 10.1.20.2:/home/nfs4
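The mount command in this message maps the export 10.1.20.2:/home/nfs4 to the local directory /rhev/data-center/mnt/10.1.20.2:_home_nfs4. Below is a minimal Python sketch of that naming, inferred only from this one example. The function name is made up for illustration and this is not vdsm's own code.

```python
MOUNT_ROOT = "/rhev/data-center/mnt"  # prefix seen in the mount command in this thread


def local_mount_point(remote_export):
    """Map 'server:/path' to the local mount directory.

    Assumption (inferred from the single example in the thread):
    every '/' in the export spec is replaced by '_'.
    """
    return MOUNT_ROOT + "/" + remote_export.replace("/", "_")


print(local_mount_point("10.1.20.2:/home/nfs4"))
```

Comparing this path with the output of plain `mount` is a quick way to see whether the export is already mounted where the host expects it.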
Re: [Users] [rhev 3] add new domain fails: Could not retrieve LUNs
On 08/23/2012 05:54 PM, Scotto Alberto wrote:

hi, can you attach the full vdsm log during the execution of the getDeviceList command?

Hi all,

I'm trying to configure an FCP storage domain on RHEV 3. I try to add a new domain from the console, but it can't find any LUNs:

Could not retrieve LUNs, please check your storage

Here is the output from /var/log/rhevm/rhevm.log:

2007-06-29 21:50:07,811 WARN [org.ovirt.engine.core.bll.GetConfigurationValueQuery] (http-0.0.0.0-8443-1) calling GetConfigurationValueQuery with null version, using default general for version
2007-06-29 21:50:07,911 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.GetDeviceListVDSCommand] (http-0.0.0.0-8443-1) START, GetDeviceListVDSCommand(vdsId = 7e077f4c-25d8-11dc-bbcb-001cc4c2469a, storageType=FCP), log id: 60bdafe6
2007-06-29 21:50:08,726 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (http-0.0.0.0-8443-1) Failed in GetDeviceListVDS method
2007-06-29 21:50:08,727 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (http-0.0.0.0-8443-1) Error code BlockDeviceActionError and error message VDSGenericException: VDSErrorException: Failed to GetDeviceListVDS, error = Error block device action: ()
2007-06-29 21:50:08,727 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (http-0.0.0.0-8443-1) Command org.ovirt.engine.core.vdsbroker.vdsbroker.GetDeviceListVDSCommand return value
Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.LUNListReturnForXmlRpc
lunList Null
mStatus Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.StatusForXmlRpc
mCode 600
mMessage Error block device action: ()
2007-06-29 21:50:08,727 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (http-0.0.0.0-8443-1) Vds: pittor06vhxd020
2007-06-29 21:50:08,727 ERROR [org.ovirt.engine.core.vdsbroker.VDSCommandBase] (http-0.0.0.0-8443-1) Command GetDeviceListVDS execution failed. Exception: VDSErrorException: VDSGenericException: VDSErrorException: Failed to GetDeviceListVDS, error = Error block device action: ()
2007-06-29 21:50:08,727 INFO [org.ovirt.engine.core.vdsbroker.vdsbroker.GetDeviceListVDSCommand] (http-0.0.0.0-8443-1) FINISH, GetDeviceListVDSCommand, log id: 60bdafe6
2007-06-29 21:50:08,727 ERROR [org.ovirt.engine.core.bll.storage.GetDeviceListQuery] (http-0.0.0.0-8443-1) Query GetDeviceListQuery failed. Exception message is VdcBLLException: org.ovirt.engine.core.vdsbroker.vdsbroker.VDSErrorException: VDSGenericException: VDSErrorException: Failed to GetDeviceListVDS, error = Error block device action: ()

--

First question: do LUNs have to be visible from RHEV-H or RHEV-M? Currently they are visible only from the hypervisor.

[root@pittor06vhxd020 log]# multipath -ll
3600601601cde1d0066b2fb054dece111 dm-2 DGC,RAID 5
size=650G features='1 queue_if_no_path' hwhandler='1 emc' wp=rw
|-+- policy='round-robin 0' prio=1 status=active
| |- 2:0:0:0 sda 8:0 active ready running
| |- 2:0:1:0 sdd 8:48 active ready running
| |- 3:0:0:0 sde 8:64 active ready running
| `- 3:0:1:0 sdf 8:80 active ready running
`-+- policy='round-robin 0' prio=0 status=enabled
  |- 2:0:2:0 sdb 8:16 active ready running
  |- 2:0:3:0 sdc 8:32 active ready running
  |- 3:0:2:0 sdg 8:96 active ready running
  `- 3:0:3:0 sdh 8:112 active ready running
3600508b100103533392020202025 dm-0 HP,LOGICAL VOLUME
size=205G features='1 queue_if_no_path' hwhandler='0' wp=rw
`-+- policy='round-robin 0' prio=1 status=active
  `- 0:0:1:0 cciss!c0d1 104:16 active ready running

--

Our SAN device is Clariion AX150. Is it compatible with ovirt?

vdsClient -s 0 getDeviceList gives me:

Error block device action: ()

Could it be due to SPM turned off? (I have only one host)

[root@pittor06vhxd020 log]# ps axu | grep -i spm
root 16068 0.0 0.0 7888 868 pts/1 R+ 00:04 0:00 grep -i spm

How can I turn it on? I know the command but I don't know what parameters to append:

spmStart spUUID prevID prevLVER recoveryMode scsiFencing maxHostID version

Thank you very much for any hints.

AS

Alberto Scotto
Blue
Via Cardinal Massaia, 83
10147 - Torino - ITALY
phone: +39 011 29100
al.sco...@reply.it
www.reply.it

--

The information transmitted is intended for the person or entity to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from any computer.

___
Users mailing list
Users@ovirt.org
Re: [Users] [rhev 3] add new domain fails: Could not retrieve LUNs
) Problem getting hbtl from device `cciss!c0d1`
Traceback (most recent call last):
  File "/usr/share/vdsm/storage/multipath.py", line 259, in pathListIter
  File "/usr/share/vdsm/storage/multipath.py", line 182, in getHBTL
OSError: [Errno 2] No such file or directory: '/sys/block/cciss!c0d1/device/scsi_disk/'
Thread-47346::ERROR::2007-06-30 00:37:11,029::task::868::TaskManager.Task::(_setError) Unexpected error
Traceback (most recent call last):
  File "/usr/share/vdsm/storage/task.py", line 876, in _run
  File "/usr/share/vdsm/storage/hsm.py", line 696, in public_getDeviceList
  File "/usr/share/vdsm/storage/hsm.py", line 759, in _getDeviceList
KeyError: 'hbtl'
Thread-47346::DEBUG::2007-06-30 00:37:11,030::task::495::TaskManager.Task::(_debug) Task 0be1d461-f8fa-4c20-861d-27fde8124408: Task._run: 0be1d461-f8fa-4c20-861d-27fde8124408 () {} failed - stopping task
Thread-47346::DEBUG::2007-06-30 00:37:11,030::task::495::TaskManager.Task::(_debug) Task 0be1d461-f8fa-4c20-861d-27fde8124408: stopping in state preparing (force False)
Thread-47346::DEBUG::2007-06-30 00:37:11,030::task::495::TaskManager.Task::(_debug) Task 0be1d461-f8fa-4c20-861d-27fde8124408: ref 1 aborting True
Thread-47346::INFO::2007-06-30 00:37:11,031::task::1171::TaskManager.Task::(prepare) aborting: Task is aborted: 'hbtl' - code 100
Thread-47346::DEBUG::2007-06-30 00:37:11,031::task::495::TaskManager.Task::(_debug) Task 0be1d461-f8fa-4c20-861d-27fde8124408: Prepare: aborted: 'hbtl'
Thread-47346::DEBUG::2007-06-30 00:37:11,031::task::495::TaskManager.Task::(_debug) Task 0be1d461-f8fa-4c20-861d-27fde8124408: ref 0 aborting True
Thread-47346::DEBUG::2007-06-30 00:37:11,032::task::495::TaskManager.Task::(_debug) Task 0be1d461-f8fa-4c20-861d-27fde8124408: Task._doAbort: force False
Thread-47346::DEBUG::2007-06-30 00:37:11,032::resourceManager::821::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {}
Thread-47346::DEBUG::2007-06-30 00:37:11,032::task::495::TaskManager.Task::(_debug) Task 0be1d461-f8fa-4c20-861d-27fde8124408: moving from state preparing -> state aborting
Thread-47346::DEBUG::2007-06-30 00:37:11,033::task::495::TaskManager.Task::(_debug) Task 0be1d461-f8fa-4c20-861d-27fde8124408: _aborting: recover policy none
Thread-47346::DEBUG::2007-06-30 00:37:11,033::task::495::TaskManager.Task::(_debug) Task 0be1d461-f8fa-4c20-861d-27fde8124408: moving from state aborting -> state failed
Thread-47346::DEBUG::2007-06-30 00:37:11,033::resourceManager::786::ResourceManager.Owner::(releaseAll) Owner.releaseAll requests {} resources {}
Thread-47346::DEBUG::2007-06-30 00:37:11,034::resourceManager::821::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {}
Thread-47346::ERROR::2007-06-30 00:37:11,034::dispatcher::106::Storage.Dispatcher.Protect::(run) 'hbtl'
Thread-47346::ERROR::2007-06-30 00:37:11,034::dispatcher::107::Storage.Dispatcher.Protect::(run)
Traceback (most recent call last):
  File "/usr/share/vdsm/storage/dispatcher.py", line 96, in run
  File "/usr/share/vdsm/storage/task.py", line 1178, in prepare
KeyError: 'hbtl'

Alberto Scotto
Blue Reply
Via Cardinal Massaia, 83
10147 - Torino - ITALY
phone: +39 011 29100
al.sco...@reply.it
www.reply.it

-----Original Message-----
From: Haim [mailto:hat...@redhat.com]
Sent: Thursday, August 23, 2012 17:00
To: Scotto Alberto
Cc: users@ovirt.org
Subject: Re: [Users] [rhev 3] add new domain fails: Could not retrieve LUNs

On 08/23/2012 05:54 PM, Scotto Alberto wrote:

hi, can you attach full vdsm log during the execution of getDeviceList command?
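The two tracebacks in this message suggest a chain: the sysfs lookup for `cciss!c0d1` fails with OSError, the device record ends up without an 'hbtl' entry, and a later dictionary lookup raises KeyError: 'hbtl'. Below is a small Python sketch of that failure pattern. All function names are hypothetical and the code is only an illustration of the pattern, not vdsm's implementation.

```python
import os


def get_hbtl(device_name, sys_block="/sys/block"):
    """Return the entries under <device>/device/scsi_disk, or None if absent."""
    path = os.path.join(sys_block, device_name, "device", "scsi_disk")
    try:
        return os.listdir(path)
    except OSError:
        # Devices such as cciss!c0d1 in the thread have no scsi_disk directory.
        return None


def describe_paths(device_names, sys_block="/sys/block"):
    """Build one record per device; 'hbtl' is only set when the lookup worked."""
    records = []
    for name in device_names:
        record = {"name": name}
        hbtl = get_hbtl(name, sys_block)
        if hbtl is not None:
            record["hbtl"] = hbtl
        records.append(record)
    return records


# A non-existent root stands in for the missing sysfs directory.
records = describe_paths(["cciss!c0d1"], sys_block="/nonexistent")

try:
    records[0]["hbtl"]  # unguarded lookup, as in the failing code path
except KeyError as err:
    print("KeyError:", err)

# A guarded lookup keeps the device listing alive.
print(records[0].get("hbtl", "unknown"))
```

If this reading is right, the FC LUNs themselves may be fine and the listing would be tripping over the local HP Smart Array volume, which is worth confirming against the full vdsm log.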
Re: [Users] Status code 500 when adding first storage domain?
hi,

thanks to Justin for providing me access to his environment, I managed to debug it live and find 2 issues:

- the first issue is in the web-admin portal, which for some reason has the timeout for the createStorageDomain command set to 1 minute (whereas the timeout should be 3 minutes and set in the engine-core).
- the second issue (more disturbing) is a deadlock in a super-vdsm operation (it goes into defunct state), which prevents reading/writing storage domain metadata as part of the createStorageDomain thread.

I will file 2 bz for those issues,

Haim

----- Original Message -----
From: Yaniv Kaul yk...@redhat.com
To: Justin Clift jcl...@redhat.com
Cc: users users@ovirt.org
Sent: Thursday, August 9, 2012 9:18:59 AM
Subject: Re: [Users] Status code 500 when adding first storage domain?

Engine log, and possibly VDSM log, would be most helpful.
Y.

Hi all,

Is anyone else experiencing this error when adding the first storage domain to a new oVirt 3.1 environment?

Error: A Request to the Server failed with the following Status Code: 500

Screenshot attached.

The environment here has 1 x host server and 1 x engine server. It's using the new oVirt 3.1 ISO, and the rpms obtained by following the Get It instructions. The storage is NFS, and is known to work.

Other useful data points:

* Forward and reverse DNS is 100% working for everything.
* On some level, the NFS mounting is working. oVirt even creates its uuid looking data structure inside the mount point before giving this error.
* If I log in to the host remotely, manually mounting the NFS share from the command line is flawless. Works immediately.
* In the above screenshot, the storage domain is even shown in the background behind the error message. oVirt seems to actively remove it after the error message though.

It kind of feels like a timeout thing, as after the initial mount and creation of UUID on the NFS share, there's a 30-ish period of inactivity then this error is displayed.

Any ideas?

Regards and best wishes,
Justin Clift

--
Aeolus Community Manager
http://www.aeolusproject.org
Re: [Users] Status code 500 when adding first storage domain?
opened
https://bugzilla.redhat.com/show_bug.cgi?id=847079
https://bugzilla.redhat.com/show_bug.cgi?id=847083
for tracking.

Haim

----- Original Message -----
From: Justin Clift jcl...@redhat.com
To: Haim Ateya hat...@redhat.com
Cc: Yaniv Kaul yk...@redhat.com, users users@ovirt.org
Sent: Thursday, August 9, 2012 6:58:14 PM
Subject: Re: [Users] Status code 500 when adding first storage domain?

On 10/08/2012, at 1:49 AM, Haim Ateya wrote:

hi, thanks to Justin for providing me access to his environment, I managed to debug it live and find 2 issues:

- first issue is for web-admin portal which for some reason has timeouts for createStorageDomain command set to 1 minute (where timeout should be 3 minutes and set in the engine-core).
- second issue (more disturbing) is a deadlock in super-vdsm (it goes into defunct state) operation which prevents reading/writing storage domain metadata as a part of createStorageDomain thread.

Thanks Haim, good work. :)

For the deadlock issue, any ideas for a workaround for now? It's not a killer, as there's a working oVirt 3.0 setup here which can be used for demoing instead. Just would have preferred 3.1 is all. (obviously)

Regards and best wishes,
Justin Clift

--
Aeolus Community Manager
http://www.aeolusproject.org
Re: [Users] Ovirt 3.1 Issue
Your engine FQDN is not resolvable from the node side.

Haim

On Jul 12, 2012, at 21:27, Jonathan Cox jonathanheber...@gmail.com wrote:

Hello, does anyone have any ideas about this error? I am setting up oVirt 3.1 on CentOS 6.3, and the nodes are also running CentOS 6.3. The engine appears to be OK, but when I try to add a node I get this error:

Failed to install Host Node1. Step: INSTALLER LIB; Details: deployUtil.py download failed. Pathname could not be resolved (verify computer/domain name)..

--
Jonathan H Cox
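One way to check Haim's diagnosis is to test name resolution from the node itself. Below is a minimal Python sketch using the standard `socket.getaddrinfo` call. The helper name `can_resolve` and the `resolver` parameter are introduced here only for illustration, and the host name in the usage note is a placeholder.

```python
import socket


def can_resolve(hostname, resolver=socket.getaddrinfo):
    """Return True if hostname resolves to at least one address.

    The resolver argument exists only so the check can be exercised
    without touching real DNS; by default the system resolver is used.
    """
    try:
        return len(resolver(hostname, None)) > 0
    except socket.gaierror:
        return False
```

Running `can_resolve("engine.example.com")` on the node, with the engine's real FQDN substituted, should return True before the host is added again. If it returns False, fixing DNS or adding an /etc/hosts entry on the node is the likely next step.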
Re: [Users] Can't add an NFS storage domain
Hi Ian,

You seem to have run into a bug in our advanced NFS options capability. To work around this issue, please use the 'Auto Negotiate' option for now; that should work. You can track this issue in:
https://bugzilla.redhat.com/show_bug.cgi?id=831469

Thanks,
Haim

----- Original Message -----
From: Ian Levesque i...@crystal.harvard.edu
To: users@ovirt.org
Sent: Tuesday, June 12, 2012 10:17:47 PM
Subject: [Users] Can't add an NFS storage domain

Hi,

I just did a clean install of ovirt version 3.1 on CentOS 6.2 (dreyou build @ http://www.dreyou.org/ovirt/). When I attempt to add storage (Data/NFS), the web GUI hangs and the storage is not added.

The engine log reports:

Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.ServerConnectionStatusReturnForXmlRpc
mStatusList Null
mStatus Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.StatusForXmlRpc
mCode 451
mMessage Error storage server connection: (domType=1, spUUID=----, conList=[{'port': '', 'connection': 'cmcd-db-vip.in.hwlab:/vmstore', 'iqn': '', 'portal': '', 'user': '', 'protocol_version': '3', 'password': '**', 'id': '500d06d3-00f7-4d8f-b42b-eabd6a394be7'}],)

The vdsm.log on the host I'm using (also CentOS 6.2, vdsm-4.9.6-0.274.git937a4b7) says:

Thread-3810::ERROR::2012-06-12 14:58:19,879::task::853::TaskManager.Task::(_setError) Task=`c514ffd1-2777-44ae-9b77-06a9fb51f41e`::Unexpected error
Traceback (most recent call last):
  File "/usr/share/vdsm/storage/task.py", line 861, in _run
    return fn(*args, **kargs)
  File "/usr/share/vdsm/logUtils.py", line 38, in wrapper
    res = f(*args, **kwargs)
  File "/usr/share/vdsm/storage/hsm.py", line 1926, in connectStorageServer
    conObj = storageServer.ConnectionFactory.createConnection(conInfo)
  File "/usr/share/vdsm/storage/storageServer.py", line 533, in createConnection
    return ctor(**params)
  File "/usr/share/vdsm/storage/storageServer.py", line 244, in __init__
    options.append("vers=%d" % version)
TypeError: %d format: a number is required, not str

More context is available here: http://d.pr/n/dA5K+

Thanks for any insight.
Ian
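The traceback in this thread shows the mount option being built with `%d` while the connection info carries 'protocol_version': '3' as a string. The Python sketch below reproduces that mismatch and shows one possible coercion. The helper name `nfs_version_option` is hypothetical, and this is not necessarily how the bug tracked in the Bugzilla entry above was fixed.

```python
def nfs_version_option(version):
    """Build the 'vers=N' mount option from an int or a numeric string."""
    return "vers=%d" % int(version)


# Reproduce the failure: %d refuses a str operand.
try:
    "vers=%d" % "3"
except TypeError as err:
    print("TypeError:", err)

# With the coercion, both forms give the same option.
print(nfs_version_option("3"))
print(nfs_version_option(3))
```

This also fits the suggested workaround: with 'Auto Negotiate' selected, presumably no explicit version is passed, so the `%d` formatting is never reached.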
Re: [Users] Fwd: Re: oVirt live snapshot problem
Federico\Kiril, is this problem known to you?

Haim

----- Original Message -----
From: Neil nwilson...@gmail.com
To: users@ovirt.org
Sent: Monday, June 11, 2012 5:06:20 PM
Subject: [Users] Fwd: Re: oVirt live snapshot problem

Hi there,

Sorry if this is a re-post, but I don't recall seeing my message appear on the list, or any replies.

Haim Ateya

We have a known issue where live snapshot fails due to libvirt bug, please attach vdsm.log and libvirtd.log located on hypervisor also version of libvirt and vdsm

Haim

Hi Haim,

Thank you for your reply! Apologies for only replying now.

Please find /var/log/libvirtd.log and /var/log/vdsm/vdsm.log attached in archive ovirt-logs.tar.gz

Versions:
vdsm-4.9.6-0.150.git63061a9.el6.x86_64
libvirt-0.9.4-23.el6_2.8.x86_64

I am hoping to be able to enable live snapshot support and then find a way for me to export the snapshots to the export domain for backup purposes - hopefully this is possible? I don't mind contributing by writing scripts if required.

Regards
Neil Wilson.
Re: [Users] Anyone started working on the documentation for GlusterFS and Ovirt 3.1 yet?
adding Shireesh\Daniel

Hi Robert,

afaik, there is no official documentation that shows how to work with glusterFS using oVirt, since it's too new and we are in early integration phases (shireesh, correct me if i'm wrong).

I would start by browsing to http://www.gluster.org/ and reading about the underlying technology (glusterFS; the admin guide is a good place to start).

as for tips and hints, I will describe what you need in general (very high-level):

- cluster and data-center
  * edit cluster and set the gluster check box
- host side (vds):
  * install vdsm-gluster package
  * install all gluster-related packages
  * make sure all gluster-related services are fully operational
- create new volume (a volume is a collection of bricks, which represent a collection of export domains):
  * allocate storage space on your host
  * make sure this space is exposed via an export (NFS) directory
  * create new volume\bricks

Thanks,
Haim

----- Original Message -----
From: Robert Middleswarth rob...@middleswarth.net
To: users@ovirt.org
Sent: Friday, June 8, 2012 9:23:40 PM
Subject: [Users] Anyone started working on the documentation for GlusterFS and Ovirt 3.1 yet?

I installed ovirt 3.1 using the awesome repo provided by http://www.dreyou.org/ovirt/ and it works really well. There have been a few glitches, but overall it works well. I see an interface for GlusterFS is in there but have no clue how to use it. Have any hints or info been created yet?

Thanks
Robert
Re: [Users] Host CPU type is not compatible with Cluster Properties
----- Original Message -----
From: Itamar Heim ih...@redhat.com
To: Mohsen Saeedi mohsen.sae...@gmail.com
Cc: users@ovirt.org
Sent: Saturday, June 9, 2012 5:35:45 PM
Subject: Re: [Users] Host CPU type is not compatible with Cluster Properties

On 06/09/2012 04:02 PM, Mohsen Saeedi wrote:

Hi

I have a problem with the CPU type in oVirt. I have an HP server with an Intel(R) Xeon(R) CPU 5130 @ 2.00GHz. When I try to add this host as an oVirt host in the default cluster, I get the error message:

Host CPU type is not compatible with Cluster Properties

I know this CPU supports VT-x and it's enabled in the BIOS. I tested it with KVM and virt-manager, and I saw Qemu/KVM when I tried to add a new virtual machine.

Thanks

please send:

vdsClient -s 0 getVdsCaps | grep -i flags

what is the CPU type of the cluster the host is attached to (Intel\AMD)?
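The flags requested above can be skimmed by hand, but a short script makes the comparison easier. The Python sketch below assumes the grep output is a single line of the form `cpuFlags = flag1,flag2,...` and that CPU model entries appear with a `model_` prefix. Both are assumptions about the output format, and the sample line is invented for illustration. The `vmx` (Intel VT-x) and `svm` (AMD-V) flag names are the standard Linux ones.

```python
def parse_cpu_flags(line):
    """Split a 'cpuFlags = a,b,c' line into a list of flag names."""
    _, _, value = line.partition("=")
    return [flag.strip() for flag in value.split(",") if flag.strip()]


def summarize(flags):
    """Report the virtualization extension and any model_* entries found."""
    return {
        "intel_vt": "vmx" in flags,
        "amd_v": "svm" in flags,
        "models": sorted(f for f in flags if f.startswith("model_")),
    }


# Invented sample line; real output will contain many more flags.
sample = "cpuFlags = fpu,vme,sse2,vmx,model_Conroe"
print(summarize(parse_cpu_flags(sample)))
```

Having VT-x present does not by itself make the host match the cluster's CPU type, which is presumably why both the flags and the cluster's CPU type are requested.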
Re: [Users] Not able to add ISCSI storage to the Datacenter
Hi Jose,

Please provide the following output:

On storage server:
lvs command output

On host (vds): perform a manual login to the target, and run:
multipath -r
iscsiadm -m session
vdsClient -s 0 getDeviceList
vgs -o+pv_name

Haim

On Jun 8, 2012, at 13:38, jose garcia johnny.cummi...@gmail.com wrote:

Good morning,

I followed the procedure in https://fedorahosted.org/ovirt/wiki/ISCSISetup, with LV: /dev/vg_iscsitest/iscsi1 as backing storage. When trying to import Export/ISCSI my volume, discovery works and oVirt logs in without problems, but the defined LUN does not show up. Iptables and selinux are disabled.

# tgtadm --mode target --op show
Target 1: host:storage
    System information:
        Driver: iscsi
        State: ready
    I_T nexus information:
    LUN information:
        LUN: 0
            Type: controller
            SCSI ID: IET 0001
            SCSI SN: beaf10
            Size: 0 MB, Block size: 1
            Online: Yes
            Removable media: No
            Readonly: No
            Backing store type: null
            Backing store path: None
            Backing store flags:
        LUN: 1
            Type: disk
            SCSI ID: IET 00010001
            SCSI SN: beaf11
            Size: 21475 MB, Block size: 512
            Online: Yes
            Removable media: No
            Readonly: No
            Backing store type: rdwr
            Backing store path: /dev/vg_iscsitest/iscsi1
            Backing store flags:
    Account information:
    ACL information:
        ALL

output of /var/log/vdsm/vdsm.log is:

Thread-301::DEBUG::2012-06-08 10:24:34,756::clientIF::261::Storage.Dispatcher.Protect::(wrapper) [10.10.30.101]
Thread-301::DEBUG::2012-06-08 10:24:34,757::task::588::TaskManager.Task::(_updateState) Task=`81890ebc-37db-4771-978b-dd2089de33bd`::moving from state init -> state preparing
Thread-301::INFO::2012-06-08 10:24:34,757::logUtils::37::dispatcher::(wrapper) Run and protect: validateStorageServerConnection(domType=3, spUUID='----', conList=[{'connection': '10.10.30.101', 'iqn': 'host:storage', 'portal': '0', 'user': '', 'password': '**', 'id': '----', 'port': '3260'}], options=None)
Thread-301::INFO::2012-06-08 10:24:34,758::storage_connection::188::Storage.ServerConnection::(validate) Request to validate ISCSI storage server
Thread-301::INFO::2012-06-08 10:24:34,759::logUtils::39::dispatcher::(wrapper) Run and protect: validateStorageServerConnection, Return response: {'statuslist': [{'status': 0, 'id': '----'}]}
Thread-301::DEBUG::2012-06-08 10:24:34,759::task::1174::TaskManager.Task::(prepare) Task=`81890ebc-37db-4771-978b-dd2089de33bd`::finished: {'statuslist': [{'status': 0, 'id': '----'}]}
Thread-301::DEBUG::2012-06-08 10:24:34,760::task::588::TaskManager.Task::(_updateState) Task=`81890ebc-37db-4771-978b-dd2089de33bd`::moving from state preparing -> state finished
Thread-301::DEBUG::2012-06-08 10:24:34,761::resourceManager::806::ResourceManager.Owner::(releaseAll) Owner.releaseAll requests {} resources {}
Thread-301::DEBUG::2012-06-08 10:24:34,761::resourceManager::841::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {}
Thread-301::DEBUG::2012-06-08 10:24:34,762::task::980::TaskManager.Task::(_decref) Task=`81890ebc-37db-4771-978b-dd2089de33bd`::ref 0 aborting False
Thread-302::DEBUG::2012-06-08 10:24:34,793::clientIF::261::Storage.Dispatcher.Protect::(wrapper) [10.10.30.101]
Thread-302::DEBUG::2012-06-08 10:24:34,794::task::588::TaskManager.Task::(_updateState) Task=`55fed2df-3434-4bba-a47c-a400d0c568a8`::moving from state init -> state preparing
Thread-302::INFO::2012-06-08 10:24:34,795::logUtils::37::dispatcher::(wrapper) Run and protect: connectStorageServer(domType=3, spUUID='----', conList=[{'connection': '10.10.30.101', 'iqn': 'host:storage', 'portal': '0', 'user': '', 'password': '**', 'id': '----', 'port': '3260'}], options=None)
Thread-302::INFO::2012-06-08 10:24:34,795::storage_connection::146::Storage.ServerConnection::(connect) Request to connect ISCSI storage server
Thread-302::DEBUG::2012-06-08 10:24:34,796::iscsi::148::Storage.Misc.excCmd::(addiSCSIPortal) '/usr/bin/sudo -n /sbin/iscsiadm -m discoverydb -t sendtargets -p 10.10.30.101:3260 --discover' (cwd None)
Thread-302::DEBUG::2012-06-08 10:24:34,821::iscsi::148::Storage.Misc.excCmd::(addiSCSIPortal) SUCCESS: err = ''; rc = 0
Thread-302::DEBUG::2012-06-08 10:24:34,822::iscsi::149::Storage.Misc.excCmd::(addiSCSIPortal) '/usr/bin/sudo -n /sbin/iscsiadm -m node -p 10.10.30.101:3260 -o update -n node.startup -v manual' (cwd None)
Thread-302::DEBUG::2012-06-08 10:24:34,840::iscsi::149
Re: [Users] spice: Warning: failed to connect: Connection refused (111)
Arnon\Alon,

How shall we proceed with debugging this issue?

Haim

On Jun 5, 2012, at 4:06, Eric Yao e...@erickate.idv.tw wrote:

Hi There

I get the error in the subject when I launch the console from the oVirt web page. I followed the documentation below, but it still doesn't work:
http://www.ovirt.org/w/images/a/a9/OVirt-3.0-Installation_Guide-en-US.pdf

I checked /var/log/messages and found the log below. Could you help check and advise? I am really interested in Linux virtualization.

Jun 4 11:11:55 rhevm spice: starting spicec --controller ...
Jun 4 11:11:55 rhevm spice: XDG_VTNR=2
Jun 4 11:11:55 rhevm spice: ORBIT_SOCKETDIR=/tmp/orbit-eric
Jun 4 11:11:55 rhevm spice: XSUNTRANSPORT=shmem
Jun 4 11:11:55 rhevm spice: XDG_SESSION_ID=3
Jun 4 11:11:55 rhevm spice: HOSTNAME=rhevm.erickate.idv.tw
Jun 4 11:11:55 rhevm spice: GIO_LAUNCHED_DESKTOP_FILE_PID=32603
Jun 4 11:11:55 rhevm spice: IMSETTINGS_INTEGRATE_DESKTOP=yes
Jun 4 11:11:55 rhevm spice: GPG_AGENT_INFO=/tmp/keyring-NFMZzQ/gpg:0:1
Jun 4 11:11:55 rhevm spice: TERM=dumb
Jun 4 11:11:55 rhevm spice: SHELL=/bin/bash
Jun 4 11:11:55 rhevm spice: XDG_SESSION_COOKIE=f523a941917cf33b7a6a4c39000a-1338814582.742987-427970052
Jun 4 11:11:55 rhevm spice: HISTSIZE=1000
Jun 4 11:11:55 rhevm spice: XRE_PROFILE_NAME=
Jun 4 11:11:55 rhevm spice: GJS_DEBUG_OUTPUT=stderr
Jun 4 11:11:55 rhevm spice: GNOME_KEYRING_CONTROL=/tmp/keyring-NFMZzQ
Jun 4 11:11:55 rhevm spice: GJS_DEBUG_TOPICS=JS ERROR;JS LOG
Jun 4 11:11:55 rhevm spice: MOZILLA_FIVE_HOME=/usr/lib64/firefox
Jun 4 11:11:55 rhevm spice: IMSETTINGS_MODULE=none
Jun 4 11:11:55 rhevm spice: XRE_PROFILE_LOCAL_PATH=
Jun 4 11:11:55 rhevm spice: USER=eric
Jun 4 11:11:55 rhevm spice: XRE_START_OFFLINE=
Jun 4 11:11:55 rhevm spice: LD_LIBRARY_PATH=/usr/lib64/xulrunner-2:/usr/lib64/firefox:/usr/lib64/firefox/plugins:/usr/lib64/firefox
Jun 4 11:11:55 rhevm spice: SSH_AUTH_SOCK=/tmp/keyring-NFMZzQ/ssh
Jun 4 11:11:55 rhevm spice: GNOME_DISABLE_CRASH_DIALOG=1
Jun 4 11:11:55 rhevm spice: SESSION_MANAGER=local/unix:@/tmp/.ICE-unix/1434,unix/unix:/tmp/.ICE-unix/1434
Jun 4 11:11:55 rhevm spice: USERNAME=eric
Jun 4 11:11:55 rhevm spice: LIBPATH=/usr/lib64/firefox:/usr/lib64/firefox
Jun 4 11:11:55 rhevm spice: MOZ_APP_LAUNCHER=/usr/bin/firefox
Jun 4 11:11:55 rhevm spice: MOZ_PLUGIN_PATH=/usr/lib64/mozilla/plugins:/usr/lib64/firefox/plugins
Jun 4 11:11:55 rhevm spice: GIO_LAUNCHED_DESKTOP_FILE=/usr/share/applications/mozilla-firefox.desktop
Jun 4 11:11:55 rhevm spice: FONTCONFIG_PATH=/etc/fonts:/usr/lib64/firefox/res/Xft
Jun 4 11:11:55 rhevm spice: DESKTOP_SESSION=gnome
Jun 4 11:11:55 rhevm spice: PATH=/usr/local/bin:/usr/bin:/bin:/usr/local/sbin:/usr/sbin:/sbin:/home/eric/.local/bin:/home/eric/bin
Jun 4 11:11:55 rhevm spice: MAIL=/var/spool/mail/eric
Jun 4 11:11:55 rhevm spice: QT_IM_MODULE=xim
Jun 4 11:11:55 rhevm spice: PWD=/home/eric
Jun 4 11:11:55 rhevm spice: XMODIFIERS=@im=none
Jun 4 11:11:55 rhevm spice: LANG=en_US.UTF-8
Jun 4 11:11:55 rhevm spice: GNOME_KEYRING_PID=1425
Jun 4 11:11:55 rhevm spice: XRE_PROFILE_PATH=
Jun 4 11:11:55 rhevm spice: XSUNSMESIZE=512
Jun 4 11:11:55 rhevm spice: GDMSESSION=gnome
Jun 4 11:11:55 rhevm spice: HISTCONTROL=ignoredups
Jun 4 11:11:55 rhevm spice: SHLVL=2
Jun 4 11:11:55 rhevm spice: XDG_SEAT=seat0
Jun 4 11:11:55 rhevm spice: HOME=/home/eric
Jun 4 11:11:55 rhevm spice: GNOME_DESKTOP_SESSION_ID=this-is-deprecated
Jun 4 11:11:55 rhevm spice: DYLD_LIBRARY_PATH=/usr/lib64/firefox:/usr/lib64/firefox
Jun 4 11:11:55 rhevm spice: LOGNAME=eric
Jun 4 11:11:55 rhevm spice: MOZ_LAUNCHED_CHILD=
Jun 4 11:11:55 rhevm spice: MOZ_GRE_CONF=/etc/gre.d/gre64.conf
Jun 4 11:11:55 rhevm spice: DBUS_SESSION_BUS_ADDRESS=unix:abstract=/tmp/dbus-NPZ70DEHmC,guid=2048d088b3bc8ec7b858736a00e7
Jun 4 11:11:55 rhevm spice: SPICE_XPI_SOCKET=/tmp/spicec-0LekjJ/spice-xpi
Jun 4 11:11:55 rhevm spice: NO_EM_RESTART=
Jun 4 11:11:55 rhevm spice: LESSOPEN=||/usr/bin/lesspipe.sh %s
Jun 4 11:11:55 rhevm spice: MOZ_CRASHREPORTER_RESTART_ARG_1=
Jun 4 11:11:55 rhevm spice: SHLIB_PATH=/usr/lib64/firefox:/usr/lib64/firefox
Jun 4 11:11:55 rhevm spice: XRE_BINARY_PATH=
Jun 4 11:11:55 rhevm spice: MOZ_CRASHREPORTER_RESTART_ARG_0=/usr/lib64/firefox/firefox
Jun 4 11:11:55 rhevm spice: WINDOWPATH=2
Jun 4 11:11:55 rhevm spice: DISPLAY=:0
Jun 4 11:11:55 rhevm spice: XDG_RUNTIME_DIR=/run/user/eric
Jun 4 11:11:55 rhevm spice: XUL_APP_FILE=
Jun 4 11:11:55 rhevm spice: MOZ_CRASHREPORTER_DATA_DIRECTORY=/home/eric/.mozilla/firefox/Crash Reports
Jun 4 11:11:55 rhevm spice: NO_AT_BRIDGE=1
Jun 4 11:11:55 rhevm spice: XAUTHORITY=/var/run/gdm/auth-for-eric-fuBlMF/database
Jun 4 11:11:55 rhevm spice: _=/usr/bin/env
Jun 4 11:11:58 rhevm spice: Warning: failed to connect: Connection refused (111)
Jun 4 11:11:58 rhevm spice: spicec execution failed

Cheers
Regards,
Eric Yao
Mail : e...@erickate.idv.tw ericyao1...@gmail.com

--
This message has
Re: [Users] Failed to enable SELinux boolean
Hi,

I guess that if you enable SELinux, setup runs successfully? Can you verify that? Please run getenforce; if it's disabled, edit /etc/sysconfig/selinux and change it to enforcing, reboot the machine, and re-run setup.

Haim

On Jun 5, 2012, at 0:50, Nerijus Baliunas neri...@users.sourceforge.net wrote:

Hello,

with a recent snapshot, engine-setup fails (selinux is disabled in the OS):

2012-06-05 03:34:26::DEBUG::setup_sequences::59::root:: running _configureSelinuxBoolean
2012-06-05 03:34:26::DEBUG::engine-setup::668::root:: Enable httpd_can_network_connect boolean
2012-06-05 03:34:26::DEBUG::common_utils::309::root:: Executing command --> '/usr/sbin/setsebool -P httpd_can_network_connect 1'
2012-06-05 03:34:26::DEBUG::common_utils::335::root:: output =
2012-06-05 03:34:26::DEBUG::common_utils::336::root:: stderr = setsebool: SELinux is disabled.
2012-06-05 03:34:26::DEBUG::common_utils::337::root:: retcode = 1
2012-06-05 03:34:26::DEBUG::setup_sequences::62::root:: Traceback (most recent call last):
  File "/usr/share/ovirt-engine/scripts/setup_sequences.py", line 60, in run
    function()
  File "/usr/bin/engine-setup", line 670, in _configureSelinuxBoolean
    out, rc = utils.execCmd(cmd, None, True, output_messages.ERR_FAILED_UPDATING_SELINUX_BOOLEAN)
  File "/usr/share/ovirt-engine/scripts/common_utils.py", line 340, in execCmd
    raise Exception(msg)
Exception: Failed to enable SELinux boolean

Regards,
Nerijus
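The log in this thread shows setsebool exiting with code 1 because SELinux is disabled, which engine-setup then treats as fatal. The Python sketch below shows one way a script could check `getenforce` before calling `setsebool`. The function names and the injectable `run` parameter are hypothetical, and this is not what engine-setup itself does. It relies only on `getenforce` printing Enforcing, Permissive or Disabled.

```python
import subprocess


def selinux_is_enabled(getenforce_output):
    """True for 'Enforcing' or 'Permissive', False for 'Disabled'."""
    return getenforce_output.strip().lower() in ("enforcing", "permissive")


def enable_boolean(name, run=subprocess.check_output):
    """Persistently set an SELinux boolean, skipping hosts where SELinux is off.

    Returns True if setsebool was invoked, False if it was skipped.
    The run argument is injectable so the logic can be tried without SELinux.
    """
    state = run(["getenforce"]).decode()
    if not selinux_is_enabled(state):
        return False  # setsebool would fail with "SELinux is disabled."
    run(["/usr/sbin/setsebool", "-P", name, "1"])
    return True
```

On a host where `getenforce` prints Disabled, `enable_boolean("httpd_can_network_connect")` returns False instead of failing. Haim's advice in this thread goes the other way: enable SELinux and re-run setup.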
Re: [Users] Fail to add storage domain
- Original Message - From: Elad Tabak e...@tonian.com To: Jakub Libosvar libos...@redhat.com Cc: users@ovirt.org Sent: Thursday, May 31, 2012 9:13:40 AM Subject: Re: [Users] Fail to add storage domain Hi Jakub, The original attempt to mount took too long and I had to refresh the oVirt window and terminate the operation. The second attempt, this time with the correct path, fails with an error. I'm suspecting the first operation locked me from been able to mount again. Using the command you provided below works just fine. Any suggestion? Hi Elad, please attach full vdsm log of the second attempt. please also run ps aux | grep mount and see if you recognize your mount command, check process state (just to see its not stuck). please make sure you have the correct permission on your NFS share; should be vdsm:kvm (36:36) would appreciate if you could file a BZ on the timeout issue for oVirt (web-admin component). Haim Elad On Wed, May 30, 2012 at 6:36 PM, Jakub Libosvar libos...@redhat.com wrote: Hi Elad, seems like the mount command itself failed on timeout. Can you please try this command and let's see what will happen: mkdir -p /mnt/my_storage_test mount -t nfs -o soft,nosharecache,timeo=600,retrans=6 192.168.132.123:/tonian/dcns/vns/marketing/productA /mnt/my_storage_test it should fail as well. Please check that nfs service is running on 192.168.132.123 and is not blocked by iptables or other firewalls and that path is exported properly in /etc/exports. You can also check whether selinux denied the mount operation by `grep AVC /var/log/audit/audit.log | tail` after mount command fails. Kuba On 05/30/2012 05:16 PM, Elad Tabak wrote: Hi, I'm trying to add a storage domain with NFS, but it fails. Looking at the logs, I see the errors but nothing that can help me understand the root cause for the failure. Engine and vdsm logs attached. 
Thanks, Elad ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
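The permission requirement mentioned in this thread (vdsm:kvm, 36:36) can be checked with a short script. This is only a sketch: owner_matches is a hypothetical helper, it relies on GNU stat, and the directory is the test mount point from the thread, so adjust it to the real export.

```shell
#!/bin/sh
# Hypothetical helper: succeed when PATH is owned by the expected uid:gid.
# Returns 2 if the path cannot be inspected. Uses GNU stat (-c).
owner_matches() {
    path=$1; expected=$2
    actual=$(stat -c '%u:%g' "$path") || return 2
    [ "$actual" = "$expected" ]
}

# Test mount point used in the thread; 36:36 is vdsm:kvm.
export_dir=/mnt/my_storage_test
if [ -d "$export_dir" ]; then
    if owner_matches "$export_dir" 36:36; then
        echo "ownership OK"
    else
        echo "ownership is $(stat -c '%u:%g' "$export_dir"), expected 36:36"
    fi
fi
```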
Re: [Users] oVirt live snapshot problem
We have a known issue where live snapshot fails due to a libvirt bug. Please attach vdsm.log and libvirtd.log from the hypervisor, and also the versions of libvirt and vdsm.

Haim

On May 30, 2012, at 18:04, Neil nwilson...@gmail.com wrote:

Hi there,

When trying to create a snapshot of a running guest I get the following error message in the oVirt web console. Can anyone assist with further troubleshooting? Snapshots of guests that are powered down work fine.

Failed to complete Snapshot creation for VM ...

2012-05-30 16:55:52,947 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.CreateSnapshotVDSCommand] (ajp--0.0.0.0-8009-5) [43604c87] -- createVolume parameters: sdUUID=0e6991ae-6238-4c61-96d2-ca8fed35161e spUUID=28adaf38-a4f6-11e1-a859-cb68949043e4 imgGUID=269c72f4-1452-4c1f-b696-25048c849e77 size=53,687,091,200 bytes volFormat=COW volType=Sparse volUUID=c5cecedc-903f-4280-bd00-043a3e473fe7 descr= srcImgGUID=269c72f4-1452-4c1f-b696-25048c849e77 srcVolUUID=43c01d96-b429-4567-9f9c-218782eb0974

Any help is appreciated.

Regards.
Neil Wilson.
___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] ITA-1990 iSCSI LUNs shown on console, but not in ovirt webgui.
Hi,

please attach vdsm.log, located at /var/log/vdsm/vdsm.log. Also, provide the output of the following:
- multipath -ll
- iscsiadm -m session
- lsblk (if it exists)
- lsscsi (if it exists)

On 05/29/2012 08:58 AM, Sven Knohsalla wrote:

Hi,

we currently want to test oVirt, so we installed ovirt-engine and ovirt-node (2.3, latest). When connecting to our SAN, a Dell MD3220i (default storage: iSCSI), oVirt can't find any LUNs mapped to the iSCSI target. The iSCSI configuration is correct, as we're able to connect to the SAN via console on the ovirt-node. Multipathing is enabled, and multipath -ll shows the correct devices.

Unfortunately, this blocks our testing phase, but we are really looking forward to moving our complete virtual environment to oVirt. We did try a NAS system that offers iSCSI; the oVirt web GUI directly found the iSCSI target and bound the LUNs, but we need to connect our SAN (full redundancy needed).

How can we proceed? (Possible log files we can look at, or any configuration we have to pay attention to?)

Thanks in advance!

Best,
Sven Knohsalla | Systems Administration
Netbiscuits - Enabling the mobile experience.
___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
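The diagnostics requested in this thread can be gathered in one pass. The sketch below assumes two hypothetical helper names, run_if_present and collect_iscsi_diagnostics. It runs each command only when its binary is installed, which covers the "if exists" cases for lsblk and lsscsi.

```shell
#!/bin/sh
# Hypothetical helper: run a diagnostic command if its binary is installed,
# otherwise note that it was skipped.
run_if_present() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "== $*"
        "$@" 2>&1
    else
        echo "== $1: not installed, skipped"
    fi
}

# The four commands requested in the thread, in the same order.
collect_iscsi_diagnostics() {
    run_if_present multipath -ll
    run_if_present iscsiadm -m session
    run_if_present lsblk
    run_if_present lsscsi
}

# Usage (as root on the node): collect_iscsi_diagnostics > /tmp/iscsi-diag.txt
```

The resulting file can then be attached to the reply together with /var/log/vdsm/vdsm.log.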
Re: [Users] Not possible to add VM
Is it a build env (using rpms and engine-setup) or built from sources? Haim On May 29, 2012, at 13:00, ov...@qip.ru wrote: With the last engine build from jenkins when I try to add VM it is not possible to choose cluster (dropdown list is empty) error from jboss console 2012-05-29 13:53:20,419 ERROR [org.ovirt.engine.core.bll.GetVdsGroupsByStoragePoolIdQuery] (ajp--0.0.0.0-8009-1) Query GetVdsGroupsByStoragePoolIdQuery failed. Exception message is PreparedStatementCallback; bad SQL grammar [select * from getvdsgroupsbystoragepoolid(?)]; nested exception is org.postgresql.util.PSQLException: ERROR: function getvdsgroupsbystoragepoolid(unknown) does not exist -- ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] How to change storage domain ip address
I'm not familiar with a conventional way of making such a change; the only way I can think of is altering the storage-related tables in the database. I would start with the following table:

SELECT * from storage_server_connections;

then create a query that replaces the current IP address with the new one.

Haim

- Original Message -
From: T-Sinjon tscbj1...@gmail.com
To: users@ovirt.org
Sent: Tuesday, May 29, 2012 6:47:20 PM
Subject: [Users] How to change storage domain ip address

For some reason, the IP address of my NFS storage domain server has changed from 192.168.x.x to 172.16.x.x, and my VMDomain became inactive. The VMDomain NFS export path should change to 172.16.x.x:/Path/To/VMDomain. Where can I change this so that the domain becomes active again?
___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
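Before writing the update query, it may help to pin down the exact string rewrite it has to perform. The sketch below shows only that rewrite on a "host:/path" value. The helper name replace_nfs_host and the concrete addresses are hypothetical, since the thread masks the real ones. How the value is stored in storage_server_connections is not shown in the thread, so inspect the SELECT output before turning this into an UPDATE.

```shell
#!/bin/sh
# Hypothetical helper: swap the server part of an NFS "host:/path" string.
# Values that belong to a different server are printed unchanged.
replace_nfs_host() {
    conn=$1; old=$2; new=$3
    case "$conn" in
        "$old":*) echo "$new:${conn#*:}" ;;  # keep everything after the first colon
        *)        echo "$conn" ;;
    esac
}

# Example addresses are placeholders for the masked 192.168.x.x / 172.16.x.x.
replace_nfs_host "192.168.1.10:/Path/To/VMDomain" 192.168.1.10 172.16.1.10
```

Matching on the full "host:" prefix avoids touching a different server whose address merely starts with the same digits.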
Re: [Users] How to change storage domain ip address
- Original Message - From: T-Sinjon tscbj1...@gmail.com To: Haim Ateya hat...@redhat.com Cc: users@ovirt.org Sent: Tuesday, May 29, 2012 8:09:38 PM Subject: Re: [Users] How to change storage domain ip address After i update postgreSQL , the ip changed correctlly. then i try to active my VMDomain, but it throws the error: 2012-05-30 01:05:39,699 ERROR [org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand] (pool-5-thread-46) [277fd6c5] Command org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand throw Vdc Bll exception. With error message VdcBLLException: Cannot allocate IRS server what's the IRS server mean? and how to resolve it IRS = Image Repository Server. please attach full vdsm.log so we can examine the connectStorageServer command and the corresponding mount point. On 30 May, 2012, at 12:14 AM, Haim Ateya wrote: i'm not familiar with conventional way of doing such a change; the only way I can think of is altering storage related tables on data-base. i would start with the following table: SELECT * from storage_server_connections; then create a query that changes current ip address with new one. Haim - Original Message - From: T-Sinjon tscbj1...@gmail.com To: users@ovirt.org Sent: Tuesday, May 29, 2012 6:47:20 PM Subject: [Users] How to change storage domain ip address For some reason , the ip address of my NFS storage domain server has changed from 192.168.x.x to 172.16.x.x , then my VMDomain became inactive. the VMDomain NFS Export Path should change to 172.16.x.x:/Path/To/VMDomain, where can i change this to let the domain active again? ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] How to change storage domain ip address
- Original Message - From: T-Sinjon tscbj1...@gmail.com To: Haim Ateya hat...@redhat.com Cc: users@ovirt.org Sent: Tuesday, May 29, 2012 8:31:01 PM Subject: Re: [Users] How to change storage domain ip address I guess you mean engine.log , because i can't find any log new when i do this action. vdsm.log can be found on your host (hyper-visor) under /var/log/vdsm/vdsm.log. how many hosts do you have in your pool ? what's the status of the hosts ? here is the full log: 2012-05-30 01:28:24,852 INFO [org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand] (pool-5-thread-44) [18001bda] Lock Acquired to object EngineLock [exclusiveLocks= key: org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand value: 5e2ac537-6a73-4faf-8379-68f3ff26a75d , sharedLocks= ] 2012-05-30 01:28:24,864 INFO [org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand] (pool-5-thread-44) [18001bda] Running command: ActivateStorageDomainCommand internal: false. Entities affected : ID: 5e2ac537-6a73-4faf-8379-68f3ff26a75d Type: Storage 2012-05-30 01:28:24,876 INFO [org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand] (pool-5-thread-44) [18001bda] Lock freed to object EngineLock [exclusiveLocks= key: org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand value: 5e2ac537-6a73-4faf-8379-68f3ff26a75d , sharedLocks= ] 2012-05-30 01:28:24,876 INFO [org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand] (pool-5-thread-44) [18001bda] ActivateStorage Domain. Before Connect all hosts to pool. Time:5/30/12 1:28 AM 2012-05-30 01:28:24,901 INFO [org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand] (pool-5-thread-44) [18001bda] ActivateStorage Domain. After Connect all hosts to pool. 
Time:5/30/12 1:28 AM 2012-05-30 01:28:24,902 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.ActivateStorageDomainVDSCommand] (pool-5-thread-44) [18001bda] START, ActivateStorageDomainVDSCommand(storagePoolId = 524a7003-edec-4f52-a38e-b15cadfbe3ef, ignoreFailoverLimit = false, compatabilityVersion = null, storageDomainId = 5e2ac537-6a73-4faf-8379-68f3ff26a75d), log id: 49e134ff 2012-05-30 01:28:24,906 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.ActivateStorageDomainVDSCommand] (pool-5-thread-44) [18001bda] FINISH, ActivateStorageDomainVDSCommand, log id: 49e134ff 2012-05-30 01:28:24,907 ERROR [org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand] (pool-5-thread-44) [18001bda] Command org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand throw Vdc Bll exception. With error message VdcBLLException: Cannot allocate IRS server 2012-05-30 01:28:24,914 INFO [org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand] (pool-5-thread-44) [18001bda] Command [id=3c916f08-7fb3-44ea-882e-4f56bc9716a2]: Compensating CHANGED_STATUS_ONLY of org.ovirt.engine.core.common.businessentities.storage_pool_iso_map; snapshot: EntityStatusSnapshot [id=storagePoolId = 524a7003-edec-4f52-a38e-b15cadfbe3ef, storageId = 5e2ac537-6a73-4faf-8379-68f3ff26a75d, status=Maintenance] On 30 May, 2012, at 1:14 AM, Haim Ateya wrote: - Original Message - From: T-Sinjon tscbj1...@gmail.com To: Haim Ateya hat...@redhat.com Cc: users@ovirt.org Sent: Tuesday, May 29, 2012 8:09:38 PM Subject: Re: [Users] How to change storage domain ip address After i update postgreSQL , the ip changed correctlly. then i try to active my VMDomain, but it throws the error: 2012-05-30 01:05:39,699 ERROR [org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand] (pool-5-thread-46) [277fd6c5] Command org.ovirt.engine.core.bll.storage.ActivateStorageDomainCommand throw Vdc Bll exception. With error message VdcBLLException: Cannot allocate IRS server what's the IRS server mean? 
and how to resolve it IRS = Image Repository Server. please attach full vdsm.log so we can examine the connectStorageServer command and the corresponding mount point. On 30 May, 2012, at 12:14 AM, Haim Ateya wrote: i'm not familiar with conventional way of doing such a change; the only way I can think of is altering storage related tables on data-base. i would start with the following table: SELECT * from storage_server_connections; then create a query that changes current ip address with new one. Haim - Original Message - From: T-Sinjon tscbj1...@gmail.com To: users@ovirt.org Sent: Tuesday, May 29, 2012 6:47:20 PM Subject: [Users] How to change storage domain ip address For some reason , the ip address of my NFS storage domain server has changed from 192.168.x.x to 172.16.x.x , then my VMDomain became inactive. the VMDomain NFS Export Path should change to 172.16.x.x:/Path/To/VMDomain, where can i change this to let the domain active again? ___ Users mailing list Users@ovirt.org http
Re: [Users] SPM host in unknown status
Hi,

the first question that comes to mind is why the host is in a non-responsive state. Please check the following:
1. The vdsmd service is running on the host.
2. No firewall is blocking communication in or out.
3. There is no network issue between host and manager.

Now, for your question: you can use the manual fence option (confirm host has been rebooted), which will free the SPM role from the faulty host, and the engine will elect a new SPM.

Haim

On May 27, 2012, at 18:32, T-Sinjon tscbj1...@gmail.com wrote:

Description of problem: I have 2 nodes:

ovirt-node1.local  Non Responsive  SPM
ovirt-node2.local  Up              None

The SPM node is stuck in Non-Responsive status and can't be activated. All VMs on the node went into Unknown status and the master VM domain became inactive. When I try the Maintenance action on node1, it says:

Error: Cannot switch Host to Maintenance mode. Host still has running VMs on it and is in Non-Responsive state.

But there is no VM running on node1; it only has 2 VMs in Unknown status. Because I can't activate the SPM host, I can't activate the VM storage domain.

1. How can I migrate the SPM role to another host in my data center, such as node2?
2. How can I bring node1 to Up status? (I did the 'confirm the host has been rebooted' action and rebooted node1, but it had no effect.)
___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Some problems wtith engine
- Original Message - From: ov...@qip.ru To: users@ovirt.org Sent: Friday, May 25, 2012 9:10:50 AM Subject: [Users] Some problems wtith engine 1. Problem to add second disk to VM I created shared disk and attach it to VM, but then when I tried to start VM got error XML error: Attempted double use of PCI Address '0:0:2.0'. VM not started even I detach and remove second disk. May it is possible to correct VM configuration parameters manually please attach full vdsm engine log - I would like to have a look on the create vm command both sides and XML sent to libvirt. also attach /var/log/libvirt/qemu/vmName.log 2. Sometimes engine lost the connection to VMs VM is working, but in engine it has status Not Responding, VM is Win2003R2 with no agent installed, now it has UP time 8 days, and i could connect to console using rdp , on vdsm host vdsClient show it status is UP it reminds me of: https://bugzilla.redhat.com/show_bug.cgi?id=821468 what version of libvirt are you running with? # vdsClient -s 0 list table 121119f1-7f82-437e- ba 68- ef 5299de443a 31874 VM01 Up* # virsh -r list --all Id Name State 6 VM01 running but it can't be stopped nor by engine UI nor but vdsClient # vdsClient -s 0 reset 121119f1-7f82-437e- ba 68- ef 5299de443a Not implemented # vdsClient -s 0 shutdown 121119f1-7f82-437e- ba 68- ef 5299de443a 1 mes # but VM still working 3. Can't using fedora16 as a guest. Fedora16 VM minimal install with updates is starting only in single-user mode (i can add packages, do update), but if i do telinit 3 or start it in muili -user mode it freeze what version of qemu-kvm are you running with ? once vm freezes, what does vdsClient list table and virsh -r list tells ? also, please provide /var/log/libvirt/qemu/vmName.log, are there any IO errors ? what the status of the corresponding qemu process on host ? is it on 100% ? 4. 
Can't restore VMs with old ovf format with first tag ovf :Envelope xmlns:rasd=http://schemas.dmtf.org/wbem/wscim/1/cim-schema/2/CIM_ResourceAllocationSettingData; xmlns:vssd=http://schemas.dmtf.org/wbem/wscim/1/cim-schema/2/CIM_VirtualSystemSettingData; xmlns:xsi=http://www.w3.org/2001/XMLSchema-instance; xmlns: ovf =http://schemas.dmtf.org/ ovf /envelope/1/ ovf :version=0.9 The error in engine: Failed to read VM 'MailSRV' OVF , it may be corrupted we need engine log. -- ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Ovirt-guest-agent availabilty
AFAIK, we support RHEL and Windows. As for RHEL, I guess you can clone our git repo and compile it yourself. Gal - any additions?

Haim

On May 23, 2012, at 8:51, Rahul Upadhyaya rak...@gmail.com wrote:

Hi folks,

Which OSs have ovirt-guest-agent support, and where can I find the packages? I was able to find the guest agent for Fedora in the oVirt repo, but apart from Fedora I don't see a guest agent for any other OS there.

--
Regards,
Rahul
___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Host get non operational status after activating ISO or Export SD
- Original Message - From: ov...@qip.ru To: users@ovirt.org Sent: Tuesday, May 22, 2012 10:04:23 AM Subject: [Users] Host get non operational status after activating ISO or Export SD in working DC after vdsm update if DC has active ISO or Export domain updated host set to non operational status with event message Host kvm04 cannot access one of the Storage Domains attached to it, or the Data Center object. Setting Host state to Non-Operational. if ISO and Export domain are in maintenance mode or detached host status is UP If i downgrade vdsm to vdsm -4.9.6-0.187.git8a14b63 host status is UP with attached and activated ISO and Export domains vdsm log is in attached file Hi, there is an issue introduced with 'betterpopen' patch. you can either cherry-pick the following patch-set http://gerrit.ovirt.org/#change,4414 on the top of vdsm-master or revert till we fix this issue. -- ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Host can't join the cluster
This issue was introduced in Ic7b4a63a974bfc301f3294603d8fe91f534b74dd (http://gerrit.ovirt.org/4284). We are currently working to resolve it, and a patch will be sent soon. In the meantime, you can edit /usr/share/vdsm/dsaversion.py and change the following fields:

software_version = 4.9
software_revision = 0

Then restart the vdsmd service and activate the host again.

Haim

- Original Message -
From: ov...@qip.ru
To: Haim Ateya hat...@redhat.com
Cc: users@ovirt.org
Sent: Wednesday, May 16, 2012 3:19:04 PM
Subject: Re: Re: [Users] Host can't join the cluster

With both secure and non-secure connections I get the same non-operational status. The output of getVdsCaps on the host is empty. This is the event log in webadmin:

2012-May-16, 16:05:04 Host kvm04 is compatible with versions (3.0,3.1) and cannot join Cluster Default which is set to version 3.1.
2012-May-16, 16:05:04 Detected new Host kvm04. Host state was set to Up.
2012-May-16, 16:05:03 Host kvm04 was autorecovered.
2012-May-16, 16:05:03 Host kvm04 is compatible with versions (3.0,3.1) and cannot join Cluster Default which is set to version 3.1.
2012-May-16, 16:01:02 Host kvm04 was activated by admin@internal.

If I downgrade to 4.9.6-0.196.gitb8b79b5, the host state is set to Up.

Wed 16 May 2012 15:24:36 +0400, Haim Ateya hat...@redhat.com wrote:

I guess you compiled vdsm yourself. Please run the following command from your host: vdsClient -s 0 getVdsCaps (assuming you work with SSL). Anyhow, it smells like a known issue with the latest build, where vdsm returns supported_clusters = 3.0 and the engine rejects the host, but let's find out.

Thanks,
Haim

From: ov...@qip.ru
To: users@ovirt.org
Sent: Wednesday, May 16, 2012 1:58:33 PM
Subject: [Users] Host can't join the cluster

After upgrading vdsm from 4.9.6-0.196.gitb8b79b5 to 4.9.6-0.201.git98e8078, the engine sets the host to non-operational mode. The error is: Host kvm04 is compatible with versions (3.0,3.1) and cannot join Cluster Default which is set to version 3.1.
-- ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] engine-setup on rhel6.2 generates error
- Original Message - From: Andrei Vakhnin andrey.a.vakh...@nasa.gov To: users@ovirt.org Sent: Tuesday, May 15, 2012 7:27:52 PM Subject: [Users] engine-setup on rhel6.2 generates error I am trying to run engine-setup utility on rhels6.2 machines to configure ovirt-engine but get an error while starting jboss. (Exception: Error while updating jboss ROOT.war directory) Any ideas what is happening? Thanks Andrei oVirt Engine will be installed using the following configuration: = http-port: 80 https-port:443 host-fqdn: xx.cluster.net auth-pass: org-name: NASA LaRC default-dc-type: NFS db-remote-install: local db-local-pass: config-nfs:no override-iptables: no Proceed with the configuration listed above? (yes|no): yes Installing: Configuring oVirt-engine... [ DONE ] Creating CA... [ DONE ] Editing JBoss Configuration...[ ERROR ] Error while updating jboss ROOT.war directory Please check log file /var/log/ovirt-engine/engine-setup_2012_05_15_12_23_53.log for more information /var/log/ovirt-engine/engine-setup_2012_05_15_12_23_53.log: File /usr/share/ovirt-engine/scripts/setup_controller.py, line 54, in runAllSequences sequence.run() File /usr/share/ovirt-engine/scripts/setup_sequences.py, line 152, in run step.run() File /usr/share/ovirt-engine/scripts/setup_sequences.py, line 59, in run function() File /usr/bin/engine-setup, line 906, in _editRootWar raise Exception(output_messages.ERR_EXP_UPD_ROOT_WAR) Exception: Error while updating jboss ROOT.war directory Hi Andrei, We usually run oVirt on Fedora machine. Can you check if the following file exists on your machine: /etc/ovirt-engine/web-conf.js Alex\Ofer - any thoughts ? ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Default storage pool won't activate after reboot
- Original Message - From: Ian Levesque i...@crystal.harvard.edu To: Haim Ateya hat...@redhat.com Cc: users@ovirt.org Sent: Tuesday, May 15, 2012 10:21:04 PM Subject: Re: [Users] Default storage pool won't activate after reboot On May 11, 2012, at 3:09 PM, Ian Levesque wrote: On May 9, 2012, at 4:02 PM, Haim Ateya wrote: - please access your NFS storage - change dir to export directory - /vmstore/ovirt - cat the following e12a0f53-ee72-44bc-ad26-93f9b4613c6c/dom_md/metadata CLASS=Data DESCRIPTION=glusternfs IOOPTIMEOUTSEC=10 LEASERETRIES=3 LEASETIMESEC=60 LOCKPOLICY= LOCKRENEWALINTERVALSEC=5 MASTER_VERSION=1 POOL_DESCRIPTION=Default POOL_DOMAINS=82503778-db5d-4ad3-a863-b5484b2a08a1:Active,e12a0f53-ee72-44bc-ad26-93f9b4613c6c:Active POOL_SPM_ID=3 POOL_SPM_LVER=5 POOL_UUID=af5bcc86-898a-11e1-9632-003048c85226 REMOTE_PATH=cmcd-db-vip.in.hwlab:/vmstore/ovirt ROLE=Master SDUUID=e12a0f53-ee72-44bc-ad26-93f9b4613c6c TYPE=NFS VERSION=0 _SHA_CKSUM=c193c6426fd840770124c2b1db81c6138337f2ef the params sent by the manager connecting host to pool fits to what written in metadata, hence, normally, vdsm shouldn't fail. however, could please run the following: - mount command (on host) - if 'cmcd-db-vip.in.hwlab:/vmstore/ovirt' exists, umount it. - restart vdsmd service - try active pool again (from manager) just want to make sure there is no caching issue. please attach logs (vdsm only) after you perform the above. I'm not even seeing the storage domain being passed to the node. I assume this is because the storage domain is inactive but I can't seem to figure out why, or what I need to do to reactivate it. The log (vdsm_log.txt) is attached, but it contains no mention of the storage domain. Also attached (vdsm_log_ref.txt) is a snippet of log that includes /rhev/data-center/mnt/cmcd-db-vip.in.hwlab:_vmstore_ovirt in the mount list. Is there any further troubleshooting I can do before giving up on this? 
This is extremely odd as everything looks right, I would have blame some caching issues, but you mentioned that vdsm service was restarted. Saggi\Edu\Federico - any insight on this case ? why would connectStoragePool fails on cannot find master domain if command sent by manager uses right params as mentioned in meta-data ? connectStorageServer succeeds as well (although I see no mount command - guess it due to OOP). Ian - you can always use the Re-initialize data-center flow which basically means you loss all your current data (iirc) and new master domain is created. let's wait for the guys to give their insight, meantime I would try reboot the host just to make sure there is no stale NFS entry there. Thanks, Ian ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
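The comparison described in this thread, between the domain metadata and the pool and master-domain IDs the manager sends, can be scripted. This is a sketch only: md_get and check_master_domain are hypothetical helpers, and it assumes the metadata file holds one KEY=VALUE pair per line (the listing above is flattened by the archive).

```shell
#!/bin/sh
# Hypothetical helper: print the value of KEY from a KEY=VALUE metadata file.
md_get() {
    sed -n "s/^$2=//p" "$1" | head -n 1
}

# Hypothetical helper: succeed when the metadata names the expected pool,
# the expected storage domain, and marks that domain as Master.
check_master_domain() {
    file=$1; sp_uuid=$2; msd_uuid=$3
    [ "$(md_get "$file" POOL_UUID)" = "$sp_uuid" ] &&
    [ "$(md_get "$file" SDUUID)" = "$msd_uuid" ] &&
    [ "$(md_get "$file" ROLE)" = "Master" ]
}

# Usage on the NFS server, from the export directory, with the IDs from the vdsm error:
#   check_master_domain e12a0f53-ee72-44bc-ad26-93f9b4613c6c/dom_md/metadata \
#       af5bcc86-898a-11e1-9632-003048c85226 e12a0f53-ee72-44bc-ad26-93f9b4613c6c \
#       && echo "metadata matches"
```

A match here, as in Ian's case, points away from the metadata and toward stale mounts or caching on the host.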
Re: [Users] The SPM host node is in unresponsive mode
- Original Message - From: Shu Ming shum...@linux.vnet.ibm.com To: users@oVirt.org users@ovirt.org Sent: Tuesday, May 15, 2012 4:56:36 AM Subject: [Users] The SPM host node is in unresponsive mode Hi, I attached one host node in my engine. Because it is the only one node, it is automatically the SPM node. And it used to run well in my engine. Yesterday, some errors happened in the network work of the host node. That made the node become unresponsive in the engine. I am sure the network errors are fixed and want to bring the node back to life now. However, I found that the only one node could not be confirm as host been rebooted and could not be set into the maintenance mode. The reason given there is no active host in the datacenter and SPM can not enter into maintenance mode. It seems that it fell into a logic loop here. Losting network can be quite common in developing environment even in production environment, I think we should have a way to address this problem on how to repair a host node encountering network down for a while. Hi Shu, first, for the manual fence to work (confirm host have been rebooted) you will need another host in the cluster which will be used as a proxy and send the actual manual fence command. second, you are absolutely right, loss of network is a common scenario, and we should be able to recover, but lets try to understand why your host remain unresponsive after network returned. please ssh to the host and try the following: - vdsClient -s 0 getVdsCaps (validity check making sure vdsm service is up and running and communicate with its network socket from localhost) - please ping between host and engine - please make sure there is no firewall on blocking tcp 54321 (on both host and engine) also, please provide vdsm.log (from the time network issues begun) and spm-lock.log (both located on /var/log/vdsm/). as for a mitigation, we can always manipulate db and set it correctly, but first, lets try the above. 
-- Shu Ming shum...@linux.vnet.ibm.com IBM China Systems and Technology Laboratory ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
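The firewall check on TCP 54321 from this thread can be approximated without extra tools. The sketch below assumes bash (for its /dev/tcp redirection) and coreutils timeout are available. tcp_port_open is a hypothetical helper, and node1.example.com is a placeholder host name.

```shell
#!/bin/sh
# Hypothetical helper: succeed if a TCP connection to HOST:PORT can be
# opened within 3 seconds. Relies on bash's /dev/tcp and coreutils timeout.
tcp_port_open() {
    # Inside bash -c, $0 is the host and $1 is the port.
    timeout 3 bash -c 'exec 3<>"/dev/tcp/$0/$1"' "$1" "$2" 2>/dev/null
}

# 54321 is the port named in the thread; the host name is a placeholder.
# Usage from the engine:
#   tcp_port_open node1.example.com 54321 && echo "port reachable" || echo "blocked or down"
```

A failure here does not distinguish a firewall from a stopped vdsmd service, so it complements the vdsClient and ping checks rather than replacing them.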
Re: [Users] Default storage pool won't activate after reboot
Hi Ian, would it be possible for you to attach both ovirt-engine (manager) vdsm logs (host) so can investigate further? - Original Message - From: Ian Levesque i...@crystal.harvard.edu To: Jacob Wyatt jwy...@ggc.edu Cc: users@ovirt.org Sent: Wednesday, May 9, 2012 9:53:54 PM Subject: Re: [Users] Default storage pool won't activate after reboot On May 9, 2012, at 2:40 PM, Jacob Wyatt wrote: StoragePoolUnknown: Unknown pool id, pool not connected: ('af5bcc86-898a-11e1-9632-003048c85226',) I'm new to oVirt myself but I thought that the top level directory name was the UUID of the storage pool. In that case the UUID it lists (af5bcc86-898a-11e1-9632-003048c85226) isn't there. Have you looked directly on the storage machine itself? Perhaps the node can't see that directory due to permission issues? Hi Jacob - thanks for the response. The thought crossed my mind as well but I'm not sure if that UUID should indeed directly correspond to a directory. I haven't poked around the storage pools before. If that's the case, I do wonder what happened to my pool, and also what those other directories are... Best, Ian ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Default storage pool won't activate after reboot
- Original Message - From: Ian Levesque i...@crystal.harvard.edu To: Haim Ateya hat...@redhat.com Cc: users@ovirt.org, Jacob Wyatt jwy...@ggc.edu Sent: Wednesday, May 9, 2012 10:08:17 PM Subject: Re: [Users] Default storage pool won't activate after reboot On May 9, 2012, at 2:59 PM, Haim Ateya wrote: Hi Ian, would it be possible for you to attach both ovirt-engine (manager) vdsm logs (host) so can investigate further? Sure -- you can downl load a zip of both logs here: http://cl.ly/1h3u322S3f1Y1W0p1T3E/logs.zip it appears that connectStoragePool command fails with the following error: Thread-65664::ERROR::2012-05-09 13:17:46,922::dispatcher::66::Storage.Dispatcher.Protect::(run) {'status': {'message': Cannot find master domain: 'spUUID=af5bcc86-898a-11e1-96 32-003048c85226, msdUUID=e12a0f53-ee72-44bc-ad26-93f9b4613c6c', 'code': 304}} and that's the reason your pool doesn't turn active. in order to understand why, I would like to get a view of your domain metadata: what kind of storage are you using ? NFS or local-domain ? - please access your NFS storage - change dir to export directory - /vmstore/ovirt - cat the following e12a0f53-ee72-44bc-ad26-93f9b4613c6c/dom_md/metadata Thanks, Ian ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Adding Host
On 03/27/2012 11:52 AM, Rene Rosenberger wrote:

Hi,

I have installed oVirt successfully. When I try to add a host on which KVM is already installed, I get an error. I add it through the web GUI with "add new host". Then an installation process starts on the remote host, and during it I get this error:

Step: SetNetworking; Details: addNetwork error trying to add management bridge.

What can I do?

Could you please attach the installation logs located under /tmp/ on the installed host? What type of host are you trying to add? Is it RHEL or Fedora?

Regards,
rene
___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] Hey using NFS as my main datastore
- Original Message -
From: Itamar Heim ih...@redhat.com
To: Dominic Kaiser domi...@bostonvineyard.org
Cc: users@ovirt.org
Sent: Thursday, February 9, 2012 7:44:54 PM
Subject: Re: [Users] Hey using NFS as my main datastore

On 02/09/2012 07:41 PM, Dominic Kaiser wrote:

I am using an NFS share on an Openfile server as my main datastore. When I went to fire up a virtual instance, this is the error I got:

VM Building_Automation is down. Exit message internal error process exited while connecting to monitor: qemu-kvm: -drive file=/rhev/data-center/c10c25e0-526a-11e1-a5e4-87fa674ffb5d/340dad49-2ec7-475e-b6c6-668504bdb5b1/images/da582689-2248-4d0e-a017-b09f8cfa9410/78340ab7-b1f5-4fad-9150-437e7ff9d14c,if=none,id=drive-virtio-disk0,format=raw,serial=0e-a017-b09f8cfa9410,cache=none,werror=stop,rerror=stop,aio=threads: could not open disk image /rhev/data-center/c10c25e0-526a-11e1-a5e4-87fa674ffb5d/340dad49-2ec7-475e-b6c6-668504bdb5b1/images/da582689-2248-4d0e-a017-b09f8cfa9410/78340ab7-b1f5-4fad-9150-437e7ff9d14c: Permission denied.

It might be an SELinux issue; please run the following:
1) getenforce; if it returns Enforcing, run 'setenforce 0' and re-run the VM
2) ls -Zl /rhev/data-center/c10c25e0-526a-11e1-a5e4-87fa674ffb5d/340dad49-2ec7-475e-b6c6-668504bdb5b1/images/da582689-2248-4d0e-a017-b09f8cfa9410/78340ab7-b1f5-4fad-9150-437e7ff9d14c
3) rpm -qa | egrep 'selinux|qemu-kvm|vdsm|libvirt'

Does vdsm have to be installed on the server that shares the main datastore? Export and ISO are fine, no problems.

No need (say you used a NetApp; you couldn't install vdsm on it). But you get a permission denied. What does ls -l show on that path?
___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
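The pattern in the package query from this thread has to be quoted, so that the shell passes the | characters to grep instead of treating them as pipes. A small sketch, with filter_virt_packages as a hypothetical helper name:

```shell
#!/bin/sh
# Hypothetical helper: keep only the packages relevant to this problem.
# Reads a package list (for example the output of `rpm -qa`) on stdin.
filter_virt_packages() {
    grep -E 'selinux|qemu-kvm|vdsm|libvirt'
}

# Usage on the host: rpm -qa | filter_virt_packages | sort
```

grep -E is the current spelling of egrep and accepts the same alternation pattern.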
Re: [Users] addNetwork error trying to add management bridge.
On Feb 5, 2012, at 4:16, justin potts justinpotts...@gmail.com wrote:
> I've got the following errors:
>
> 2012-Feb-04, 20:07:46 Host ovirtnode.grims.com installation failed. Please refer to log files for further details..
> 2012-Feb-04, 20:07:46 Failed to install Host ovirtnode.grims.com. Step: SetNetworking; Details: addNetwork error trying to add management bridge.
>
> Previous to this, the node went through the step of installing various packages. This is an FC16/64bit/VMX capable host. I've been wondering for some time if the install is RHEL specific. In any case, 2 questions: What causes this error? Which logs does this message refer to?

Please attach the host installation logs, located under /tmp/vds_*.log on your node.

Haim

> Thanks in advance. I've had a lot of fun using your tool so far :)
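For anyone who needs to attach those logs, here is a rough sketch that packs everything matching /tmp/vds_*.log into a single tarball. The function name `bundle_install_logs` and the output file name are made up for this example. Only the /tmp/vds_*.log location comes from the thread.

```python
import glob
import os
import tarfile


def bundle_install_logs(log_dir="/tmp", pattern="vds_*.log",
                        out_path="vds_logs.tar.gz"):
    """Pack every file matching pattern under log_dir into a gzip tarball.

    Returns the base names of the files that were added.
    """
    matches = sorted(glob.glob(os.path.join(log_dir, pattern)))
    with tarfile.open(out_path, "w:gz") as archive:
        for path in matches:
            # Store only the file name, not the full /tmp/... path.
            archive.add(path, arcname=os.path.basename(path))
    return [os.path.basename(path) for path in matches]
```

Calling `bundle_install_logs()` on the node with the defaults writes vds_logs.tar.gz in the current directory. An empty return list means no installation logs were found under /tmp.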
Re: [Users] Host discovery failing due to host network being lost
On Jan 28, 2012, at 10:27, Deepak C Shetty deepa...@linux.vnet.ibm.com wrote:
> Hi ovirt users,
>
> I tried asking the below on IRC but did not get any replies, so I am sending this mail. Can someone help me understand what could be causing my managed host to lose its network settings when I try to discover it from ovirt-engine? I then have to set up the network again manually (using a remote console). It even overwrites my ifcfg-eth0 file. I see a new bridge called ovirtmgmt getting added, along with a route entry corresponding to it, and ifcfg-eth0 also has that entry.
>
> I feel that the ovirt scripts running on the host should at least take a backup of the ifcfg* files they modify, so that in case of issues like these the user can log in via remote console and restore the original ifcfg file.
>
> This is what I asked on IRC... responses appreciated...
>
> deepakcs Hello, i just configured ovirt-engine and discovering my first managed host.. got some Qs
> deepakcs what diff does it make when i select override ip tables check box during new Host workflow ?
> * djasa has quit (Ping timeout: 480 seconds)
> deepakcs this is the second time i am discovering this host and everytime the host discovery hangs during Installing and if i check the host, it goes out of network has anybody faced this problem before ?
> deepakcs While discovering the host, the host network ipv4 address is gone ! and thus the host goes out of network
>
> Some more updates..
> 1) I manually set up the host network again, made it pingable (especially from the ovirt-engine node), and clicked Re-install in the web GUI, this time with the 'override ip tables' check box enabled.
> 2) This time it successfully completed the node bootstrap steps (as seen in the Events window) and the status was Reboot.
> 3) But after the reboot the same thing happened: the host lost its ipv4 address, the route entries are completely gone, and in the web GUI the host is shown as non-responsive, which is expected because the host is not on the network anymore...
>
> What is the reason the above is happening?

Hi,

First, in order to understand the reason for the bootstrap failures, please attach engine.log (ovirt-engine) and the node logs (under /tmp/vds_bootstrap.log, vds_installer.log).

Second, the loss of network you describe could have one of the following causes:
- the host is configured with bonding, but bonding is not configured correctly.
- NetworkManager interferes with the networking config.
- the bridge (ovirtmgmt) is not set with any BOOTPROTO or ONBOOT.

Please attach the network scripts (ifcfg-eth0, ifcfg-ovirtmgmt) and /var/log/messages.

Haim
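As a quick way to check the last point before sending the files, the sketch below parses the text of an ifcfg file and reports whether BOOTPROTO or ONBOOT is missing or empty. The function names `parse_ifcfg` and `missing_keys` are made up for this illustration. It only looks at simple KEY=value lines and says nothing about bonding or NetworkManager.

```python
def parse_ifcfg(text):
    """Parse KEY=value lines of an ifcfg-style file into a dict."""
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blanks, comments, and anything that is not an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Values may be quoted, e.g. BOOTPROTO="dhcp".
        settings[key.strip().upper()] = value.strip().strip("\"'")
    return settings


def missing_keys(settings, required=("BOOTPROTO", "ONBOOT")):
    """Return the required keys that are absent or set to an empty value."""
    return [key for key in required if not settings.get(key)]
```

On the host, read the contents of ifcfg-ovirtmgmt, pass them to `parse_ifcfg`, and hand the result to `missing_keys`. A non-empty result points at the third cause listed above.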
Re: [Users] Help - add a node ...
Hi,

I think the problem is that your engine hostname (FQDN) is not resolvable from your node. The node tries to download the deployUtil script, which is located on the engine machine.

Haim

On Jan 27, 2012, at 10:09, Alexandre Frandemiche slobberbone4...@gmail.com wrote:
> Hello,
>
> I am trying to add a node to my oVirt manager server. I already have one on the same machine with no problem: after installation the server rebooted, and then I could see it among my nodes. So that is OK. I would like to have another node on another server. I used the oVirt Hypervisor 2.2.1-3.6.fc16.iso to install it. Everything is configured. I can ping the server, and I added the manager on the node and accepted the SSL certificate. When I add this server in my oVirt Manager, I get a message which says: Install Failed ...
>
> Here are my logs:
>
> 2012-01-26 18:15:34,215 INFO [org.ovirt.engine.core.utils.hostinstall.MinaInstallWrapper] (pool-5-thread-50) Invoking chmod +x /tmp/vds_installer_64ff8b00-fd2f-4afd-a25b-d86fab01b0c3.py; /tmp/vds_installer_64ff8b00-fd2f-4afd-a25b-d86fab01b0c3.py -c 'ssl=true;management_port=54321' -O 'kvm' -t 2012-01-26T17:15:32 -u False -f /tmp/firewall.conf.64ff8b00-fd2f-4afd-a25b-d86fab01b0c3 -p 8080 http://kvm:8080/Components/vds/ http://kvm:8080/Components/vds/ 172.16.14.223 64ff8b00-fd2f-4afd-a25b-d86fab01b0c3 False on 172.16.14.223
> 2012-01-26 18:15:34,426 ERROR [org.ovirt.engine.core.bll.VdsInstaller] (pool-5-thread-50) Installation of 172.16.14.223. Recieved message: BSTRAP component='RHEV_INSTALL' status='OK' message='oVirt Node DETECTED'/ BSTRAP component='INSTALLER LIB' status='FAIL' message='deployUtil.py download failed. Pathname could not be resolved (verify computer/domain name).'/ BSTRAP component='INIT' status='FAIL' message='Error trying to deploy library.'/ BSTRAP component='RHEV_INSTALL' status='FAIL'/ . Error occured. (Stage: Running first installation script on Host)
> 2012-01-26 18:15:34,500 INFO [org.ovirt.engine.core.utils.hostinstall.MinaInstallWrapper] (pool-5-thread-50) RunSSHCommand returns true
> 2012-01-26 18:15:34,500 INFO [org.ovirt.engine.core.bll.VdsInstaller] (pool-5-thread-50) RunScript ended:true
> 2012-01-26 18:15:34,504 ERROR [org.ovirt.engine.core.bll.VdsInstaller] (pool-5-thread-50) Installation of 172.16.14.223. Operation failure. (Stage: Running first installation script on Host)
> 2012-01-26 18:15:34,508 INFO [org.ovirt.engine.core.bll.InstallVdsCommand] (pool-5-thread-50) After Installation pool-5-thread-50
> 2012-01-26 18:15:34,509 INFO [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] (pool-5-thread-50) START, SetVdsStatusVDSCommand(vdsId = 30f9327e-4836-11e1-953b-0024212cc302, status=InstallFailed, nonOperationalReason=NONE), log id: 1abe4139
> 2012-01-26 18:15:34,542 INFO [org.ovirt.engine.core.vdsbroker.SetVdsStatusVDSCommand] (pool-5-thread-50) FINISH, SetVdsStatusVDSCommand, log id: 1abe4139
>
> The IPs are OK, and the names are in /etc/hosts on my oVirt manager ... So I don't know why the install fails ... If you could help, I would appreciate it :)
>
> Thanks!
> Slobberbone
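To make the diagnosis above easier to repeat, here is a small sketch that does two things: it pulls the failed BSTRAP entries out of an engine.log excerpt, and it checks whether the host name in the installer URL resolves on the machine where it runs. The helper names (`failed_components`, `url_host`, `resolves`) are made up for this example. The regular expression assumes the BSTRAP entries look exactly as they do in the log quoted above.

```python
import re
import socket
from urllib.parse import urlparse

# Matches entries such as: BSTRAP component='INIT' status='FAIL' message='...'/
# The message attribute is optional (the last entry in the log has none).
BSTRAP_RE = re.compile(
    r"BSTRAP component='([^']*)' status='([^']*)'(?: message='([^']*)')?\s*/"
)


def failed_components(log_text):
    """Return (component, message) pairs for every BSTRAP entry with status FAIL."""
    return [(component, message)
            for component, status, message in BSTRAP_RE.findall(log_text)
            if status == "FAIL"]


def url_host(url):
    """Extract the host name from a URL, e.g. the one passed to vds_installer."""
    return urlparse(url).hostname


def resolves(hostname):
    """Return True if the local resolver can turn hostname into an address."""
    try:
        socket.getaddrinfo(hostname, None)
    except socket.gaierror:
        return False
    return True
```

In the log above the installer is given http://kvm:8080/Components/vds/, so the name to test is `kvm`. The check has to be run on the node, not on the engine: having the names in /etc/hosts on the manager does not help the node resolve them.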