Re: Max Number of VMs per Host
Hello, As you know, the number of Virtual Machines per host depends on available resources (CPU, RAM, Disk), as well as the demands of the image. As a general guide, I have found the following to be reasonable. Other admins, please share your opinions. For a single VM host with 12 virtual CPU and 98 GB RAM, I typically allocate 25 Windows VM. When necessary, I have raised it to 32 Windows XP machines. For linux based images, I have used 50 VM without issues. Monitor the performance tab on your vSphere client to see if the host is over-worked. On Virtual Machines that require higher performance, I usually will not place them on a host with more than 10 working VM. But the most important thing is to watch the performance closely. You will get a feel for how many VM per host by experience. As always, your mileage may vary. Regards, Al Quiros Florida International University From: Sanders, Arbin D asand...@nccu.edumailto:asand...@nccu.edu Reply-To: vcl-user@incubator.apache.orgmailto:vcl-user@incubator.apache.org vcl-user@incubator.apache.orgmailto:vcl-user@incubator.apache.org Date: Wednesday, May 23, 2012 3:53 PM To: 'vcl-...@incubator.apache.orgmailto:'vcl-...@incubator.apache.org' vcl-...@incubator.apache.orgmailto:vcl-...@incubator.apache.org, 'vcl-user@incubator.apache.orgmailto:'vcl-user@incubator.apache.org' vcl-user@incubator.apache.orgmailto:vcl-user@incubator.apache.org Subject: Max Number of VMs per Host All, What is the number of VMs do you all run per VMware host? Is it limited to the RAM limitation for vSphere 5? How many purchase vSphere licenses? Arbin Darren Sanders IT Manager – Academic Computing North Carolina Central University 712 Cecil Street Suite 3014 Durham, NC 27707 919.530.6307 919.530.5097 (Fax) For the Latest ITS Updates and Tips Join Us Online[Description: Description: Description: Description: cid:image003.png@01CB6AD0.97665900] http://www.facebook.com/profile.php?id=66100342#!/pages/Durham-NC/NCCU-Eagle-Technical-Assistance-Center-ETAC/249508718552?v=info [Description: Description: Description: Description: cid:image004.png@01CB6AD0.97665900] http://twitter.com/NCCUETAC CONFIDENTIALITY: This email (including any attachments) may contain confidential, proprietary and privileged information, and unauthorized disclosure or use is prohibited. If you received this email in error, please notify the sender and delete this e-mail from your system. __ This email has been scanned by the Symantec Email Security.cloud service. For more information please visit http://www.symanteccloud.com __ attachment: image001.pngattachment: image002.png
Rocks HPC and VCL
Hello, Has anyone tried to deploy a rocks-based HPC with vcl ? Rocks front end uses its own dhcp server to deploy its compute nodes. Will they interfere with the management node's dhcp, or other VCL functions ? Thanks, Al Quiros
Local and NFS VM Profiles
Hello, I have several VM servers using local storage. They are in use with several images stored on the management node. I want to add a few more VM servers with NFS storage. After adding them and setting the VM profile to Network storage, I cannot seem to reload my existing images from the management node to the server with NFS. I know that NFS VM profile doesn't store the images on the management node, but doesn't it try to pull the image from the management node if it does not exist in NFS ? Thanks, Al Quiros
reservation time out prevention
Hello, Is there a setting where I don't need to click the Connect button on reservations ? I want to prevent a reservation from timing out if I am too late to click the connect button to confirm the reservation in use. Perhaps get it to go to inuse immediately after reload, instead of it being reserved Is there something I can do to make that happen ? Thanks, Al Quiros
Persistent Images
Hello, Is there a way to specify an always persistent image on certain reservations ? I know to select Create/Update Image under Manage Images, but is there some way to say Every time THIS image is selected, make it persistent ? I want to create a group of always persistent images and subimages, so that when a user asks for a reservation, all the images and child images come up persistent. Thanks, Al Quiros
HPC cluster setup
Hello, Now that I have VCL working to provide single virtual machines, how do I setup a cluster of machines in VCL ? How do I setup the situation where a user can reserve a small cluster out of a group of resources (a head node and child compute nodes) ? Thanks, Al Quiros
Timed out reservations
Hello, Is there an easy way to end timed out reservations ? Once the reservations time out, the buttons to end or edit them disappear. Thanks, Al Quiros
Re: Xcat image capture issues
Thanks Dmitri, In following this article, I get to the bottom of the page, where I put in the 3 files needed for a stateless image. I place the 3 files in the located indicated, and when I run nodeset d6100a image I get this: Error: vmlinuz and initrd.img not found at /tftpboot/xcat/image/x86_64 Error: Some nodes failed to set up image resources, aborting But vmlinuz and initrd.img are at /tftpboot/xcat/image/x86_64 Is there something else that needs to happen first ? Also, the last step talks about NFS servers for the image store. Are these required ? Thanks for all your help, Al Quiros On 3/14/12 4:05 PM, Dmitri Chebotarov 4dim...@gmail.com wrote: Evelio Please follow this article to add support for 'image' and 'install' : https://cwiki.apache.org/VCL/adding-support-for-partimage-and-partimage-ng -to-xcat-2x-unofficial.html Thanks. -- Dmitri Chebotarov On Wednesday, March 14, 2012 at 15:37 , Evelio Quiros wrote: Hello again, Dimitri, thanks for the advice. I have a question about these commands. Below, you wrote nodeset d6100a image, But there is no image argument for nodeset. Here is the man page: nodeset [-h|--help|-v|--version] nodeset noderange [install|shell|boot|runcmd=bmcsetup|netboot|iscsiboot|osimage=imagename |s tatelite] When I try the command as-is, I get this: nodeset d6100a image d6100a: Error: Unable to identify plugin for this command, check relevant tables: nodetype.os Error: Some nodes failed to set up image resources, aborting I have done tabdump on the various tables, and done tabedit to modify them to my environment. I know this is not an xCAT list, but your guidance would be appreciated. Thanks, Al Quiros On 3/13/12 4:45 PM, Dmitri Chebotarov dcheb...@gmu.edu (mailto:dcheb...@gmu.edu) wrote: Evelio, You should be able to capture image via xCAT before you can use VCL/xCAT. To capture image via xCAT: nodech d6100a nodetype.os=image nodetype.profile=rh5image-rh66-v0 nodeset d6100a image - this will create PXEboot config file for d6100a: cat /tftpboot/pxelinux.cfg/d6100a-PXE-if-mac #image rh5image-rh66-v0 DEFAULT xCAT LABEL xCAT KERNEL xcat/image/x86_64/vmlinuz APPEND initrd=xcat/image/x86_64/initrd.img imgurl=http://xcat-master/install/image/x86_64/installer_files/rootimg.gz image=/install/image/x86_64/rh5image-rh66-v0.img blocks=512 action=save installnic=eth0 reboot noipv6 IPAPPEND 2 'tabdump nodetype' should look similar to this - note that arch is x86: #node,os,arch,profile,provmethod,supportedarchs,nodetype,comments,disable ... d6100a,image,x86,rh5image-rh66-v0,image,x86,x86_64,,, ... Then do 'rpower d6100a reset'. This will restart d6100a and when node boots over PXE it will boot into http://xcat-master/install/image/x86_64/installer_files/rootimg.gz and, given that image works fine and maps NFS store OK, it dump hd image to /install/image/x86_64/rh5image-rh66-v0.img . If it's successful then to install from the image: nodeset d6100a install 'tabdump nodetype' should look similar to this - provmethod changes from image to install: #node,os,arch,profile,provmethod,supportedarchs,nodetype,comments,disable ... d6100a,image,x86,rh5image-rh66-v0,install,x86,x86_64,,, ... If these steps work OK, then you can use VCL to capture bare-metal image. VCL uses 'nodeset node image' and 'nodeset node install' commands, so it's good idea to make them work before using VCL. Thanks. On Mar 13, 2012, at 16:11 , Evelio Quiros wrote: Hello Again, After trying again to capture a running image on a bare metal node, I get a failure message: Any ideas ? Al Q Added new image to database: 'rh6' image.name (http://image.name): rh5image-rh66-v0 image.id: 6 imagerevision.id: 6 imagemeta.id: 3 resource.id: 16 - -- - Inserted imaging request to the database: request ID: 1 reservation ID: 1 This process will now display the contents of the vcld.log file if the vcld daemon is running. If you do not see many lines of additional output, exit this process, start the vcld daemon, and monitor the image capture process by running the command: tail -f /var/log/vcld.log | grep '1:1' - -- - 2012-03-13 16:03:34|2012|vcld:main(167)|lastcheckin time updated for management node 1: 2012-03-13 16:03:34 2012-03-13 16:03:39|6899|utils.pm:setup_get_array_choice(9947)|choices argument: 2012-03-13 16:03:39|2012|vcld:main(167)|lastcheckin time updated for management node 1: 2012-03-13 16:03:39 2012-03-13 16:03:44|2012|vcld:main(167)|lastcheckin time updated for management node 1: 2012-03-13 16:03:44 2012-03-13 16:03:48|6899
Re: Xcat image capture issues
Hello, Yes, both listener.py and rootimg.gz are both located at /install/image/x86/installer_files/ But NOWHERE on the documentation page does it mention a symlink between x86_64 and x86. Ok, I added the symlinks in both locations, and it did seem to accept it: nodeset d6100a image d6100a: image image-x86_64-rh5image-rh66-v0 Which created this file in /tftpboot/pxelinux.cfg: lrwxrwxrwx 1 root root 6 Mar 15 11:36 0A19 - d6100a cat d6100a #image image-x86_64-rh5image-rh66-v0 DEFAULT xCAT LABEL xCAT KERNEL xcat/image/x86_64/vmlinuz APPEND initrd=xcat/image/x86_64/initrd.img imgurl=http://10.0.0.3//install/image/x86_64/installer_files/rootimg.gz image=/install/image/x86_64/rh5image-rh66-v0.img blocks=512 action=save installnic=eth0 reboot noipv6 IPAPPEND 2 I did a tabedit to add the nodetype.supportedarchs. tabdump nodetype #node,os,arch,profile,provmethod,supportedarchs,nodetype,comments,disable compute,rh6.0,x86_64,compute,,,osi,, d6100a,image,x86,rh5image-rh66-v0,image,x86,x86_64,,, However, when I do this: rpower d6100a reset I get this: d6100a: Error: timeout So, I guess I must be missing something that tells xCAT where the machine is, and how to reach it. Al Q On 3/15/12 11:05 AM, Dmitri Chebotarov dcheb...@gmu.edu wrote: Evelio Could you please check that you have these files/folders as well? /install/image/x86/installer_files/listener.py /install/image/x86/installer_files/rootimg.gz Also, can you check that there are symlinks for x86 or x86_64 in /install/image/ and /tftpboot/xcat/image/, ie: $ ls -l x86 x86_64 - x86 Thanks. On Mar 15, 2012, at 9:56 , Evelio Quiros wrote: Thanks Dmitri, In following this article, I get to the bottom of the page, where I put in the 3 files needed for a stateless image. I place the 3 files in the located indicated, and when I run nodeset d6100a image I get this: Error: vmlinuz and initrd.img not found at /tftpboot/xcat/image/x86_64 Error: Some nodes failed to set up image resources, aborting But vmlinuz and initrd.img are at /tftpboot/xcat/image/x86_64 Is there something else that needs to happen first ? Also, the last step talks about NFS servers for the image store. Are these required ? Thanks for all your help, Al Quiros On 3/14/12 4:05 PM, Dmitri Chebotarov 4dim...@gmail.com wrote: Evelio Please follow this article to add support for 'image' and 'install' : https://cwiki.apache.org/VCL/adding-support-for-partimage-and-partimage- ng -to-xcat-2x-unofficial.html Thanks. -- Dmitri Chebotarov On Wednesday, March 14, 2012 at 15:37 , Evelio Quiros wrote: Hello again, Dimitri, thanks for the advice. I have a question about these commands. Below, you wrote nodeset d6100a image, But there is no image argument for nodeset. Here is the man page: nodeset [-h|--help|-v|--version] nodeset noderange [install|shell|boot|runcmd=bmcsetup|netboot|iscsiboot|osimage=imagenam e |s tatelite] When I try the command as-is, I get this: nodeset d6100a image d6100a: Error: Unable to identify plugin for this command, check relevant tables: nodetype.os Error: Some nodes failed to set up image resources, aborting I have done tabdump on the various tables, and done tabedit to modify them to my environment. I know this is not an xCAT list, but your guidance would be appreciated. Thanks, Al Quiros On 3/13/12 4:45 PM, Dmitri Chebotarov dcheb...@gmu.edu (mailto:dcheb...@gmu.edu) wrote: Evelio, You should be able to capture image via xCAT before you can use VCL/xCAT. To capture image via xCAT: nodech d6100a nodetype.os=image nodetype.profile=rh5image-rh66-v0 nodeset d6100a image - this will create PXEboot config file for d6100a: cat /tftpboot/pxelinux.cfg/d6100a-PXE-if-mac #image rh5image-rh66-v0 DEFAULT xCAT LABEL xCAT KERNEL xcat/image/x86_64/vmlinuz APPEND initrd=xcat/image/x86_64/initrd.img imgurl=http://xcat-master/install/image/x86_64/installer_files/rootimg. gz image=/install/image/x86_64/rh5image-rh66-v0.img blocks=512 action=save installnic=eth0 reboot noipv6 IPAPPEND 2 'tabdump nodetype' should look similar to this - note that arch is x86: #node,os,arch,profile,provmethod,supportedarchs,nodetype,comments,disab le ... d6100a,image,x86,rh5image-rh66-v0,image,x86,x86_64,,, ... Then do 'rpower d6100a reset'. This will restart d6100a and when node boots over PXE it will boot into http://xcat-master/install/image/x86_64/installer_files/rootimg.gz and, given that image works fine and maps NFS store OK, it dump hd image to /install/image/x86_64/rh5image-rh66-v0.img . If it's successful then to install from the image: nodeset d6100a install 'tabdump nodetype' should look similar to this - provmethod changes from image to install: #node,os,arch,profile,provmethod,supportedarchs,nodetype,comments,disab le ... d6100a,image,x86,rh5image-rh66-v0,install,x86,x86_64
Re: Permanent reservations ?
That was too easyŠ.Thank you ! Q On 3/14/12 9:29 AM, Josh Thompson josh_thomp...@ncsu.edu wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 There is a user group named Specify End Time. Adding users to this group will allow them to manually set the end time of the reservation to anything. Josh On Wednesday, March 14, 2012 1:22:23 PM Evelio Quiros wrote: Hello, How do you setup reservations so that they never end, until the user decides to end them ? I have setup an image that does not check the the user login, and I increased the reservation time to 8640 minutes (6 days) for just a certain group. How do I setup a group that has never ending reservations ? Thanks, Al Quiros Florida International University - -- - --- Josh Thompson VCL Developer North Carolina State University my GPG/PGP key can be found at pgp.mit.edu All electronic mail messages in connection with State business which are sent to or received by this account are subject to the NC Public Records Law and may be disclosed to third parties. -BEGIN PGP SIGNATURE- Version: GnuPG v2.0.17 (GNU/Linux) iEYEARECAAYFAk9gnSIACgkQV/LQcNdtPQPNiQCeJ12nST8JXgQjnGauMn1xz/+r kvQAnjIupEp1bRk4KpzJG2j4SRsTdJX8 =oAar -END PGP SIGNATURE-
Re: Xcat image capture issues
Hello again, Dimitri, thanks for the advice. I have a question about these commands. Below, you wrote nodeset d6100a image, But there is no image argument for nodeset. Here is the man page: nodeset [-h|--help|-v|--version] nodeset noderange [install|shell|boot|runcmd=bmcsetup|netboot|iscsiboot|osimage=imagename|s tatelite] When I try the command as-is, I get this: nodeset d6100a image d6100a: Error: Unable to identify plugin for this command, check relevant tables: nodetype.os Error: Some nodes failed to set up image resources, aborting I have done tabdump on the various tables, and done tabedit to modify them to my environment. I know this is not an xCAT list, but your guidance would be appreciated. Thanks, Al Quiros On 3/13/12 4:45 PM, Dmitri Chebotarov dcheb...@gmu.edu wrote: Evelio, You should be able to capture image via xCAT before you can use VCL/xCAT. To capture image via xCAT: nodech d6100a nodetype.os=image nodetype.profile=rh5image-rh66-v0 nodeset d6100a image - this will create PXEboot config file for d6100a: cat /tftpboot/pxelinux.cfg/d6100a-PXE-if-mac #image rh5image-rh66-v0 DEFAULT xCAT LABEL xCAT KERNEL xcat/image/x86_64/vmlinuz APPEND initrd=xcat/image/x86_64/initrd.img imgurl=http://xcat-master/install/image/x86_64/installer_files/rootimg.gz image=/install/image/x86_64/rh5image-rh66-v0.img blocks=512 action=save installnic=eth0 reboot noipv6 IPAPPEND 2 'tabdump nodetype' should look similar to this - note that arch is x86: #node,os,arch,profile,provmethod,supportedarchs,nodetype,comments,disable ... d6100a,image,x86,rh5image-rh66-v0,image,x86,x86_64,,, ... Then do 'rpower d6100a reset'. This will restart d6100a and when node boots over PXE it will boot into http://xcat-master/install/image/x86_64/installer_files/rootimg.gz and, given that image works fine and maps NFS store OK, it dump hd image to /install/image/x86_64/rh5image-rh66-v0.img . If it's successful then to install from the image: nodeset d6100a install 'tabdump nodetype' should look similar to this - provmethod changes from image to install: #node,os,arch,profile,provmethod,supportedarchs,nodetype,comments,disable ... d6100a,image,x86,rh5image-rh66-v0,install,x86,x86_64,,, ... If these steps work OK, then you can use VCL to capture bare-metal image. VCL uses 'nodeset node image' and 'nodeset node install' commands, so it's good idea to make them work before using VCL. Thanks. On Mar 13, 2012, at 16:11 , Evelio Quiros wrote: Hello Again, After trying again to capture a running image on a bare metal node, I get a failure message: Any ideas ? Al Q Added new image to database: 'rh6' image.name: rh5image-rh66-v0 image.id: 6 imagerevision.id: 6 imagemeta.id: 3 resource.id: 16 - -- - Inserted imaging request to the database: request ID: 1 reservation ID: 1 This process will now display the contents of the vcld.log file if the vcld daemon is running. If you do not see many lines of additional output, exit this process, start the vcld daemon, and monitor the image capture process by running the command: tail -f /var/log/vcld.log | grep '1:1' - -- - 2012-03-13 16:03:34|2012|vcld:main(167)|lastcheckin time updated for management node 1: 2012-03-13 16:03:34 2012-03-13 16:03:39|6899|utils.pm:setup_get_array_choice(9947)|choices argument: 2012-03-13 16:03:39|2012|vcld:main(167)|lastcheckin time updated for management node 1: 2012-03-13 16:03:39 2012-03-13 16:03:44|2012|vcld:main(167)|lastcheckin time updated for management node 1: 2012-03-13 16:03:44 2012-03-13 16:03:48|6899|utils.pm:insert_request(8358)|inserted new reload request into request table, request id=1 2012-03-13 16:03:48|6899|utils.pm:insert_request(8389)|inserted new reload request into reservation table, reservation id=1 2012-03-13 16:03:50|2012|vcld:main(167)|lastcheckin time updated for management node 1: 2012-03-13 16:03:49 2012-03-13 16:03:50|2012|1:1|image|utils.pm:reservation_being_processed(8938)|comput er loadlog 'begin' entry does NOT exist for reservation 1 2012-03-13 16:03:50|2012|1:1|image|utils.pm:run_command(9010)|executed command: pgrep -fl 'vcld [0-9]+:1 ', pid: 7004, exit status: 0, output: 2012-03-13 16:03:50|2012|1:1|image|utils.pm:is_management_node_process_running(9192) |p rocess is NOT running, identifier: 'vcld [0-9]+:1 ' 2012-03-13 16:03:50|2012|1:1|image|utils.pm:reservation_being_processed(8959)|reserv at ion is NOT currently being processed 2012-03-13 16:03:50|2012|1:1|image|vcld:main(278)|reservation 1 is NOT already being processed 2012-03-13 16:03:50|2012|1:1|image|utils.pm:get_request_info(4520)|standalone affiliation found: Local 2012-03-13 16:03:50|2012|1:1
Xcat image capture issues
Hello, After working a while with VmWare images, we have started experimenting with Xcat bare metal images. To accommodate the added level of complexity with Xcat, I created a separate development environment with VCL and some dell 6100 high density nodes. I have installed RHEL 6.1 with Xcat on all the nodes. Following the instructions on the doc page, I installed partimage to the xcat installation. I think its correct, except that these are not IBM blades, so I am unsure how to build a stateless image on this hardware. Just attempting to capture the image as-is failed, and the error messages are below. Is there any more documentation on this process, and how to make Xcat work with VCL? Being new to Xcat, I guess that I am missing some background material on what is supposed to happen here. For example, the error log states that /opt/xcat does not exist, but it does exist on the node I was attempting to capture. Does Xcat need to be on the management node too ? The information I am using is located at: http://sourceforge.net/apps/mediawiki/xcat/index.php?title=Setting_Up_a_Linux_xCAT_Mgmt_Node And https://cwiki.apache.org/VCL/adding-support-for-partimage-and-partimage-ng-to-xcat-2x-unofficial.html Any other information available ? Thanks, Al Quiros Florida International University ** Error message from my attempt at capturing the image below: VCL::image object could not be created and initialized time: 2012-03-07 14:49:31 caller: vcld:make_new_child(571) ( 0) vcld, make_new_child (line: 571) (-1) vcld, main (line: 346) management node: vcldev01.p.fiu.edu reservation PID: 12372 parent vcld PID: 2044 request ID: 1 reservation ID: 1 request state/laststate: image/image request start time: 2012-03-07 14:49:30 request end time: 2012-03-07 15:49:30 for imaging: no log ID: none computer: 6100a computer id: 1 computer type: blade computer eth0 MAC address: undefined computer eth1 MAC address: undefined computer private IP address: 10.0.0.25 computer public IP address: 10.106.128.14 computer in block allocation: no provisioning module: VCL::Module::Provisioning::xCAT2 image: rh5image-rh6xcat5-v0 image display name: rh6xcat image ID: 5 image revision ID: 5 image size: 1450 MB use Sysprep: yes root access: yes image owner ID: 1 image owner affiliation: Local image revision date created: 2012-03-07 14:49:29 image revision production: yes OS module: VCL::Module::OS::Linux user: admin user name: vcl admin user ID: 1 user affiliation: Local RECENT LOG ENTRIES FOR THIS PROCESS: 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'id' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'PID' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'DURATION' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'PPID' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'datemodified' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'RESERVATIONCOUNT' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'end' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'stateid' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'PARENTIMAGE' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'checkuser' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'preload' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'NOTICEINTERVAL' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'start' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(172)|set 'logid' key for VCL::image object from arguments 2012-03-07 14:49:31|12372|1:1|image|Module.pm:create_mn_os_object(361)|VCL::Module::OS::Linux::ManagementNode module loaded 2012-03-07 14:49:31|12372|1:1|image|Module.pm:new(196)|VCL::Module::OS::Linux::ManagementNode object created for image not set, address: 2417400 2012-03-07 14:49:31|12372|1:1|image|DataStructure.pm:_automethod(834)|data structure updated: $self-request_data-{reservation}{0}{computer}{hostname} |12372|1:1|image| computer_hostname = vcldev01.p.fiu.edu 2012-03-07 14:49:31|12372|1:1|image|DataStructure.pm:_automethod(834)|data structure updated: $self-request_data-{reservation}{0}{computer}{NODENAME}
Shibboleth Authentication in VCL
Hello, We are trying to configure Shibboleth for VCL. We have gotten to the point where a user is able to log in successfully, so the $_SERVER values are ok, but it does not add the new user into the database. Doing a search for that username shows nothing, even though we were able to login successfully. Consequently, the logged in user has no privileges, and cannot make reservations. Any ideas on what we may be missing ? Thanks, Al Quiros Florida International University
Re: Shibboleth Authentication in VCL
On 2/23/12 3:08 PM, Josh Thompson josh_thomp...@ncsu.edu wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Al, Are you using Shibboleth by itself or in conjunction with LDAP? If not with LDAP, do you have affiliation.shibonly set to 1 for any affiliations logging in with Shibboleth? Josh On Thursday 23 February 2012 7:06:40 PM Evelio Quiros wrote: Hello, We are trying to configure Shibboleth for VCL. We have gotten to the point where a user is able to log in successfully, so the $_SERVER values are ok, but it does not add the new user into the database. Doing a search for that username shows nothing, even though we were able to login successfully. Consequently, the logged in user has no privileges, and cannot make reservations. Any ideas on what we may be missing ? Thanks, Al Quiros Florida International University - -- - --- Josh Thompson VCL Developer North Carolina State University my GPG/PGP key can be found at pgp.mit.edu All electronic mail messages in connection with State business which are sent to or received by this account are subject to the NC Public Records Law and may be disclosed to third parties. -BEGIN PGP SIGNATURE- Version: GnuPG v2.0.17 (GNU/Linux) iEYEARECAAYFAk9GnL0ACgkQV/LQcNdtPQNvqACfeE54okUDqMmymEFZoA99MyS2 MfoAnjmhgE6Q3ZYUi9OVZZat1/U/8gmd =O91X -END PGP SIGNATURE-
Re: Shibboleth Authentication in VCL
Ok, we see two affiliations in the database, Local Global. Both were set to 0. We are using Shibboleth without LDAP, since Shib uses LDAP in it's back end authentication. We set the Global shibonly field to 1 and tried it. It still behaves the same, no new users are entered in the database. Thanks, Al Quiros Florida International University On 2/23/12 3:08 PM, Josh Thompson josh_thomp...@ncsu.edu wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Al, Are you using Shibboleth by itself or in conjunction with LDAP? If not with LDAP, do you have affiliation.shibonly set to 1 for any affiliations logging in with Shibboleth? Josh On Thursday 23 February 2012 7:06:40 PM Evelio Quiros wrote: Hello, We are trying to configure Shibboleth for VCL. We have gotten to the point where a user is able to log in successfully, so the $_SERVER values are ok, but it does not add the new user into the database. Doing a search for that username shows nothing, even though we were able to login successfully. Consequently, the logged in user has no privileges, and cannot make reservations. Any ideas on what we may be missing ? Thanks, Al Quiros Florida International University - -- - --- Josh Thompson VCL Developer North Carolina State University my GPG/PGP key can be found at pgp.mit.edu All electronic mail messages in connection with State business which are sent to or received by this account are subject to the NC Public Records Law and may be disclosed to third parties. -BEGIN PGP SIGNATURE- Version: GnuPG v2.0.17 (GNU/Linux) iEYEARECAAYFAk9GnL0ACgkQV/LQcNdtPQNvqACfeE54okUDqMmymEFZoA99MyS2 MfoAnjmhgE6Q3ZYUi9OVZZat1/U/8gmd =O91X -END PGP SIGNATURE-
Occasional Reservation Failures Performance Tuning
Hello, We have a small VCL system serving 200 images on 8 VMware servers. Our system seems to operate pretty well, except that we occasionally get reservation failures. The servers (web, mgt, db) do not appear overloaded, max at about 10% utilization. Out of about 200 reservations (roughly simultaneous after a block allocation), about 4 reservations fail. These failures usually indicate failed to update private IP address. I have included some of the failure entries from vcld.log below. What can we do to improve the reliability and performance of our VCL system ? Thanks, Al Quiros Florida International University |4443|2053:2053|new| 2012-01-17 10:24:25|4443|2053:2053|new|OS.pm:update_public_ip_address(608)|failed to retrieve dynamic public IP address from vclimg9 2012-01-17 10:24:25|4443|2053:2053|new|utils.pm:insertloadlog(3875)|inserted computer=20, dynamicDHCPaddress, failed to retrieve dynamic public IP address from vclimg9 |4443|2053:2053|new| 2012-01-17 10:24:25|4443|2053:2053|new|State.pm:reservation_failed(213)|reservation failed on vclimg9: failed to update private IP address |4443|2053:2053|new| ( 0) State.pm, reservation_failed (line: 213) 2012-01-17 10:24:26|4443|2053:2053|new|utils.pm:insertloadlog(3875)|inserted computer=20, failed, failed to update private IP address 2012-01-17 10:24:26|4443|2053:2053|new|State.pm:reservation_failed(216)|inserted computerloadlog entry 2012-01-17 10:24:26|4443|2053:2053|new|State.pm:reservation_failed(224)|updated log ending value to 'failed', logid=474 2012-01-17 10:24:26|4443|2053:2053|new|utils.pm:update_computer_state(2033)|computer 20 state updated to: failed 2012-01-17 10:24:26|4443|2053:2053|new|State.pm:reservation_failed(235)|computer vclimg9 (20) state set to failed 2012-01-17 10:24:26|4443|2053:2053|new|utils.pm:update_request_state(1991)|request 2053 state updated to: failed, laststate to: new 2012-01-17 10:24:26|4443|2053:2053|new|State.pm:reservation_failed(248)|set request state to 'failed'/'new' 2012-01-17 10:24:26|4443|2053:2053|new|State.pm:reservation_failed(257)|vclimg9 in blockcomputers table 2012-01-17 10:24:26|4443|2053:2053|new|State.pm:reservation_failed(258)|removed vclimg9 from blockcomputers table 2012-01-17 10:24:26|4443|2053:2053|new|State.pm:reservation_failed(269)|exiting 1 |32292|1990:1990|reload| 2012-01-17 10:26:19|32292|1990:1990|reload|OS.pm:wait_for_response(465)|failed to connect to vclimg148 via SSH after 600 seconds |32292|1990:1990|reload| 2012-01-17 10:26:19|32292|1990:1990|reload|VMware.pm:load(419)|failed to perform OS post-load tasks on VM vclimg148 on VM host: idp06.fiu.edu |32292|1990:1990|reload| 2012-01-17 10:26:19|32292|1990:1990|reload|new.pm:reload_image(623)|vmwarelinux-LinuxBase19-v0 failed to load on vclimg148, returning 2012-01-17 10:26:19|32292|1990:1990|reload|utils.pm:insertloadlog(3875)|inserted computer=166, loadimagefailed, vmwarelinux-LinuxBase19-v0 failed to load on vclimg148 |32292|1990:1990|reload| 2012-01-17 10:26:19|32292|1990:1990|reload|new.pm:process(295)|failed to load vclimg148 with vmwarelinux-LinuxBase19-v0 |32292|1990:1990|reload| 2012-01-17 10:26:19|32292|1990:1990|reload|State.pm:reservation_failed(213)|reservation failed on vclimg148: process failed after trying to load or make available |32292|1990:1990|reload| ( 0) State.pm, reservation_failed (line: 213) 2012-01-17 10:26:20|32292|1990:1990|reload|utils.pm:insertloadlog(3875)|inserted computer=166, failed, process failed after trying to load or make available 2012-01-17 10:26:20|32292|1990:1990|reload|State.pm:reservation_failed(216)|inserted computerloadlog entry 2012-01-17 10:26:20|32292|1990:1990|reload|utils.pm:update_computer_state(2033)|computer 166 state updated to: failed 2012-01-17 10:26:20|32292|1990:1990|reload|State.pm:reservation_failed(235)|computer vclimg148 (166) state set to failed 2012-01-17 10:26:20|32292|1990:1990|reload|utils.pm:update_request_state(1991)|request 1990 state updated to: failed, laststate to: reload 2012-01-17 10:26:20|32292|1990:1990|reload|State.pm:reservation_failed(248)|set request state to 'failed'/'reload' 2012-01-17 10:26:20|32292|1990:1990|reload|State.pm:reservation_failed(266)|vclimg148 is NOT in blockcomputers table 2012-01-17 10:26:20|32292|1990:1990|reload|State.pm:reservation_failed(269)|exiting 1 2012-01-17 10:26:20|2218|1990:1990|failed|vcld:main(252)|request deleted |6463|2090:2090|new| 2012-01-17 10:28:47|6463|2090:2090|new|Linux.pm:get_network_configuration(2635)|failed to determine the public interface name |6463|2090:2090|new| 2012-01-17 10:28:47|6463|2090:2090|new|Linux.pm:get_public_ip_address(2736)|failed to retrieve public network configuration |6463|2090:2090|new| 2012-01-17 10:28:47|6463|2090:2090|new|OS.pm:update_public_ip_address(608)|failed to retrieve dynamic public IP address from vclimg128 2012-01-17 10:28:47|6463|2090:2090|new|utils.pm:insertloadlog(3875)|inserted computer=146,
Re: Resource Group Error
Thanks ! Al Q On 1/3/12 11:36 AM, Dmitri Chebotarov dcheb...@gmu.edu wrote: Al Take at look at https://issues.apache.org/jira/browse/VCL-456 You need to add $groupwasnone = 0; line to groups.php. Thanks. On Jan 3, 2012, at 11:28 , Evelio Quiros wrote: Hello, I am getting this error on the VCL web interface when I try to edit a resource group: Error encountered Unkown error type: [8] Undefined variable: groupwasnone Error at 511 in /var/www/html/vcl/.ht-inc/groups.php Any idea what this means ? Thanks, Al Quiros Thank you. - Dmitri Chebotarov dcheb...@gmu.edu 703-993-6175 -
Re: Capturing Windows 2003 Server Image
That was it ! Thank you. Windows 2003 Server was successfully captured. I would not have thought that the Windows Firewall was an issue. Thanks again, Al Quiros On 12/28/11 8:50 AM, Andy Kurth andy_ku...@ncsu.edu wrote: I have seen this before. For some reason, the firewall isn't enabled by default for new Windows 2003 installations and the firewall services aren't running. Enable the firewall via the Control Panel. Make sure TCP port 22 is open and that SSH is still working and then restart the capture process. Don't rerun 'vcld -setup' once you have run it once for a given base image because it will result in extra unnecessary image entries in the database. Instead, manually change the request table in the database. Find the imaging request that was created by 'vcld -setup'. The request.state value will be 10 (maintenance) and request.laststate will be 16 (image). change request.state to 16 and the normal running vcld process will restart the capture. -Andy On Tue, Dec 27, 2011 at 3:11 PM, Evelio Quiros evq...@fiu.edu wrote: Hello, Happy New Year to you all. I am having trouble capturing a Windows 2003 Server image. I setup Windows 2003 on the VM server and installed and configured Cygwin. I setup the gen-keys, and my Management node can SSH to the server fine without a password. When I tried to capture the new base image, it begins the process, and a little while later it reports having failed. I had been able to capture a linux image and a windows XP base image without any trouble, so I know VCL itself is working ok. Any ideas on what might be wrong ? Included below is the last part of the vcl logŠ. 2011-12-27 14:42:28|16120|34:34|image|Module.pm:code_loop_timeout(759)|attempt 9: waiting for vclimg12 to respond to SSH 2011-12-27 14:42:34|13511|vcld:main(167)|lastcheckin time updated for management node 1: 2011-12-27 14:42:34 2011-12-27 14:42:41|16120|34:34|image|OS.pm:is_ssh_responding(401)|vclimg12 is responding to SSH, port 22: open, port 24: closed 2011-12-27 14:42:41|16120|34:34|image|Module.pm:code_loop_timeout(762)|waiting for vclimg12 to respond to SSH, code returned true 2011-12-27 14:42:41|16120|34:34|image|Windows.pm:reboot(3454)|vclimg12 responded to ssh 2011-12-27 14:42:41|16120|34:34|image|Windows.pm:reboot(3471)|reboot complete on vclimg12, took 318 seconds 2011-12-27 14:42:41|16120|34:34|image|Windows.pm:disable_pagefile(2077)|computer was rebooted after disabling pagefile in the registry 2011-12-27 14:42:41|16120|34:34|image|utils.pm:run_ssh_command(5380)|executing SSH command on vclimg12: |16120|34:34|image| /usr/bin/ssh -i /etc/vcl/vcl.key -o StrictHostKeyChecking=no -l root -p 22 -x vclimg12 'for i in `ls /cygdrive 2/dev/null`; do echo $i; done;' 21 2011-12-27 14:42:42|16120|34:34|image|Windows.pm:get_volume_list(7958)|retrieved drive letter list under /cygdrive: |16120|34:34|image| c 2011-12-27 14:42:45|16120|34:34|image|Windows.pm:file_exists(1340)|file exists on vclimg12: 'c:/pagefile.sys' 2011-12-27 14:42:45|16120|34:34|image|Windows.pm:delete_file(1102)|attempting to delete file: 'c:/pagefile.sys' 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:file_exists(1336)|file does NOT exist on vclimg12: 'c:/pagefile.sys' 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:delete_file(1157)|deleted file: 'c:/pagefile.sys' 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:disable_pagefile(2094)|deleted pagefile.sys on all c: 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5040)|net work configuration has already been retrieved 2011-12-27 14:43:32|16120|34:34|image|DataStructure.pm:get_computer_private_ip_addre ss(1557)|returning private IP address previously retrieved: 10.0.0.42 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5075)|pri vate interface found: Local Area Connection, description: Intel(R) PRO/1000 MT Network Connection, address(es): 10.0.0.42 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_private_interface_name(5177)|re turning private interface name: Local Area Connection 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:firewall_disable_rdp(4856)|RDP will be disabled on private interface: Local Area Connection 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5040)|net work configuration has already been retrieved 2011-12-27 14:43:32|16120|34:34|image|DataStructure.pm:get_computer_private_ip_addre ss(1557)|returning private IP address previously retrieved: 10.0.0.42 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5075)|pri vate interface found: Local Area Connection, description: Intel(R) PRO/1000 MT Network Connection, address(es): 10.0.0.42 2011-12-27 14:43:32|16120|34:34|image|utils.pm:is_valid_ip_address(9266)|IP address is valid: 10.106.128.49 2011-12-27 14:43:32|16120|34:34|image|utils.pm:is_public_ip_address(9375)|private IP address: 10.106.128.49
Capturing Windows 2003 Server Image
Hello, Happy New Year to you all. I am having trouble capturing a Windows 2003 Server image. I setup Windows 2003 on the VM server and installed and configured Cygwin. I setup the gen-keys, and my Management node can SSH to the server fine without a password. When I tried to capture the new base image, it begins the process, and a little while later it reports having failed. I had been able to capture a linux image and a windows XP base image without any trouble, so I know VCL itself is working ok. Any ideas on what might be wrong ? Included below is the last part of the vcl log…. 2011-12-27 14:42:28|16120|34:34|image|Module.pm:code_loop_timeout(759)|attempt 9: waiting for vclimg12 to respond to SSH 2011-12-27 14:42:34|13511|vcld:main(167)|lastcheckin time updated for management node 1: 2011-12-27 14:42:34 2011-12-27 14:42:41|16120|34:34|image|OS.pm:is_ssh_responding(401)|vclimg12 is responding to SSH, port 22: open, port 24: closed 2011-12-27 14:42:41|16120|34:34|image|Module.pm:code_loop_timeout(762)|waiting for vclimg12 to respond to SSH, code returned true 2011-12-27 14:42:41|16120|34:34|image|Windows.pm:reboot(3454)|vclimg12 responded to ssh 2011-12-27 14:42:41|16120|34:34|image|Windows.pm:reboot(3471)|reboot complete on vclimg12, took 318 seconds 2011-12-27 14:42:41|16120|34:34|image|Windows.pm:disable_pagefile(2077)|computer was rebooted after disabling pagefile in the registry 2011-12-27 14:42:41|16120|34:34|image|utils.pm:run_ssh_command(5380)|executing SSH command on vclimg12: |16120|34:34|image| /usr/bin/ssh -i /etc/vcl/vcl.key -o StrictHostKeyChecking=no -l root -p 22 -x vclimg12 'for i in `ls /cygdrive 2/dev/null`; do echo $i; done;' 21 2011-12-27 14:42:42|16120|34:34|image|Windows.pm:get_volume_list(7958)|retrieved drive letter list under /cygdrive: |16120|34:34|image| c 2011-12-27 14:42:45|16120|34:34|image|Windows.pm:file_exists(1340)|file exists on vclimg12: 'c:/pagefile.sys' 2011-12-27 14:42:45|16120|34:34|image|Windows.pm:delete_file(1102)|attempting to delete file: 'c:/pagefile.sys' 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:file_exists(1336)|file does NOT exist on vclimg12: 'c:/pagefile.sys' 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:delete_file(1157)|deleted file: 'c:/pagefile.sys' 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:disable_pagefile(2094)|deleted pagefile.sys on all c: 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5040)|network configuration has already been retrieved 2011-12-27 14:43:32|16120|34:34|image|DataStructure.pm:get_computer_private_ip_address(1557)|returning private IP address previously retrieved: 10.0.0.42 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5075)|private interface found: Local Area Connection, description: Intel(R) PRO/1000 MT Network Connection, address(es): 10.0.0.42 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_private_interface_name(5177)|returning private interface name: Local Area Connection 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:firewall_disable_rdp(4856)|RDP will be disabled on private interface: Local Area Connection 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5040)|network configuration has already been retrieved 2011-12-27 14:43:32|16120|34:34|image|DataStructure.pm:get_computer_private_ip_address(1557)|returning private IP address previously retrieved: 10.0.0.42 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5075)|private interface found: Local Area Connection, description: Intel(R) PRO/1000 MT Network Connection, address(es): 10.0.0.42 2011-12-27 14:43:32|16120|34:34|image|utils.pm:is_valid_ip_address(9266)|IP address is valid: 10.106.128.49 2011-12-27 14:43:32|16120|34:34|image|utils.pm:is_public_ip_address(9375)|private IP address: 10.106.128.49, returning 0 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5117)|interface found with non-public address not matching private address for reservation: Local Area Connection 2, address(es): 10.106.128.49 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_network_configuration(5124)|interface will be returned if another with a public address isn't found 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:get_public_interface_name(5207)|returning public interface name: Local Area Connection 2 2011-12-27 14:43:32|16120|34:34|image|Windows.pm:firewall_disable_rdp(4872)|RDP will be disabled on public interface: Local Area Connection 2 2011-12-27 14:43:32|16120|34:34|image|utils.pm:run_ssh_command(5380)|executing SSH command on vclimg12: |16120|34:34|image| /usr/bin/ssh -i /etc/vcl/vcl.key -o StrictHostKeyChecking=no -l root -p 22 -x vclimg12 'C:/Windows/System32/netsh.exe firewall delete portopening protocol = TCP port = 3389 interface = Local Area Connection ;C:/Windows/System32/netsh.exe firewall delete portopening protocol = TCP port = 3389 interface = Local
Re: Images and Connections
Hello everybody, I was able to solve the problem with the SSH session closing immediately. The issue was SELINUX. There was no security profile for the newly created user on the VCL image, so SELINUX closed the session. Just disabling SELINUX on the base image solved the issue. Thanks again for all your help. I hope this information helps anyone else encountering this problem. Cheers, Al Q From: Al Quiros evq...@fiu.edumailto:evq...@fiu.edu Reply-To: vcl-user@incubator.apache.orgmailto:vcl-user@incubator.apache.org Date: Tue, 15 Nov 2011 21:31:21 + To: vcl-user@incubator.apache.orgmailto:vcl-user@incubator.apache.org vcl-user@incubator.apache.orgmailto:vcl-user@incubator.apache.org Subject: Images and Connections Hello, First of all, thanks for all your help so far. I have been able to create a base image, and create new images based on that. The trouble that I am having now is that when I try to connect to the newly created Linux image, the SSH session closes immediately. When I connect to that image directly on the console, I am able to connect using the credentials supplied by VCL. Any ideas on why that is happening ? Regards, Al Q
Image Creation from Base Image
Hello, I was able to successfully capture a base image. I have added the image to the image group. However, when I try to make a reservation, or create a new image, It eventually fails. I watch the VM server and see that a new VM was started. It does start a new VM with the correct MAC addresses, but Redhat detects these changed MACs as new interfaces, and assigns them to eth2 and eth3. Consequently, they are not brought up, the machine never dhcp's, and the image fails to contact the vcl server. Should I delete the network persistence rules before capturing the image ? Regards, Q
Images and Connections
Hello, First of all, thanks for all your help so far. I have been able to create a base image, and create new images based on that. The trouble that I am having now is that when I try to connect to the newly created Linux image, the SSH session closes immediately. When I connect to that image directly on the console, I am able to connect using the credentials supplied by VCL. Any ideas on why that is happening ? Regards, Al Q
Re: Base Image creation - problems - PLEASE HELP
Hello, Ok, I actually have tried many times to create a base image. I had it with NIC 1 in the private network and NIC 2 in the public previously. Each time, when VCL deletes my base image and copies it back to the VM server, the networks on the nics are REVERSED. When I start with nic 1 in Private and nic 2 in Public, it ends up with nic 1 in Pub and nic 2 in Priv. When I start with nic 1 in Public and nic 2 in Private, it ends up with nic 1 in Priv and nic 2 in Pub. The VM host profile has Virtual Switch 0 in Public and Virtual Switch 1 in Private. The DHCP server is only listening on the private network. I see unauthorized dhcp requests come in for public network IP addresses when VCL swaps them. In the last attempt, I did something new: When the new image is brought up, I manually swapped the nics back before VCL times out waiting for the image to boot. At that time, it declares the image a success, but it should not need such manual intervention. Any clues why VCL decides to swap the networks ? Al Quiros From: Aaron Coburn acob...@amherst.edumailto:acob...@amherst.edu Reply-To: vcl-user@incubator.apache.orgmailto:vcl-user@incubator.apache.org Date: Wed, 2 Nov 2011 16:24:02 -0400 To: vcl-user@incubator.apache.orgmailto:vcl-user@incubator.apache.org Subject: Re: Base Image creation - problems - PLEASE HELP Evelio, in our VCL setup and according to the docs on this page: https://cwiki.apache.org/confluence/display/VCL/Create+a+Windows+Base+Image you may want to try configuring your base image to use NIC 1 for the private network and NIC 2 for the public network. The sshd configuration script that you ran on your base image typically closes ssh access over the public network, but if the two networks are switched, you may have inadvertently cut off access over the private network. Aaron -- Aaron Coburn Systems Administrator and Programmer Academic Technology Services, Amherst College (413) 542-5451 acob...@amherst.edumailto:acob...@amherst.edu On Nov 2, 2011, at 4:09 PM, Evelio Quiros wrote: Hello, Ok, I'm stumped. I continue to have issues creating a base image on our vcl setup. Here is my info: I have a single server running web, management node, and database. It has two interfaces, one on our private network, another on our public net. I have a single Vmware server, also with two interfaces. One interface, switch 0, is on our public net. It also has the VM management net. Another interface, switch 1, is on our private net. Each of these switches is labeled the same as in the Management node config. When I create the base image to capture, I create eth0 (nic1) on the public network, then I create eth1 (nic2) on the private network. I set them both to dhcp, and the dhcp server on the VCL server responds and gives it an address of an image in my database. I set the ssh key in the new image. I set the image in /etc/hosts so that the server can resolve it. I set the vcld to capture the image, It begins and copies the image to the management node directory, It deletes the image from the Vmware server, and re-creates the image All well and good. but then : When the recreated image comes up, The OTHER interface begins asking the dhcp server for addresses in the public network. The dhcp server denies it as non-authoritative, and the private interface never comes up. Even though it was able to dhcp previously. The image creation completes successfully, but the newly created image cannot connect to the vcl server. So, I eventually get a failure message from VCL. WHAT GIVES ??? Here is my error email: reservation failed on image6: process failed after trying to load or make available time: 2011-11-02 16:05:30 caller: State.pm:reservation_failed(213) ( 0) State.pm, reservation_failed (line: 213) (-1) new.pm, process (line: 298) (-2) vcld, make_new_child (line: 568) (-3) vcld, main (line: 346) management node: vcltst.fiu.eduhttp://vcltst.fiu.edu reservation PID: 29777 parent vcld PID: 23341 request ID: 11 reservation ID: 11 request state/laststate: reload/image request start time: 2011-11-02 15:53:56 request end time: 2011-11-02 16:23:56 for imaging: no log ID: none computer: image6 computer id: 7 computer type: virtualmachine computer eth0 MAC address: 00:50:56:00:10:0a computer eth1 MAC address: 00:50:56:00:10:0b computer private IP address: 10.0.0.9 computer public IP address: 10.106.128.59 computer in block allocation: no provisioning module: VCL::Module::Provisioning::VMware::VMware vm host: vcldell01 vm host ID: 1 vm host computer ID: 1 vm profile: VMware ESX - local storage vm profile VM path: /vmfs/volumes/datastore1 vm profile repository path: /images vm profile datastore path: /vmfs/volumes/datastore1 vm profile disk type: localdisk image: vmwarelinux-redhatbase10-v0 image display name: redhatbase image ID: 10
Base Image creation - problems - PLEASE HELP
Hello, Ok, I'm stumped. I continue to have issues creating a base image on our vcl setup. Here is my info: I have a single server running web, management node, and database. It has two interfaces, one on our private network, another on our public net. I have a single Vmware server, also with two interfaces. One interface, switch 0, is on our public net. It also has the VM management net. Another interface, switch 1, is on our private net. Each of these switches is labeled the same as in the Management node config. When I create the base image to capture, I create eth0 (nic1) on the public network, then I create eth1 (nic2) on the private network. I set them both to dhcp, and the dhcp server on the VCL server responds and gives it an address of an image in my database. I set the ssh key in the new image. I set the image in /etc/hosts so that the server can resolve it. I set the vcld to capture the image, It begins and copies the image to the management node directory, It deletes the image from the Vmware server, and re-creates the image All well and good. but then : When the recreated image comes up, The OTHER interface begins asking the dhcp server for addresses in the public network. The dhcp server denies it as non-authoritative, and the private interface never comes up. Even though it was able to dhcp previously. The image creation completes successfully, but the newly created image cannot connect to the vcl server. So, I eventually get a failure message from VCL. WHAT GIVES ??? Here is my error email: reservation failed on image6: process failed after trying to load or make available time: 2011-11-02 16:05:30 caller: State.pm:reservation_failed(213) ( 0) State.pm, reservation_failed (line: 213) (-1) new.pm, process (line: 298) (-2) vcld, make_new_child (line: 568) (-3) vcld, main (line: 346) management node: vcltst.fiu.edu reservation PID: 29777 parent vcld PID: 23341 request ID: 11 reservation ID: 11 request state/laststate: reload/image request start time: 2011-11-02 15:53:56 request end time: 2011-11-02 16:23:56 for imaging: no log ID: none computer: image6 computer id: 7 computer type: virtualmachine computer eth0 MAC address: 00:50:56:00:10:0a computer eth1 MAC address: 00:50:56:00:10:0b computer private IP address: 10.0.0.9 computer public IP address: 10.106.128.59 computer in block allocation: no provisioning module: VCL::Module::Provisioning::VMware::VMware vm host: vcldell01 vm host ID: 1 vm host computer ID: 1 vm profile: VMware ESX - local storage vm profile VM path: /vmfs/volumes/datastore1 vm profile repository path: /images vm profile datastore path: /vmfs/volumes/datastore1 vm profile disk type: localdisk image: vmwarelinux-redhatbase10-v0 image display name: redhatbase image ID: 10 image revision ID: 10 image size: 16384 MB use Sysprep: yes root access: yes image owner ID: 1 image owner affiliation: Local image revision date created: 2011-11-02 15:51:12 image revision production: yes OS module: VCL::Module::OS::Linux user: vclreload user name: vcl reload user ID: 2 user affiliation: Local RECENT LOG ENTRIES FOR THIS PROCESS: 2011-11-02 16:01:20|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:01:20|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 24: code returned false, seconds elapsed/remaining: 360/240, sleeping for 15 seconds 2011-11-02 16:01:35|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 25: waiting for image6 to respond to SSH 2011-11-02 16:01:35|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:01:35|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 25: code returned false, seconds elapsed/remaining: 375/225, sleeping for 15 seconds 2011-11-02 16:01:50|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 26: waiting for image6 to respond to SSH 2011-11-02 16:01:51|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:01:51|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 26: code returned false, seconds elapsed/remaining: 391/209, sleeping for 15 seconds 2011-11-02 16:02:06|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 27: waiting for image6 to respond to SSH 2011-11-02 16:02:07|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:02:07|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 27: code returned false, seconds elapsed/remaining: 407/193, sleeping for 15 seconds 2011-11-02 16:02:22|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 28: waiting
Re: Base Image creation error - VMware host config
Thank you Josh. I found the missing Dojo javascript. Now I get the correct response when adding the VM to the host. Cheers, Q On 10/25/11 10:32 AM, Josh Thompson josh_thomp...@ncsu.edu wrote: -BEGIN PGP SIGNED MESSAGE- Hash: SHA1 If nothing is happening when you click on Configure Host on the Manage Virtual Hosts page, then you are most likely getting a javascript error due to a misconfiguration. Can you check to see if your browser is saying there was an error? If you use Firefox, you can install the Firebug Add-on to really see what is happening. If you can give me an error message, I could probably tell you what is not configured correctly. Josh On Tuesday October 25, 2011, Evelio Quiros wrote: Yes, that was done. The server is set to vmhostinuse. In the Manage Virtual Hosts page, none of the buttons seem to do anything. In troubleshooting this, I did set the state to available, and then found I could not set it back to vmhostinuse. I deleted the entry, and added a new one with a slightly different hostname, but same IP, and set that to vmhostinuse. I added the new hostname to my hosts file. Shall I attempt to add the vSphere perl module with cpan ? Al Quiros On 10/25/11 10:12 AM, Aaron Peeler aaron_pee...@ncsu.edu wrote: When you added your VM host servers did you set the state to vmhostinuse? https://cwiki.apache.org/confluence/display/VCL/VCL+2.2.1+-+Further+Step s+ if+Using+VMware If not then you'll need to go into the mysql cmdline or phpmyadmin and set that server to vmhostinuse state and assigne the vmhost profile. Aaron 2011/10/25 Evelio Quiros evq...@fiu.edu: Thanks again Aaron. It seems we are taking baby steps here. Ok, next issue: When I click Configure Host in the vcl web interface, nothing happens. My Vmware server is listed there. It is in vmhostinuse state. I noticed some complaints below about the Vsphere perl module not being installed. I thought that was done in the install_perl_libraries script. Thanks again for all your help. Al Quiros On 10/25/11 9:43 AM, Aaron Peeler fapee...@ncsu.edu wrote: In the log output your missing the vmhost information. Make sure your vm host server have a correctly configured vmhost profile and you have assigned your vms to the virtual host server by using the Virtual Host interface. https://cwiki.apache.org/confluence/display/VCL/VCL+2.2.1+-+Further+St ep s+ if+Using+VMware https://cwiki.apache.org/confluence/display/VCL/VMware+Configuration I'm not seeing the virtual host interface mentioned directly in the vcl 2.2.1 install guide. I'll added that to the first link. Here is a link from a bootcamp that you can review this morning while I work on adding the content to Apache VCL: https://sites.google.com/a/ncsu.edu/vcl-bootcamp/exercises/assign-a-vm -t o- a-virtual-host Aaron On Tue, Oct 25, 2011 at 9:19 AM, Evelio Quiros evq...@fiu.edu wrote: Thank you Aaron. Ok, now it can resolve the name redhat64base to the private IP address 10.0.0.2. I tried to add the base image as the name linux64. It resides in the database as redhat64base, with the public IP of 10.106.128.53, and set as available. I'm not sure what this error message is telling meŠ VCL::image object could not be created and initialized - -- - time: 2011-10-25 09:30:44 caller: vcld:make_new_child(571) ( 0) vcld, make_new_child (line: 571) (-1) vcld, main (line: 346) - -- - management node: vcltst.fiu.edu reservation PID: 31318 parent vcld PID: 15838 request ID: 4 reservation ID: 4 request state/laststate: image/image request start time: 2011-10-25 09:30:37 request end time: 2011-10-25 10:30:37 for imaging: no log ID: none computer: redhat64base computer id: 12 computer type: blade computer eth0 MAC address: undefined computer eth1 MAC address: undefined computer private IP address: 10.0.0.2 computer public IP address: 10.106.128.53 computer in block allocation: no provisioning module: VCL::Module::Provisioning::VMware::VMware image: vmwarelinux-linux6414-v0 image display name: linux64 image ID: 14 image revision ID: 8 image size: 1450 MB use Sysprep: yes root access: yes image owner ID: 1 image owner affiliation: Local image revision date created: 2011-10-25 09:30:37 image revision production: yes OS module: VCL::Module::OS::Linux user: admin user name: vcl admin user ID: 1 user affiliation: Local - -- - RECENT LOG ENTRIES FOR THIS PROCESS: 2011-10-25 09:30:43|31318|4:4|image|VMware.pm:get_vmhost_api_object(1293)|attemp ti ng to load VMware control module: VCL::Module::Provisioning::VMware::vSphere_SDK 2011-10-25 09:30
vcld setup error
Hello, I am working on the initial vcld setup. This is what we get when we run vcld —setup: [root@vcltst bin]# ./vcld --setup VCL Management Node Setup Select a module to configure: [vcld] Make a selection (1, 'c' to cancel): 1 *** Choice must be an integer between 1 and 0 *** Any choice except c will result in the above error. Any ideas why this is happening ? Regards, Al Quiros Unix System Administrator Florida International University